Analyzing OTC Ingredient Substances

Looping through files and extracting the ingredient substances
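A minimal sketch of this step, using only the standard library. The XML layout is an assumption, not something the source states: each substance is taken to be an `ingredientSubstance` element with its name in a `name` child, possibly inside a namespace. The function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def local_name(tag):
    """Strip a namespace prefix such as '{some-namespace}' from a tag name."""
    return tag.rsplit("}", 1)[-1]


def extract_ingredient_substances(xml_path):
    """Return the substance names found in one XML file (assumed layout)."""
    names = []
    root = ET.parse(xml_path).getroot()
    for element in root.iter():
        # Assumption: each substance is an <ingredientSubstance> element
        # that carries its name in a <name> child.
        if local_name(element.tag) != "ingredientSubstance":
            continue
        for child in element:
            if local_name(child.tag) == "name" and child.text:
                names.append(child.text.strip())
    return names


def extract_from_directory(directory):
    """Loop over every .xml file in a directory, skipping unparseable ones."""
    results = {}
    for xml_path in sorted(Path(directory).glob("*.xml")):
        try:
            results[xml_path.name] = extract_ingredient_substances(xml_path)
        except ET.ParseError:
            results[xml_path.name] = []  # malformed file: record no substances
    return results
```

Matching on the local tag name avoids hard-coding a namespace, so the same loop works whether or not the files declare one.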

Making a dictionary of the ingredient substances
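One way to build such a dictionary is sketched here, assuming the goal is a mapping from substance name to the number of files that list it. The source does not say what the dictionary's values are, so this counting rule and the name `build_substance_counts` are assumptions.

```python
from collections import Counter


def build_substance_counts(substances_per_file):
    """Map each substance name to the number of files that list it."""
    counts = Counter()
    for names in substances_per_file:
        # set() ensures a substance repeated within one file is counted once.
        counts.update(set(names))
    return dict(counts)


example = build_substance_counts([["MENTHOL", "MENTHOL"], ["MENTHOL"]])
# example == {"MENTHOL": 2}
```

Replacing `set(names)` with `names` would count every occurrence instead of every file.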

Converting the dictionary to a data frame and extracting the five largest values
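A short sketch of the conversion with pandas, assuming the dictionary maps substance names to integer counts. The column names `substance` and `count` and the helper `top_substances` are hypothetical.

```python
import pandas as pd


def top_substances(counts, n=5):
    """Turn a {substance: count} dict into a DataFrame and keep the n largest."""
    frame = pd.DataFrame(list(counts.items()), columns=["substance", "count"])
    # Force an integer dtype so nlargest also works on an empty dictionary.
    frame = frame.astype({"count": "int64"})
    return frame.nlargest(n, "count").reset_index(drop=True)
```

`DataFrame.nlargest` returns the rows already sorted in descending order of the chosen column, so no separate sort is needed.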

Analyzing 1000 files from the OTC archive1 directory
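Restricting a run to a fixed number of files can be sketched as below. The source does not say how the files were selected, so taking them in sorted name order is an assumption, and `first_xml_files` is a hypothetical helper.

```python
from pathlib import Path


def first_xml_files(directory, limit):
    """Return at most `limit` .xml paths from a directory, in name order."""
    # Sorting makes the selection reproducible from one run to the next.
    paths = sorted(Path(directory).glob("*.xml"))
    return paths[:limit]
```

The same helper covers larger samples by changing `limit`, for example `first_xml_files(directory, 4000)`.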

Analyzing the first 4000 files from the OTC archive1 directory

Analyzing all of the ingredient substances of the medicines in the OTC archive1 directory

Analyzing all of the OTC XML files in all ten archive directories [part 1]
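Walking several archive directories in one pass might look like the sketch below. The directory naming is an assumption: the default pattern `archive*` is a guess based on the name archive1, and `iter_archive_files` is a hypothetical helper.

```python
from pathlib import Path


def iter_archive_files(base_dir, pattern="archive*"):
    """Yield every .xml path under each directory matching `pattern`."""
    for archive_dir in sorted(Path(base_dir).glob(pattern)):
        if not archive_dir.is_dir():
            continue  # ignore stray files that happen to match the pattern
        yield from sorted(archive_dir.glob("*.xml"))
```

Because this is a generator, the paths are produced one at a time, which keeps memory use flat even when the directories hold many files.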

Analyzing all of the OTC XML files in all ten archive directories [part 2]
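The source does not say how parts 1 and 2 divide the work. If each part produces its own substance-count dictionary, the partial results could be combined as in this sketch. The helper `merge_counts` is hypothetical.

```python
from collections import Counter


def merge_counts(*partial_counts):
    """Add together several {substance: count} dictionaries."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)  # Counter.update adds counts rather than replacing
    return dict(total)
```

For example, `merge_counts({"A": 1, "B": 2}, {"B": 3})` returns `{"A": 1, "B": 5}`.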