I am hoping to write a program that takes a URL as input and searches the text of that webpage to pull out relevant data.
Essentially, my plan is to have the program search the webpage for certain strings, then find the numbers on the same line as each match and record them in another file. Luckily, all of the data I need is in tables on the webpage, so I assumed that would make it easier to extract.
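To make the matching step concrete, here is a rough sketch of what I have in mind. Python is just my assumption here; I am not tied to any language. It also assumes the page's visible text is already available as plain lines, which is exactly the part I do not know how to do. The function name `extract_numbers` and the sample labels are made up for illustration.

```python
import re

# Matches integers and decimals, with an optional sign and thousands separators.
NUMBER_RE = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_numbers(lines, keywords):
    """Return a (keyword, numbers) pair for each line that contains a keyword."""
    results = []
    for line in lines:
        for keyword in keywords:
            if keyword in line:
                # Strip thousands separators before converting to float.
                numbers = [
                    float(match.replace(",", ""))
                    for match in NUMBER_RE.findall(line)
                ]
                results.append((keyword, numbers))
    return results


# Hypothetical sample text standing in for the real page content.
sample = [
    "Revenue 1,200 1,350.5",
    "Some unrelated line 42",
    "Expenses 800 910",
]
rows = extract_numbers(sample, ["Revenue", "Expenses"])
```

The resulting `rows` could then be written to another file, for example as CSV.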
The problem is that I do not know where to start. I have been searching the web for answers, but the only thing people tend to recommend is an HTML parser. As far as I can tell, that would not help, since the HTML code is not what I am looking for and does not seem to include the data I need. Having said that, simply pointing me in the right direction would be very helpful.
Thank you in advance for your help.