Hi I have a task to build an application that display news from various websites(BBC News, CNN, etc)
I came up with 2 ideas to either parse an RSS Feed of the news site or parse the html pages of each news article.
However after researching abit on RSS feeds i found out it is hard to parse an image from mainly because not all rss feeds have images.
Therefore what do you recommend as a good HTML document parser which i can extract the Title, Description, Data and Image of the news article.