News aggregator for sentiment analysis


I am writing a small news sentiment analysis app in Python. I want to build a database of news articles to train my classifier on, so I am wondering what the best way is to fetch news articles from the web. I looked at newspaper, which looks like a nice and very generic module, but what I need is a way to fetch old news articles, for example all news articles from 2014. newspaper only uses RSS feeds, which never go very far back. Other options would be writing a scraper for Google News and filtering by date in the URL, or using the APIs of publishers like the NYT (they have an API).

What is the best way to create a news article database like this? Is there a tool/database on the web I can use to get the articles?
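
To make the NYT option concrete, here is a rough sketch of what I have in mind. It assumes the NYT Archive API serves one JSON file per month at the URL pattern below and that each response looks like `{"response": {"docs": [...]}}` with `pub_date`, `headline.main` and `web_url` fields; I have not verified these details against the current docs. The function names (`month_urls`, `extract_articles`, `fetch_year`) and the `api_key` parameter are my own, hypothetical choices.

```python
import json
import urllib.request

# Assumed URL pattern for the NYT Archive API (one JSON file per month).
# Check the NYT developer documentation before relying on it.
ARCHIVE_URL = "https://api.nytimes.com/svc/archive/v1/{year}/{month}.json?api-key={key}"


def month_urls(year, api_key):
    """Build one archive URL per month of the given year."""
    return [ARCHIVE_URL.format(year=year, month=m, key=api_key) for m in range(1, 13)]


def extract_articles(payload):
    """Turn one parsed archive response into a list of small records.

    Assumes the payload has the shape {"response": {"docs": [...]}};
    missing fields fall back to empty strings.
    """
    records = []
    for doc in payload.get("response", {}).get("docs", []):
        records.append({
            "date": (doc.get("pub_date") or "")[:10],  # keep YYYY-MM-DD only
            "headline": (doc.get("headline") or {}).get("main", ""),
            "url": doc.get("web_url", ""),
        })
    return records


def fetch_year(year, api_key):
    """Download every month of a year and return all article records."""
    articles = []
    for url in month_urls(year, api_key):
        with urllib.request.urlopen(url) as resp:
            articles.extend(extract_articles(json.load(resp)))
    return articles
```

Calling `fetch_year(2014, "my-key")` would then give headlines and URLs for 2014, but not necessarily the full article text, so I would probably still need something like newspaper to download and parse each URL. The API is presumably rate limited as well, so a pause between requests may be necessary.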


There are 0 answers