Linked Questions

Popular Questions

Parsing News Urls with Readability

Asked by At

I am trying to automate the extraction of data from news urls with something like https://zapier.com or https://ifttt.com.

The main goal is to get a pretty version of news urls' text into a google spreadsheet. Trying to use function like "importXML" fail cause I need to analyse urls with different structures.

I was thinking about to use something like readability.js https://code.google.com/archive/p/arc90labs-readability/downloads to get the html into pretty text, since zapier allows code fragments in javascript or python.

The problem is in zapier you cannot use modules that are not standard to these programing languages (like beautifulsoup) to scrape the urls.

Is there any chance I can pass an url to the code of the readability.js https://storage.googleapis.com/google-code-archive-downloads/v2/code.google.com/arc90labs-readability/readability.js or make an api call to https://mercury.postlight.com/web-parser/ that returns a pretty text?

Related Questions