Readability-like JavaScript library


Is there a (client-side) JavaScript library for doing Readability/Instapaper-style uncluttering of HTML? That is, something that takes the HTML for a full web page and, using some heuristics, unclutters it by removing headings, sidebars, etc., leaving just the core article. I'm looking for something like clean(htmlPageString).

I think Readability itself is implemented in JavaScript, but there doesn't seem to be a standalone library for it.


There are 3 answers

bitstream (best answer)

Here is Readability, packaged as a Node module: node-readability. It works well, but not in every case, so I am currently looking for alternatives.

Umesh Patil

From what I have read, it's built with JavaScript and JSON: it stores each user's reading list in a queue, in the form of JSON. It keeps track of the webpage requests along with the time, date, and some other specific information.

tlattimore

I think this JavaScript library is along the lines of what you are looking for. It is a standalone version of the library used to generate Firefox's Reader View.

https://github.com/mozilla/readability
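
For the clean(htmlPageString) shape asked about in the question, a minimal sketch on top of that library might look like the following. The names `clean` and `browserParseHtml` are my own (hypothetical), and I am assuming a browser where `DOMParser` is available. `Readability` is the library's constructor, passed in as an argument so the sketch makes no assumption about how you load it. As far as I know, `parse()` returns null when no article can be extracted, so the helper passes that through.

```javascript
// Hypothetical helper, not part of mozilla/readability itself.
// Readability: the constructor provided by the library.
// parseHtml:   turns an HTML string into a Document. The default assumes a
//              browser; elsewhere, supply your own parser function.
function clean(htmlPageString, Readability, parseHtml = browserParseHtml) {
  // Readability works on a DOM Document, not on a raw string.
  const doc = parseHtml(htmlPageString);

  // parse() runs the extraction heuristics; it can return null on failure.
  const article = new Readability(doc).parse();
  if (!article) {
    return null;
  }

  // Keep only the fields most people want: the title, the cleaned-up
  // article HTML, and the plain text with markup stripped.
  return {
    title: article.title,
    content: article.content,
    textContent: article.textContent,
  };
}

// Default parser: the standard browser DOMParser API.
function browserParseHtml(html) {
  return new DOMParser().parseFromString(html, "text/html");
}
```

In a page that has already loaded the library, this would be called as `clean(htmlPageString, Readability)`. Because the document is freshly parsed from the string, it does not matter that Readability modifies the document it is given.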