I'm using newspaper3k inside Scrapy parse method. I want to extract links but I don't want to fetch the website again.
Is it possible to use this:
newspaper.build(..)
with plain html so I can call .articles than?
I'm using newspaper3k inside Scrapy parse method. I want to extract links but I don't want to fetch the website again.
Is it possible to use this:
newspaper.build(..)
with plain html so I can call .articles than?
I found this solution: