guide to setup crawler4j

2.7k views Asked by At

I would like to setup the crawler to crawl a website, let say blog, and fetch me only the links in the website and paste the links inside a text file. Can you guide me step by step for setup the crawler? I am using Eclipse.

1

There are 1 answers

0
Ahamed On

Jsoup will do everything you need wrt html parsing. Jsoup is a java api for handling html source code. You can get

  1. Table, with which you can parse each and every row or column.
  2. List of all the links and source imports to that html(imports like css and js files).
  3. Data of particular tag.

and more.

For your purpose, here is the example code.

Hope this will help you.