How to parse html with xpath?

174 views Asked by At

I ran into a few questions on StackOverflow that asked about parsing html with xpath using Java.

This is the best answer I found so far here.

But it looks like DomSerializer is no longer available in Java 11.

How can I use DomSerializer in Java 11 ?

1

There are 1 answers

0
Anish B. On BEST ANSWER

Add the below dependency to the pom.xml :

<dependency>
    <groupId>net.sourceforge.htmlcleaner</groupId>
    <artifactId>htmlcleaner</artifactId>
    <version>2.6.1</version>
</dependency>

Or

You can download htmlcleaner-2.6.1.jar from here.

That jar (or artifact) has the DomSerializer class within it.

Link to read docs : http://htmlcleaner.sourceforge.net/doc/org/htmlcleaner/DomSerializer.html