The method createDOM not return document

Question

The method createDOM not return document

299 views Asked by AudioBubble At 13 September 2013 at 17:51

I use HtmlCleaner 2.6.1 and Xpath to parse html page in Android application. Here html page:

http://www.kino-govno.com/comments/42571-postery-kapitan-fillips-i-poslednij-rubezh
http://www.kino-govno.com/comments/42592-fantasticheskie-idei-i-mesta-ih-obitanija

The first link return document, is all right.The second link here in this place:
```
document = domSerializer.createDOM(tagNode);
```
returns nothing.

If you create a simple java project without android. That all works fine.

Here is the Code :

        String queries = "//div[starts-with(@class, 'news_text op')]/p";            
        URL url = new URL(link2);
        TagNode tagNode = new HtmlCleaner().clean(url);
        CleanerProperties cleanerProperties = new CleanerProperties();
        DomSerializer domSerializer = new DomSerializer(cleanerProperties);
        document = domSerializer.createDOM(tagNode);
        xPath = XPathFactory.newInstance().newXPath();
        pageNode = (NodeList)xPath.evaluate(queries,document, XPathConstants.NODESET);
        String val = pageNode.item(0).getFirstChild().getNodeValue();

Original Q&A

There are 1 answers

**Jens Erat** · Answer 1 · 2013-09-13T21:59:06+00:00

Jens Erat On 13 September 2013 at 21:59

That's because HtmlCleaner wraps the paragraphs of the second HTML page into another <div/>, so it is not a direct child any more. Use the descendent-or-self-axis // instead of the child-axis /:

//div[starts-with(@class, 'news_text op')]//p

TechQA.

The method createDOM not return document

There are 1 answers

Related Questions in JAVA

Related Questions in ANDROID

Related Questions in XPATH

Related Questions in HTMLCLEANER

Popular Questions

Popular Tags

Trending Questions