grabbing value from html by xpath in python

Question

grabbing value from html by xpath in python

63 views Asked by MLSC At 10 June 2015 at 05:37

I want to use xpath for grabbing WhatIwant phrase:

a="<b>AAA:</b> BBB<br/><br/><img src='line.gif' /><br/><br/><b><font size='2'>Text: </b>WahtIwant</font><br/><center>"

I want to grab WahtIwant from a:

tree=html.fromstring(a)
tree.xpath('//font[@size="2"]/text()')
['Text: ']

Original Q&A

There are 2 answers

har07 On 10 June 2015 at 06:06

In xpath point of view, the text you want is following-sibling of the <b> element that is parent of font[@size="2"] :

tree.xpath('//font[@size="2"]/parent::b/following-sibling::text()')

or, you can use xpath that select <b> element having child font with size attribute equals 2, and then select text node following that <b> :

tree.xpath('//b[font/@size="2"]/following-sibling::text()')

**falsetru** · Accepted Answer · 2015-06-10T05:40:13+00:00

Using lxml and tail property (text that directly follows the element) of the element.

>>> import lxml.html
>>> 
>>> a = "<b>AAA:</b> BBB<br/><br/><img src='line.gif' /><br/><br/><b><font size='2'>Text: </b>WahtIwant</font><br/><center>"
>>> root = lxml.html.fromstring(a)
>>> [x.tail for x in root.xpath('//font[@size="2"]/parent::b')]
['WahtIwant']

TechQA.

grabbing value from html by xpath in python

There are 2 answers

Related Questions in PYTHON

Related Questions in HTML

Related Questions in XPATH

Popular Questions

Popular Tags

Trending Questions