extracting values from html table using beautifulsoup4 (2nd row onwards, 1st and 6th column)

Question

extracting values from html table using beautifulsoup4 (2nd row onwards, 1st and 6th column)

747 views Asked by Lawren At 18 December 2013 at 16:53

I am new to python and need some guidance on extracting values from specific cells from a HTML table.

The URL that I am working on can be found here

I am looking to get the first 5 values only in the Month and Settlement columns and subsequently display them as:

"MAR 14:426'6"

Problem that I am facing is:

How do I get the loop to start from the 3rd "TR" in the table
How to get only values for td[0] and td[6].
How to restrict the loop to only retrieve values for 5 rows

This is the code that I am working on:

tableData = soup1.find("table", id="DailySettlementTable")
for rows in tableData.findAll('tr'):
    month = rows.find('td')
    print month

Thank you and appreciate any form of guidance!

Original Q&A

There are 1 answers

**Chris** · Accepted Answer · 2013-12-18T17:33:01+00:00

You probably want to use slicing.

Here's a modified snippet for your code:

table = soup.find('table', id='DailySettlementTable')

# The slice notation below, [2:7], says to take the third (index 2)
# to the eighth (index 7) values from the rows we get.
for rows in table.find_all('tr')[2:7]:
    cells = rows.find_all('td')
    month = cells[0]
    settle = cells[6]

    print month.string + ':' + settle.string

TechQA.

extracting values from html table using beautifulsoup4 (2nd row onwards, 1st and 6th column)

There are 1 answers

Related Questions in PYTHON

Related Questions in BEAUTIFULSOUP

Related Questions in HTML-TABLEEXTRACT

Popular Questions

Trending Questions