get the data from html frames

Question

get the data from html frames

1.6k views Asked by agstudy At 23 January 2014 at 14:01

I try to get the tables information in this html frames. I mean the tables having columns:

Year,Month,Oil Production m3,Gas Production Ksm3,...

Using beautifulSoup, This is so far what I have tried:

from bs4 import BeautifulSoup
from urllib import urlopen, urlretrieve, quote

url_base = 'https://www.og.decc.gov.uk/information/wells/pprs/Well_production_onshore_oil_fields/onshore_oil_fields_by_well/onshore_oil_fields_by_wel.html'
u = urlopen(url_base)
html = u.read().decode('utf-8')
u.close()
soup = BeautifulSoup(html)

But this retrieve only the main page info, not the page frames. When I change the url base by the frame link , it tells me that the requested page is outdated.

Original Q&A

There are 1 answers

**Ray** · Accepted Answer · 2014-01-23T14:13:01+00:00

I think you are copying the wrong url. When I use the following, it worked.

url_base = 'https://www.og.decc.gov.uk/information/wells/pprs/Well_production_onshore_oil_fields/onshore_oil_fields_by_well/0.htm'

Attention: it is .../onshore_oil_fields_by_well/0.htm,

instead of .../onshore_oil_fields_by_well/0.html

TechQA.

get the data from html frames

There are 1 answers

Related Questions in PYTHON

Related Questions in BEAUTIFULSOUP

Related Questions in SCREEN-SCRAPING

Related Questions in HTML-FRAMES

Popular Questions

Popular Tags

Trending Questions