Downloading pdf files using playwright-python

Question

Downloading pdf files using playwright-python

3.7k views Asked by FarNorth At 08 December 2020 at 02:47

I'm trying to download PDF files that are rendered in a browser (not shown as a popup or downloaded) using playwright (Python). No URL is exposed, so you can't simply scrape a link and download it using requests.get("file_url").

I've tried:

async def main():
    async with async_playwright() as p:
        browser = await p.chromium.launch(headless=False)
        page = await browser.newPage(acceptDownloads=True)
    
        await page.goto("www.some_landing_page.com")
            
        async with page.expect_download() as download_info:
            await page.click("a")     # selector to a pdf file
        
        download = download_info.value
        path = download.path()

I've also tried page.expect_popup() with no luck either. My understanding is that this can't be done using pyppeteer, but would welcome a solution this way as well, if possible.

Original Q&A

There are 1 answers

**FarNorth** · Accepted Answer · 2021-10-04T18:57:29+00:00

FarNorth On 04 October 2021 at 18:57 BEST ANSWER

For anyone with a similar problem, try using firefox or webkit instead of chromium for the browser. Provided a work-around for me.

TechQA.

Downloading pdf files using playwright-python

There are 1 answers

Related Questions in PYTHON-3.X

Related Questions in PLAYWRIGHT

Related Questions in PYPPETEER

Related Questions in PLAYWRIGHT-PYTHON

Popular Questions

Popular Tags

Trending Questions