Extract PDF Content Including Images For RAG

Question

Extract PDF Content Including Images For RAG

152 views Asked by Niyooooo At 26 February 2024 at 09:43

I am trying to build a PDF content extraction and chunking system for RAG in my application. I need to include images from pdf as urls,so that the llm can use that images in the response most of the solutions that i have seen only extract text content from pdf.Is there any way to extract images and text from pdf ?

Original Q&A

There are 1 answers

**Nick Magnanini - preprocess.co** · Answer 1 · 2024-03-07T08:25:59+00:00

Nick Magnanini - preprocess.co On 07 March 2024 at 08:25

PyMuPDF allows you to do that for images and tables

TechQA.

Extract PDF Content Including Images For RAG

There are 1 answers

Related Questions in PDF

Related Questions in PDF-GENERATION

Related Questions in INFORMATION-RETRIEVAL

Related Questions in LARGE-LANGUAGE-MODEL

Related Questions in RETRIEVAL-AUGMENTED-GENERATION

Popular Questions

Trending Questions