Extract text form pdf using Foxit SDK

Question

Extract text form pdf using Foxit SDK

2k views Asked by Tushar Agarwal At 27 January 2012 at 05:51

I am using Foxit SDK to extract the text from Pdf document .

Everything is okay but when I extract a pdf in other languages rather than English I don't get the correct output .

I have also used PDFBox in java but that gives me the worst output, output from Foxit SDK is better than PDFBox.

Are there ant other libraries which can solve the issue..? Or there is some other solution.

Original Q&A

There are 3 answers

**Andrew Cash** · Answer 1 · 2012-01-27T11:43:21+00:00

You might want to try the trial version of Quick PDF Library to see how it performs on your documents. http://www.quickpdflibrary.com

QP.GetPageText(7) or GetPageText(8) returns pretty good results for most PDF files.

Andrew.

Disclaimer: I do some consulting work for Quick PDF Library.

**MyKuLLSKI** · Answer 2 · 2012-01-27T06:05:51+00:00

MyKuLLSKI On 27 January 2012 at 06:05

Personally if you want it done right you have to pay for it. ComponentOne has a PDFViewer for WPF. Not sure what framework your working with since your tag is missing one.

ComponentOne PDF Viewer for WPF

**Moody Ibrahim Moody** · Answer 3 · 2013-04-16T12:49:59+00:00

If you are on windows, you can use the IFilter that adobe provides. Me, I used the IFilter adobe provides with the adobe reader 8. Here is a link to the exact example I used

http://www.codeproject.com/Articles/13391/Using-IFilter-in-C

The performance was okay (I think. I haven't used many other methods). Takes about 15 sec for a 400 page PDF.

TechQA.

Extract text form pdf using Foxit SDK

There are 3 answers

Related Questions in C#

Related Questions in JAVA

Related Questions in PDF

Related Questions in PDFTOTEXT

Related Questions in FOXIT

Popular Questions

Popular Tags

Trending Questions