I am trying to extract the background image of a PDF page to an SVG (using the xpdf library). The problem is that the PDF contains additional images/graphics (presumably outside the cropbox) that PDF readers do not render, but the corresponding SVG does contain them.

I tried setting the viewBox attribute of the SVG to the cropbox bounds of that PDF page, but the resulting SVG still displays some graphics objects that are not rendered in the PDF. I also tried adding a clip path to the SVG, a rectangular clipping region with bounds corresponding to the PDF cropbox, but this did not eliminate all of the additional graphics elements not seen in the PDF either.

Any idea what the problem could be? What is the right way to carry the PDF cropbox over to SVG? Btw, the SVGs generated in both cases (viewBox and clipping region) were fairly close in dimensions to the viewable area of the PDF page, and the additional elements appeared only close to the edges. Is it that the cropbox dimensions obtained from the PDF should not be used directly in the SVG?
Discrepancy between PDF cropbox and SVG created out of a PDF page
233 views Asked by so2 At
There is 1 answer
It turns out the problem was that my code was not transforming the PDF cropbox (as given by xpdf) to user coordinates using the CTM, the current transformation matrix (also obtainable through xpdf). After applying the transformation, the resulting SVG matches the rendered portion of the PDF page.
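For illustration, here is a minimal sketch of that transformation in Python. It is not xpdf code: it assumes the CTM is available as the six numbers `[a, b, c, d, e, f]` of the usual PDF matrix convention and the cropbox as `(x1, y1, x2, y2)`. The function names and the sample values are hypothetical.

```python
from typing import Sequence, Tuple

Rect = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def transform_point(ctm: Sequence[float], x: float, y: float) -> Tuple[float, float]:
    """Apply a PDF-style matrix [a, b, c, d, e, f] to a single point."""
    a, b, c, d, e, f = ctm
    return (a * x + c * y + e, b * x + d * y + f)


def transform_rect(ctm: Sequence[float], rect: Rect) -> Rect:
    """Transform all four corners and return their axis-aligned bounds.

    Using all four corners (not just two) keeps the result correct when
    the matrix flips an axis or rotates the page.
    """
    x1, y1, x2, y2 = rect
    corners = [transform_point(ctm, x, y) for x in (x1, x2) for y in (y1, y2)]
    xs = [p[0] for p in corners]
    ys = [p[1] for p in corners]
    return (min(xs), min(ys), max(xs), max(ys))


def svg_view_box(ctm: Sequence[float], crop_box: Rect) -> str:
    """Build an SVG viewBox string from the transformed cropbox."""
    x_min, y_min, x_max, y_max = transform_rect(ctm, crop_box)
    return f"{x_min:g} {y_min:g} {x_max - x_min:g} {y_max - y_min:g}"


if __name__ == "__main__":
    # Hypothetical values: a cropbox offset from the page origin and a
    # matrix that flips the y axis and shifts the cropbox corner to (0, 0).
    crop_box = (10.0, 20.0, 310.0, 420.0)
    ctm = [1.0, 0.0, 0.0, -1.0, -10.0, 420.0]
    print(svg_view_box(ctm, crop_box))
```

The same transformed rectangle can be used for a clip path instead of the viewBox. The point is that the untransformed cropbox and the coordinates written to the SVG live in different coordinate systems, which would explain why stray elements showed up only near the edges.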