Converting all document types(MS DOCs+TIFFs+JPGs) stored on Filenet CE to pdf

2.1k views Asked by At

Currently we are developing a Java application to grab documents (all types can be stored on filenet) from Filenet and displaying it in pdf format, the issue is that I need a tool for converting all MS doc types (doc, docx, ppt, pptx,...) to pdf. I've tried Apache POI and iText but they just convert the Office 2007 formats and it just gets a plain text without any formatting and without images.

Second in image documents I already converted the images to pdf but I need to get the annotations made by IBM applet image viewer printed on the image. How can I get that?

I need to an opensource/free solution, any not-free solution (like Snowbound, adptel) will be rejected by the customer.

Any support will be appreciated

1

There are 1 answers

0
MatsT On BEST ANSWER

Check out docx4java, http://www.docx4java.org/trac/docx4j

Can handle .docx and .pptx exports and the documentation gives some info on how to handle older .doc files.