We are developing a Java application that fetches documents from FileNet (any document type can be stored there) and displays them in PDF format. The issue is that I need a tool for converting all MS Office document types (doc, docx, ppt, pptx,...) to PDF. I've tried Apache POI and iText, but they only handle the Office 2007 formats, and the output is plain text without any formatting or images.
Second, for image documents I have already converted the images to PDF, but I also need the annotations made in the IBM applet image viewer to be printed on the image. How can I get that?
I need an open-source/free solution; any non-free solution (like Snowbound, adptel) will be rejected by the customer.
Any support will be appreciated.
 
                        
Check out docx4j: http://www.docx4java.org/trac/docx4j
It can handle .docx and .pptx exports, and the documentation gives some info on how to handle older .doc files.
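
Since the newer and older formats need different handling, one way to wire this into the application is to pick a conversion path from the file extension first. The following is only a minimal sketch using the plain JDK: `OfficeFormatRouter`, `Route`, and `route` are hypothetical names made up for illustration, not part of docx4j or FileNet, and the actual conversion calls are left out.

```java
import java.util.Locale;

// Hypothetical helper (not part of docx4j or FileNet): decides which
// conversion path a document retrieved from the repository should take.
final class OfficeFormatRouter {

    // OOXML: the Office 2007+ formats (.docx, .pptx).
    // LEGACY_BINARY: older binary formats (.doc, .ppt) that need separate handling.
    // UNSUPPORTED: anything this sketch does not know about.
    enum Route { OOXML, LEGACY_BINARY, UNSUPPORTED }

    static Route route(String fileName) {
        int dot = fileName.lastIndexOf('.');
        // No extension, or a name ending in a dot: nothing to decide on.
        if (dot < 0 || dot == fileName.length() - 1) {
            return Route.UNSUPPORTED;
        }
        // Compare case-insensitively, independent of the default locale.
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        switch (ext) {
            case "docx":
            case "pptx":
                return Route.OOXML;
            case "doc":
            case "ppt":
                return Route.LEGACY_BINARY;
            default:
                return Route.UNSUPPORTED;
        }
    }
}
```

With this in place, `OfficeFormatRouter.route("Report.DOCX")` yields `Route.OOXML`, so the file can go to the docx4j-based export, while a `.doc` file yields `Route.LEGACY_BINARY` and is sent to whatever approach you settle on for the older formats.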