Convert .doc/.docx documents to .odt (OpenDocument Text) and vice versa using Java


Is there a Java library that can convert Microsoft Word files (.doc/.docx) to OpenDocument Text format (.odt) and back? A free library would be preferable.


There are 2 answers

0
Constantijn Visinescu On BEST ANSWER

I don't know of any library that does it directly, but it should be relatively easy to extract the bits you're interested in from a .docx using Apache POI:

http://poi.apache.org/

and then write them out in ODT format using ODFDOM:

http://incubator.apache.org/odftoolkit/odfdom/index.html

This should be relatively straightforward for simple documents, but if your use case calls for complex documents containing pictures etc., this might become a LOT harder.
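
As a rough illustration of the simple case, here is a minimal sketch that copies only the paragraph text of a .docx into a new .odt. It assumes Apache POI (poi-ooxml) and ODFDOM are on the classpath. The class name DocxToOdtSketch and the method name convert are made up for this example. Formatting, tables, headers, and images are silently dropped, which is exactly where the real work would start.

```java
import java.io.FileInputStream;
import java.io.InputStream;

import org.apache.poi.xwpf.usermodel.XWPFDocument;
import org.apache.poi.xwpf.usermodel.XWPFParagraph;
import org.odftoolkit.odfdom.doc.OdfTextDocument;

// Hypothetical helper class, not part of POI or ODFDOM.
public class DocxToOdtSketch {

    // Copies plain paragraph text from a .docx file into a new .odt file.
    // Everything except the text of body paragraphs is lost.
    public static void convert(String docxPath, String odtPath) throws Exception {
        try (InputStream in = new FileInputStream(docxPath);
             XWPFDocument docx = new XWPFDocument(in)) {
            // Start from ODFDOM's empty text document template.
            OdfTextDocument odt = OdfTextDocument.newTextDocument();
            for (XWPFParagraph paragraph : docx.getParagraphs()) {
                // One ODT paragraph per Word paragraph.
                odt.newParagraph(paragraph.getText());
            }
            odt.save(odtPath);
        }
    }

    public static void main(String[] args) throws Exception {
        // Usage: java DocxToOdtSketch input.docx output.odt
        convert(args[0], args[1]);
    }
}
```

Binary .doc files would need POI's HWPF classes instead of XWPF, and the reverse direction (ODT to DOCX) would mean reading with ODFDOM and writing with POI.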

Anyway, hope this helps at least some ;)

0
Guillaume On

I believe everything you need is in this post: http://angelozerr.wordpress.com/2012/12/06/how-to-convert-docxodt-to-pdfhtml-with-java/

For instance:

JODConverter: automates conversions between office document formats using OpenOffice.org or LibreOffice. Supported formats include OpenDocument, PDF, RTF, HTML, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a web application.
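
JODConverter relies on an installed OpenOffice.org or LibreOffice to do the actual conversion, and its class names differ between versions, so I won't guess at its API here. To show the underlying idea, here is a minimal sketch that calls LibreOffice's headless command-line converter directly from Java using only the standard library. It assumes the soffice executable is on the PATH. The class OfficeConvertSketch and its methods are made up for this example and are not part of JODConverter.

```java
import java.io.File;
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class; it shells out to LibreOffice, it is not JODConverter.
public class OfficeConvertSketch {

    // Builds the LibreOffice command line. Assumes "soffice" is on the PATH.
    public static List<String> buildCommand(File input, String targetFormat, File outputDir) {
        return Arrays.asList(
                "soffice", "--headless",
                "--convert-to", targetFormat,   // e.g. "odt" or "docx"
                "--outdir", outputDir.getPath(),
                input.getPath());
    }

    // Runs the conversion and returns the exit code of the LibreOffice process.
    public static int convert(File input, String targetFormat, File outputDir)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(input, targetFormat, outputDir))
                .inheritIO()   // show LibreOffice's own messages on the console
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        // Usage: java OfficeConvertSketch report.docx odt outdir
        System.exit(convert(new File(args[0]), args[1], new File(args[2])));
    }
}
```

Because the office suite does the conversion, this route handles .doc, .docx and .odt in both directions and keeps far more formatting than copying text by hand, at the cost of needing LibreOffice installed on the machine. JODConverter adds things like managing the office process for you on top of the same idea.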