Using Zend Lucene to search Office 2003 or older files

I know there are already objects supporting Office 2007 files, but is there any native support for Office 2003 or earlier?

There are 2 answers

Pascal MARTIN (best answer)

There doesn't seem to be anything bundled with Zend_Search_Lucene for those.

Still, since it can index HTML documents, if you can find a way to convert your Office 2003 documents to HTML (at least for indexing, keeping the original version alongside the HTML one for consultation), you might be able to index those.
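
Here is a minimal sketch of that convert-then-index idea. It assumes a headless LibreOffice (or OpenOffice) install that provides the `soffice` binary, and Zend Framework 1 on the include path. The helper names `buildConvertCommand()` and `indexOfficeFile()` and the `original` field name are hypothetical. The Zend calls used are `Zend_Search_Lucene_Document_Html::loadHTMLFile()`, `Zend_Search_Lucene_Field::unIndexed()` and `addDocument()`.

```php
<?php
// Assumption: Zend Framework 1 is on the include path.
// require_once 'Zend/Search/Lucene.php';

/**
 * Build the shell command that converts one Office file to HTML.
 * Assumption: a headless LibreOffice/OpenOffice install provides `soffice`.
 */
function buildConvertCommand($officeFile, $outDir)
{
    return 'soffice --headless --convert-to html --outdir '
        . escapeshellarg($outDir) . ' ' . escapeshellarg($officeFile);
}

/**
 * Convert a .doc/.xls/.ppt file to HTML and add the HTML to the index.
 * The original path is stored (not indexed) so results can link back to it.
 */
function indexOfficeFile(Zend_Search_Lucene_Interface $index, $officeFile, $outDir)
{
    exec(buildConvertCommand($officeFile, $outDir), $output, $status);
    if ($status !== 0) {
        throw new RuntimeException("Conversion failed for $officeFile");
    }

    // soffice writes <basename>.html into the output directory.
    $htmlFile = $outDir . '/' . pathinfo($officeFile, PATHINFO_FILENAME) . '.html';

    $doc = Zend_Search_Lucene_Document_Html::loadHTMLFile($htmlFile);
    $doc->addField(Zend_Search_Lucene_Field::unIndexed('original', $officeFile));
    $index->addDocument($doc);
}
```

Usage would look something like `$index = Zend_Search_Lucene::create('/path/to/index'); indexOfficeFile($index, 'report.doc', '/tmp/html'); $index->commit();`, with the paths adjusted to your setup. How well this works depends on how cleanly the converter renders your documents.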

Brian

I would recommend indexing the documents with Solr and Tika together and using JSON to search your Solr/Lucene index from PHP. See the ExtractingRequestHandler (Solr wiki page) article for more information.
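
As a rough sketch of the search side, the snippet below queries Solr over HTTP and decodes the JSON response in PHP. It assumes a Solr instance at the hypothetical URL `http://localhost:8983/solr` whose documents have already been indexed through the ExtractingRequestHandler. The function names are illustrative, not part of any library. `wt=json` is Solr's standard parameter for requesting a JSON response.

```php
<?php
/** Build a Solr select URL that asks for a JSON response. */
function buildSolrSearchUrl($baseUrl, $query, $rows = 10)
{
    $params = array('q' => $query, 'wt' => 'json', 'rows' => $rows);
    return rtrim($baseUrl, '/') . '/select?' . http_build_query($params);
}

/** Pull the matching documents out of a raw Solr JSON response. */
function parseSolrResponse($json)
{
    $data = json_decode($json, true);
    if (!isset($data['response']['docs'])) {
        return array(); // malformed or unexpected response
    }
    return $data['response']['docs'];
}

/** Run the query over HTTP and return the matching documents. */
function searchSolr($baseUrl, $query, $rows = 10)
{
    $json = file_get_contents(buildSolrSearchUrl($baseUrl, $query, $rows));
    if ($json === false) {
        throw new RuntimeException('Solr request failed');
    }
    return parseSolrResponse($json);
}
```

Calling `searchSolr('http://localhost:8983/solr', 'annual report')` would then return an array of documents, each an associative array of whatever fields your Solr schema stores.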