Full-text search on microsoft docs using Apache SolR

821 views Asked by At

Does Apache Solr allow for full text search on Microsoft documents such as word or powerpoint? if so, where can I find a tutorial?

1

There are 1 answers

0
Alessandro Hoss On BEST ANSWER

Yes. Solr uses Apache Tika for content extraction and support the majority of file types.

You'll need to configure a handler in your solrconfig.xml.

Here's a good starting documentation with examples: https://lucene.apache.org/solr/guide/6_6/uploading-data-with-solr-cell-using-apache-tika.html