I would like to use the Workbench to run some tests, but I could not work out how to run it without specifying a query. I want to cluster a set of documents in the Workbench without having to enter a query. Is this possible?
Thanks
The two simplest options are the following:
Convert your data to Carrot2 XML format and use the XML document source in Workbench, where the query is optional.
Create a Lucene index out of your data and use the Lucene document source. In this case the query is mandatory, but you can use the *:* catch-all query to cluster all documents from the index. This question has some hints about converting different types of documents into the required Lucene index.
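For the first option, a minimal sketch of the conversion step might look like the following. It assumes the Carrot2 3.x XML layout as I remember it (a searchresult root, an optional query element, and document children carrying title, snippet and url), so please check the element names against the documentation for your version. The function name build_carrot2_xml and the sample records are made up for illustration.

```python
import xml.etree.ElementTree as ET


def build_carrot2_xml(docs, query=None):
    """Serialize dicts with 'title', 'snippet' and optional 'url' keys
    into a searchresult XML string (assumed Carrot2 3.x layout)."""
    root = ET.Element("searchresult")
    # The query element is only written when a query is supplied.
    if query:
        ET.SubElement(root, "query").text = query
    for i, doc in enumerate(docs):
        node = ET.SubElement(root, "document", id=str(i))
        ET.SubElement(node, "title").text = doc.get("title", "")
        ET.SubElement(node, "snippet").text = doc.get("snippet", "")
        if doc.get("url"):
            ET.SubElement(node, "url").text = doc["url"]
    # ElementTree escapes characters such as & and < in the text for us.
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical sample records; replace with your own data.
    sample = [
        {"title": "First document", "snippet": "Text of the first document."},
        {"title": "Second document", "snippet": "Text of the second document.",
         "url": "http://example.com/2"},
    ]
    print(build_carrot2_xml(sample))
```

Save the output to a file and point the XML document source at it, leaving the query field empty.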