Is it possible to use Nutch 2.x and Apache Gora with plain filesystem as backend storage


Is it possible to use Nutch 2.x and Apache Gora™ with a plain filesystem as backend storage?

Official site says:

Nutch 2.x: An emerging alternative taking direct inspiration from 1.x, but which differs in one key area; storage is abstracted away from any specific underlying data store by using Apache Gora™ for handling object to persistent mappings.

I want to use the latest version of Nutch (2.1 currently), but I don't want to set up a complex NoSQL or RDBMS backend for storage right now. I want to choose the backend storage later.

I didn't find any docs on using the filesystem as storage for Gora. Is it possible?


There is 1 answer

Alfonso Nishikawa

You could use AvroStore, which saves to a file (serialized with Avro). I say this only theoretically, since I have never used it.
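
As a rough, untested sketch of what that might look like in gora.properties: the class name and the gora.avrostore.* keys below are the ones I recall from the Gora core documentation, and the file path is purely a hypothetical placeholder. Whether every Nutch 2.x job actually runs against AvroStore is an assumption here, not something confirmed.

```properties
# Assumption: select the file-backed Avro store as the default Gora data store
gora.datastore.default=org.apache.gora.avro.store.AvroStore

# Hypothetical local path; replace with wherever the data file should live
gora.avrostore.output.path=file:///tmp/nutch-gora/webpage.avro
gora.avrostore.input.path=file:///tmp/nutch-gora/webpage.avro

# Assumption: serialization codec, BINARY or JSON
gora.avrostore.codec.type=BINARY
```

Nutch 2.x also reads a storage.data.store.class property from nutch-site.xml, so that would presumably need to point at the same class. Since Gora abstracts the storage, switching to another backend later should mostly be a matter of changing these settings.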