I have a JanusGraph database that uses Cassandra and Elasticsearch. The database is huge, and I need a way to traverse all the vertices of the graph for a specific use case.
I have tried writing a Spark job using SparkGraphComputer, but even that times out.
I am trying to check whether the Spark Cassandra Connector can be used to traverse the data. Is there any way to get all the vertex IDs from Cassandra directly, using the janusgraph_ids table or the edgestore table?
This is a known problem, see: https://www.youtube.com/watch?v=oo-b4P1kHu8
The easiest known workaround is to keep a table of vertex IDs outside JanusGraph (e.g. by updating it during JanusGraph ingests) and to use this table to issue OLTP requests to JanusGraph from within Spark jobs.
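As a rough illustration of that workaround, here is a minimal Python sketch of the per-partition logic: take the vertex IDs read from the external table, group them into batches, and run one OLTP lookup per batch. The names `batched`, `process_partition`, `fake_fetch` and the batch size are all hypothetical, and `fake_fetch` is only a stand-in for a real JanusGraph query; nothing here is taken from the JanusGraph or Spark APIs.

```python
from itertools import islice
from typing import Callable, Dict, Iterable, Iterator, List


def batched(ids: Iterable[int], size: int) -> Iterator[List[int]]:
    """Yield consecutive lists of at most `size` vertex IDs."""
    if size < 1:
        raise ValueError("size must be >= 1")
    it = iter(ids)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def process_partition(
    ids: Iterable[int],
    fetch_vertices: Callable[[List[int]], Iterable[Dict]],
    batch_size: int = 500,  # assumed value; tune for your cluster
) -> Iterator[Dict]:
    """Issue one OLTP lookup per batch of IDs and yield the results."""
    for chunk in batched(ids, batch_size):
        yield from fetch_vertices(chunk)


def fake_fetch(chunk: List[int]) -> List[Dict]:
    """Hypothetical stand-in for a real OLTP query against JanusGraph."""
    return [{"id": vertex_id} for vertex_id in chunk]


# Local demonstration with five IDs and a batch size of two.
results = list(process_partition(range(5), fake_fetch, batch_size=2))
```

In a PySpark job, something like `ids_rdd.mapPartitions(lambda ids: process_partition(ids, real_fetch))` could apply this to each partition of the ID table, where `real_fetch` is a function you would write yourself (for example, one that opens a Gremlin connection on the executor and looks up the vertices for the given IDs). Batching keeps each OLTP request small, so no single query has to scan the whole graph.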