I'm using Cassandra database with datastax driver. I need to do batch read from Cassandra of something to the order of 2000 rows. My use case is like, I get the list of ids in my request and those ids are my partitioning keys in Cassandra. I want to know if it's a good idea to spawn 2000 threads and get data from Cassandra in parallel (in that case reading the data will efficient as it goes to just one node) or is it possible to figure out a way to group ids which live in same node so that I can optimize the reads(now in this case I need to spawn much less threads and less overhead on Cassandra). Please let me know can I achieve batch read in an efficient way apart from spawning multiple threads. Thanks! PS: Data coming back from Cassandra is not that huge to cause OOM.
Related Questions in CASSANDRA
- how to create a chess board with Queen in the central position and all its moves in assembler code
- Passing arguments to ENTRYPOINT causes the container to start and run indefinitely
- Apache Cassandra Node Driver Connection
- Simulate Cassandra DB timeout
- How to update Cassandra Lucene index with a new column? rebuild or update index?
- Cassandra JDBC connection string for logstash
- Cassandra OversizedMessageException
- dsbulk unload is failing after ran couple of hours with OOM issue
- Cassandra: "Model keyspace not set" and "Connection name doesn't exist in the registry" Errors
- Unable to cqlsh to a cassandra docker container remotely
- Forward pagination with object mapper in java asyn
- Allow filter in cassandra query
- How to fix bytes unrepaired in cassandra
- Can't install Cassandra using RPM packages for RHEL 9
- Why can't get a connection to Cassandra running on Docker from a Spring Boot instace using spring-boot-starter-data-cassandra on first boot?
Related Questions in DATASTAX-JAVA-DRIVER
- How to fix bytes unrepaired in cassandra
- Cassandra java-driver Java 17 Compatibility
- ClassCastException with PreparedStatemennt upgrading to datastax driver 4.17.0 with java 17 and spring 6 and jetty 12
- Can I ignore system peer warning when connecting to AWS Keyspaces?
- UserType class missing from Cassandra 4.x driver
- CQLSession initialization error while executing triggers with Apache Cassandra 4.0.11
- Getting Paged Results with Datastax Java Cassandra driver version 4.15.0
- TupleCodec Error: Invalid tuple type, expected Tuple(TEXT, TIMEUUID) but got Tuple(TEXT, UUID)
- Cassandra Connection Error [com.datastax.oss.driver.api.core.AllNodesFailedException:]
- Cassandra datastax ipv6 connection
- Best Cassandra/Scylla configuration with single FE node
- Adding even 50 ms delay between datacenters in Cassandra cluster leads to NoNodeAvailableException even for LOCAL_SERIAL
- Cannot write to table with UDT, getting "Cannot resolve UserDefinedType for [devices]"
- Cassandra - One materialized view VS two materialized views for timestamps as clustering keys
- How to set read timeout (read-timeout-millis 3.11V ) in latest datastax 4.x version in Cassandra
Related Questions in SPRING-DATA-CASSANDRA
- Spring Data Cassandra adhere Batch Limit
- Spring data cassandra - com.datastax.oss.driver.api.core.auth.AuthenticationException
- org.springframework.core.convert.ConverterNotFoundException: No converter found capable of converting from type Instant to type org.joda.time.DateTime
- Cassandra slow reads (by partition key) for large data rows fetched
- Spring Boot Cassandra Repository With Composite Primary Key & Not Using Seperate Key Class
- An issue with mapping an @Embeded object with cassandra spring data
- How to manage two Cassandra sessions using spring data cassandra with spring boot 2.7.x
- Cassandra - No Viable alternatve at input 'COPY' ([COPY]...)
- Cassandra Connection Error [com.datastax.oss.driver.api.core.AllNodesFailedException:]
- retreiving set from cassandra in spark scala gives type mismatch java.utils
- Is there a way to prevent Spring Data from trying to connect to Cassandra if a certain profile is provided?
- Is spring data cassandra thread safe?
- How do I properly fail a validation of spring data cassandra using jakarta validations when passed to controller?
- Cannot write to table with UDT, getting "Cannot resolve UserDefinedType for [devices]"
- Cannot resolve reference to bean 'cassandraTemplate' while setting bean property 'cassandraTemplate'
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Yes it is, you can get Token Ranges for cassandra cluster and check occurrence for tokens for you ids in the ranges, and then group ids by nodes.
In additional:
There is no need to spawn many threads, datastax driver provides asynchronous api, we use it in our project to perform a lot of queries in parallel and it works enough good, but not excellent from performance point of view.
Necessity to perform thousands requests to read data indicates unsuitable data model. You should implement data model around queries to minimize number of request to have good performance.
Updated:
I suppose, you can use method Metadata.newToken to calculate token on driver side or directly get replicas with Metadata.getReplicas for a given partition key. But before it serialize the partition key according to its type and protocol version