We have a kstreams app doing kstream-kstable inner join. Both the topics are high volume with 256 partitions each. kstreams App is deployed on 8 nodes with 8 GB heap each right now. The state store (rocksdb) persists to disk and we are running out of disk space on the containers. What are some of the options to consume data from one of the topics as KTABLE, but limit the amount of data (like if we want to hold only a days worth of keys/data or some time frame) on disk and have the previous state/files get deleted?
How to consume a high volume topic as KTABLE without exhausting memory/disk space?
211 views Asked by user2221654 At
0
There are 0 answers
Related Questions in APACHE-KAFKA
- No method found for class java.lang.String in Kafka
- How to create beans of the same class for multiple template parameters in Spring
- Troubleshoot .readStream function not working in kafka-spark streaming (pyspark in colab notebook)
- Handling and ignore UNKNOWN_TOPIC_OR_PARTITION error in Kafka Streams
- Connect Apache Flink with Apache kudu as sink using Pyflink
- Embedded Kafka Failed to Start After Spring Starter Parent Version 3.1.10
- Producer Batching Service Bus Vs Kafka
- How to create a docker composer environment where containers can communicate each other?
- Springboot Kafka Consumer unable to maintain connect to kafka cluster brokers
- Kafka integration between two micro service which can respond back to the same function initiated the request
- Configuring Apache Spark's MemoryStream to simulate Kafka stream
- Opentelemetry Surpresses Kafka Produce Message Java
- Kafka: java.lang.NoClassDefFoundError: Could not initialize class org.apache.logging.log4j.core.appender.mom.kafka.KafkaManager
- MassTransit Kafka producers configure to send several events to the same Kafka topic
- NoClassDefFoundError when running JAR file with Apache Kafka dependencies
Related Questions in APACHE-KAFKA-STREAMS
- Handling and ignore UNKNOWN_TOPIC_OR_PARTITION error in Kafka Streams
- spring-cloud-stream-binder-kafka-streams consumer shuts down when RuntimeException occurs
- Is there a way to sync applications having kafka stream to avoid duplicate message processing?
- Kafka Streams: Efficient Batch Collection and State Store Management
- Springboot kafka consumer dies permanently
- Understanding the requirements for a Kafka streams application
- Kafka Streams topology initially dropping messages to intermediate topics
- "ConfigException: Please specify a key serde or set one" although I've specified it and also set a default one in my Spring Boot + Kafka Stream app
- Kafka Streams: Kafka Stream Application getting intermittent SaslAuthenticationException
- Switch between Kafka topics
- How to insert a time/data filtered Kafka Stream into a Postgres Database
- Calling POST Rest API in kafka streams application
- Using TopologyTestDriver for testing Biconsumer
- Filtering and forwarding Kafka messages based on key alone with Kafka Streams
- How to write BatchProcessor for lambda with Kafka trigger in AWS?
Related Questions in KTABLE
- Kafka Streams join not working when schema is changed
- Efficient way to delete record in ksqldb table
- Unit test KafkaStreams gives IllegalArgumentException: Unknown topic
- State store vs Ktable in Kafka Streams
- KStream join with KTable record drops if key not exist in KTable
- Why does a ktable emit two events on the changelog?
- KTable: how does that work behind the scenes?
- Insane throughput slowdown kafka streams using join (kstream & ktable) using jmh and TopologyTestDriver
- Creating GlobalKTable using only subset of topic columns
- How do you get the latest offset from a remote query to a Table in ksqlDB?
- How to get list of keys based on a field in value from Kafka state store
- ksqlDB deleting records from KTable
- Apache Kafka - Implementing a KTable and producing event using CloudEvent
- KSQL Table timestamp for one of table is not populating as intended
- Apple M1 - Error opening store caused by RocksDBException: Column family not found when joining KStream to KTable
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)