I want to read one million records from a CSV file and write to another CSV file with WSO2 streaming integrator. The problem is the slow speed of reading and writing data, almost 10 per second. WSO2 says "Read a 100GB file with 10 million lines in just 20 mins" but I found no sample or guide to do that. can anyone help me?
WSO2 streaming integrator - read large files (one million of records) and write to another file
222 views Asked by Fateme Shahrabi At
1
There are 1 answers
Related Questions in FILE
- Helpt with reading files
- Why can't I use the file pointer after the first read attempt fails?
- Can't read the file using std::wifstream C++
- How can the scanner reread the entire file after it has already executed hasNextLine once?
- What is 'Invalid Load Key, '\x00'
- php $_FILE variable undefined index
- Data loaded from the file is not returned in the correct order
- File splitting and encryption
- Optimizing an s5cmd command that uses awk to generate a text file
- segmentation fault while reading in text file ( c++ )
- File.OpenText is adding C:\ to the front which is an error
- UTF-8 issue with excel
- How to upload files to MediaWiki APIs in Rust?
- No such file or directory: '/tmp/tmp_ejr26m6.upload.mp3' in Django
- Problems accessing zip files on the react front end from express backend
Related Questions in WSO2
- How to make gRPC request in WSO2 micro-integrator via Postman?
- Following PATTERN 3 Deployment ,Not able to consume apis in Gateway
- WSO2 change localhost - ERR_CERT_AUTHORITY_INVALID
- How to resolve unauthenticated error after calling REST API endpoint to update the Challenge Questions in WSO2 v5.11.0?
- Enable logging for API calls in wso2 identity server
- How to set weight of enpoints durin laod balancing in wso2 manager
- old apikey dont works for newly subscribed apis in wso2 apim
- WSO2 API-M: What is Synapse and why it spits following error: I/O error: Connection reset java.net.SocketException: Connection reset?
- how to upgrade httpcore-nio plugin of wso2 api manager
- Amazon RDS PostgreSQL - Caused by: java.net.UnknownHostException: yyyy.xxx.us-west-1.rds.amazonaws.com in WSO2 EI docker container
- How to Extract Queue Name, Priority, and Message ID from RabbitMQ Inbound Endpoint Messages in WSO2 MI 4.2?
- how do i manage role and permission in server side for my predefined roles and assign permission in wso2 identity server?
- Installation process of wso2 Enterprise integrator
- WSO2 - SSL troubleshooting: ERR_CERT_AUTHORITY_INVALID
- How to do error handling in Clone-Aggregator mediator in WSO2 Micro Integrator?
Related Questions in WSO2-STREAMING-INTEGRATOR
- WSO2 Streaming integrator on multiple cores
- WSO2 API Manager Integration with Prometheus and Grafana
- WSO2 siddhi-gpl-execution-geo library works in java 8 but not supporting Java 18
- WSO2 Stream Processor (SP) Json string cannot be parsed to a json object
- WSO2 Stream Processor (SP) Error "could not initialize carbon deployment engine"
- How to perform Mongodb CDC using wso2 streaming integrator?
- how to connect to PostgreSQL using Streaming Integrator
- Browsing H2 Database for WSO2 Streaming Integrator (WSO2 SI)
- Errors in Siddhi app.Different definition same as output 'define stream
- Siddhi (wso2sp) - encountering DuplicateDefinitionException when trying to perform multiple insert operations on rdbs store
- com/mongodb/MongoException in Siddhi app mongo-store extension
- Siddhi App > Cassandra Extension: Implementation of JAXB-API has not been found on module path or class path
- WSO2 streaming integrator - read large files (one million of records) and write to another file
- Trying to stack siddhi on wso2 Stream processor/Stream Integrator
- Kibana - The indices which match this index pattern don't contain any time fields
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)

Finally I found some example here https://ei.docs.wso2.com/en/7.2.0/streaming-integrator/ref/performance-analysis-results/ and with these example my speed is ok.
batch.size.maxandbuffer.sizeare useful parameters here.