Zstandard levels in hadoop

Question

Zstandard levels in hadoop

655 views Asked by ondway At 12 October 2019 at 09:25

Compression level in org.apache.hadoop.io.compress.zstd.ZStandardCompressor does't seem to work. I see the reset function getting called in ZStandardCompressor constructor which is turn call init(level, stream) to call native function which I believe to be only place setting zstd parameter. In my test, I am ensuring that this is being called but calling it different levels like 1, 5, 10. 20 etc did not make any difference as output size is exact same.

Hadoop doesn't seem to use zstd-jni and use own stuff to use zstd. I am sure people are using different levels in hadoop. Could you someone point I should go around chasing for next step

Original Q&A

There are 1 answers

**ondway** · Answer 1 · 2020-09-07T14:14:01+00:00

ondway On 07 September 2020 at 14:14

Given that people are finding this question without answer, I am adding solution which I used. InternalParquetRecordWriter has compressor as argument, so I integrated zstd-jni library here by creating a compressor by extending BytesInputCompressor.

TechQA.

Zstandard levels in hadoop

There are 1 answers

Related Questions in HADOOP

Related Questions in ZSTANDARD

Popular Questions

Popular Tags

Trending Questions