Hadoop's support for zlib library lacks support to perform flushes (Z_SYNC_FLUSH and Z_FULL_FLUSH)

Key: HADOOP-6297
URL: https://issues.apache.org/jira/browse/HADOOP-6297
Project: Hadoop Common
Issue Type: Improvement
Components: io
Affects Versions: 0.21.0
Reporter: Kevin J. Price
Priority: Minor

The zlib library supports two types of flush when deflating data. Z_SYNC_FLUSH forces all pending input to be written as output, aligned on a byte boundary, and resets the Huffman coding. Z_FULL_FLUSH does the same thing but additionally resets the compression dictionary. The Hadoop wrapper for the zlib library supports neither of these.
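To make the two flush modes concrete, here is a minimal sketch that uses the JDK's own java.util.zip.Deflater rather than Hadoop's wrapper. It assumes Java 7 or later, where Deflater gained a four-argument deflate and the SYNC_FLUSH / FULL_FLUSH constants; that postdates this report. The class and method names (ZlibFlushDemo, deflateChunk, and so on) are made up for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Illustrative only: exercises the JDK Deflater, not Hadoop's zlib wrapper.
class ZlibFlushDemo {

    // Feed `input` to the deflater and drain the output the flush mode produces.
    static byte[] deflateChunk(Deflater deflater, byte[] input, int flushMode) {
        deflater.setInput(input);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[256];
        int n;
        do {
            n = deflater.deflate(buf, 0, buf.length, flushMode);
            out.write(buf, 0, n);
        } while (n == buf.length); // a full buffer means more output may be pending
        return out.toByteArray();
    }

    // Inflate as much as the compressed bytes supplied so far allow.
    static byte[] inflateAvailable(Inflater inflater, byte[] compressed) {
        inflater.setInput(compressed);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[256];
        try {
            int n;
            while ((n = inflater.inflate(buf)) > 0) {
                out.write(buf, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalStateException("not decodable from this point", e);
        }
        return out.toByteArray();
    }

    // Sync flush: the reader can decode everything written so far,
    // even though the stream has not been finished.
    static byte[] decodeAfterSyncFlush(byte[] input) {
        Deflater deflater = new Deflater();
        Inflater inflater = new Inflater();
        try {
            byte[] flushed = deflateChunk(deflater, input, Deflater.SYNC_FLUSH);
            return inflateAvailable(inflater, flushed);
        } finally {
            deflater.end();
            inflater.end();
        }
    }

    // Full flush on a raw deflate stream (nowrap = true): a fresh inflater
    // that never saw `first` can start decoding at the flush point.
    static byte[] decodeFromFullFlushPoint(byte[] first, byte[] second) {
        Deflater deflater = new Deflater(Deflater.DEFAULT_COMPRESSION, true);
        Inflater fresh = new Inflater(true);
        try {
            deflateChunk(deflater, first, Deflater.FULL_FLUSH); // discarded on purpose
            byte[] tail = deflateChunk(deflater, second, Deflater.SYNC_FLUSH);
            return inflateAvailable(fresh, tail);
        } finally {
            deflater.end();
            fresh.end();
        }
    }

    public static void main(String[] args) {
        byte[] first = "first record\n".getBytes(StandardCharsets.UTF_8);
        byte[] second = "second record\n".getBytes(StandardCharsets.UTF_8);
        System.out.println("sync flush round trip ok: "
                + Arrays.equals(decodeAfterSyncFlush(first), first));
        System.out.println("restart after full flush ok: "
                + Arrays.equals(decodeFromFullFlushPoint(first, second), second));
    }
}
```

If the first chunk were flushed with SYNC_FLUSH instead of FULL_FLUSH, the second chunk could contain back-references into the first, so a fresh inflater would generally not be able to start from that point.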

Adding support should be fairly trivial: an additional deflate method that takes a fourth "flush" parameter, and a modification to the native C code to accept this fourth parameter and pass it along to the zlib library. I can submit a patch for this if desired.
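The shape of that change might look roughly like the sketch below. Everything in it is hypothetical: FlushingCompressorSketch is not a Hadoop class, and the JDK's java.util.zip.Deflater (Java 7 or later) stands in for the native C code that would receive the flush value and hand it to zlib.

```java
import java.util.zip.Deflater;

// Hypothetical sketch of the proposed API shape; not Hadoop's actual wrapper.
class FlushingCompressorSketch {
    // Stand-in for the native zlib binding (assumption: Java 7+ Deflater).
    private final Deflater delegate = new Deflater();

    void setInput(byte[] b, int off, int len) {
        delegate.setInput(b, off, len);
    }

    // Existing-style method: three arguments, never requests a flush.
    int deflate(byte[] b, int off, int len) {
        return deflate(b, off, len, Deflater.NO_FLUSH);
    }

    // Proposed-style method: the fourth argument selects the flush mode
    // and is passed straight through to the underlying deflater.
    int deflate(byte[] b, int off, int len, int flush) {
        return delegate.deflate(b, off, len, flush);
    }

    void end() {
        delegate.end();
    }
}
```

A caller would pass Deflater.SYNC_FLUSH or Deflater.FULL_FLUSH as the fourth argument wherever buffered input has to reach the output, and keep using the three-argument form otherwise.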

It should be noted that the native Sun Java API is likewise missing this functionality, as has been noted for over a decade here: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4206909


group: common-dev @
posted: Oct 6, '09 at 3:01p
active: Oct 6, '09 at 3:01p