Crate flate2

Crate flate2 

Source
Expand description

A DEFLATE-based stream compression/decompression library

This library provides support for compression and decompression of DEFLATE-based streams:

  • the DEFLATE format itself
  • the zlib format
  • gzip

These three formats are all closely related and largely only differ in their headers/footers. This crate has three types in each submodule for dealing with these three formats.

§Implementation

In addition to supporting three formats, this crate supports several different backends, controlled through this crate’s features flags:

  • default, or rust_backend - this implementation currently uses the miniz_oxide crate which is a port of miniz.c to Rust. This feature does not require a C compiler, and only uses safe Rust code.

    Note that the rust_backend feature may at some point be switched to use zlib-rs, and that miniz_oxide should be used explicitly if this is not desired.

  • zlib-rs - this implementation utilizes the zlib-rs crate, a Rust rewrite of zlib. This backend is the fastest, at the cost of some unsafe Rust code.

Several backends implemented in C are also available. These are useful in case you are already using a specific C implementation and need the result of compression to be bit-identical. See the crate’s README for details on the available C backends.

The zlib-rs backend typically outperforms all the C implementations.

§Feature Flags

  • miniz_oxide (enabled by default) — This implementation uses only safe Rust code and doesn’t require a C compiler. It provides good performance for most use cases while being completely portable.
  • default — The default backend using pure Rust implementation via miniz_oxide. This provides a safe, portable compression implementation without requiring a C compiler.

§User-Facing Backend Features

Choose one of these features to select the compression backend. Only one backend should be enabled at a time, or else one will see an unstable order which is currently zlib-ng, zlib-rs, cloudflare_zlib, miniz_oxide and which may change at any time.

  • zlib-rs — Use the zlib-rs backend, a pure Rust rewrite of zlib. This is the fastest backend overall, providing excellent performance with some unsafe code. It does not require a C compiler but uses unsafe Rust for optimization.

  • rust_backend (enabled by default) — Use the pure Rust miniz_oxide backend (default). This implementation uses only safe Rust code and doesn’t require a C compiler. It provides good performance for most use cases while being completely portable.

    Note that this feature at some point may be switched to use zlib-rs instead.

  • zlib — Use the system’s installed zlib library. This is useful when you need compatibility with other C code that uses zlib, or when you want to use the system-provided zlib for consistency.

  • zlib-default — Use the system’s installed zlib library with default features enabled. Similar to zlib but enables additional features from libz-sys.

  • zlib-ng-compat — Use zlib-ng in zlib-compat mode via libz-sys. This provides zlib-ng’s performance improvements while maintaining compatibility. Note: If any crate in your dependency graph uses stock zlib, you’ll get stock zlib instead. For guaranteed zlib-ng, use the zlib-ng feature. When using this feature, if any crate in your dependency graph explicitly requests stock zlib, or uses libz-sys directly without default-features = false, you’ll get stock zlib rather than zlib-ng. See the libz-sys README for details. To avoid that, use the "zlib-ng" feature instead.

  • zlib-ng — Use the high-performance zlib-ng library directly. This typically provides better performance than stock zlib and works even when other dependencies use zlib. Requires a C compiler.

  • cloudflare_zlib — Use Cloudflare’s optimized zlib implementation. This provides better performance than stock zlib on x86-64 (with SSE 4.2) and ARM64 (with NEON & CRC).

    • ⚠ Does not support 32-bit CPUs and is incompatible with mingw.
    • ⚠ May cause conflicts if other crates use different zlib versions.
  • miniz-sys — Deprecated alias for rust_backend, provided for backwards compatibility. Use rust_backend instead.

§Internal Features

These features are used internally for backend selection and should not be enabled directly by users. They are documented here to aid with maintenance.

  • any_zlibInternal: Marker feature indicating that any zlib-based C backend is enabled. This is automatically enabled by zlib-rs, zlib, zlib-ng, zlib-ng-compat, and cloudflare_zlib. Do not enable this feature directly; instead, choose a specific backend feature.
  • any_c_zlibInternal: Marker feature indicating that any C based fully zlib compatible backend is enabled. This is automatically enabled by zlib, zlib-ng, zlib-ng-compat, and cloudflare_zlib. Do not enable this feature directly; instead, choose a specific backend feature.
  • any_impl (enabled by default)Internal: Marker feature indicating that any compression backend is enabled. This is automatically enabled by all backend features to ensure at least one implementation is available. Do not enable this feature directly; instead, choose a specific backend feature.

§Ambiguous feature selection

As Cargo features are additive, while backends are not, there is an order in which backends become active if multiple are selected.

  • zlib-ng
  • zlib-rs
  • cloudflare_zlib
  • miniz_oxide

§Organization

This crate consists of three main modules: bufread, read, and write. Each module implements DEFLATE, zlib, and gzip for std::io::BufRead input types, std::io::Read input types, and std::io::Write output types respectively.

Use the bufread implementations if you can provide a BufRead type for the input. The &[u8] slice type implements the BufRead trait.

The read implementations conveniently wrap a Read type in a BufRead implementation. However, the read implementations may read past the end of the input data, making the Read type useless for subsequent reads of the input. If you need to re-use the Read type, wrap it in a std::io::BufReader, use the bufread implementations, and perform subsequent reads on the BufReader.

The write implementations are most useful when there is no way to create a BufRead type, notably when reading async iterators (streams).

use futures::{Stream, StreamExt};
use std::io::{Result, Write as _};

async fn decompress_gzip_stream<S, I>(stream: S) -> Result<Vec<u8>>
where
    S: Stream<Item = I>,
    I: AsRef<[u8]>
{
    let mut stream = std::pin::pin!(stream);
    let mut w = Vec::<u8>::new();
    let mut decoder = flate2::write::GzDecoder::new(w);
    while let Some(input) = stream.next().await {
        decoder.write_all(input.as_ref())?;
    }
    decoder.finish()
}

Note that types which operate over a specific trait often implement the mirroring trait as well. For example a bufread::DeflateDecoder<T> also implements the Write trait if T: Write. That is, the “dual trait” is forwarded directly to the underlying object if available.

§About multi-member Gzip files

While most gzip files one encounters will have a single member that can be read with the GzDecoder, there may be some files which have multiple members.

A GzDecoder will only read the first member of gzip data, which may unexpectedly provide partial results when a multi-member gzip file is encountered. GzDecoder is appropriate for data that is designed to be read as single members from a multi-member file. bufread::GzDecoder and write::GzDecoder also allow non-gzip data following gzip data to be handled.

The MultiGzDecoder on the other hand will decode all members of a gzip file into one consecutive stream of bytes, which hides the underlying members entirely. If a file contains non-gzip data after the gzip data, MultiGzDecoder will emit an error after decoding the gzip data. This behavior matches the gzip, gunzip, and zcat command line tools.

Modules§

bufread
Types which operate over BufRead streams, both encoders and decoders for various formats.
read
Types which operate over Read streams, both encoders and decoders for various formats.
write
Types which operate over Write streams, both encoders and decoders for various formats.

Structs§

Compress
Raw in-memory compression stream for blocks of data.
CompressError
Error returned when a compression object is used incorrectly or otherwise generates an error.
Compression
When compressing data, the compression level can be specified by a value in this struct.
Crc
The CRC calculated by a CrcReader.
CrcReader
A wrapper around a Read that calculates the CRC.
CrcWriter
A wrapper around a Write that calculates the CRC.
Decompress
Raw in-memory decompression stream for blocks of data.
DecompressError
Error returned when a decompression object finds that the input stream of bytes was not a valid input stream of bytes.
GzBuilder
A builder structure to create a new gzip Encoder.
GzHeader
A structure representing the header of a gzip stream.

Enums§

FlushCompress
Values which indicate the form of flushing to be used when compressing in-memory data.
FlushDecompress
Values which indicate the form of flushing to be used when decompressing in-memory data.
Status
Possible status results of compressing some data or successfully decompressing a block of data.