Zlib

zlib (/ˈziːlɪb/ or "zeta-lib", /ˈziːtəˌlɪb/)^[3]^[4] is a data format and a lossless data compression software library created by Jean-Loup Gailly and Mark Adler. The library implements the Deflate algorithm and can compress and decompress data in the zlib data format, in the gzip file format, or as a raw Deflate stream.^[5]

zlib (library)

Developers	Jean-Loup Gailly, Mark Adler^[1]
Release	1 May 1995
Stable release	1.3.2^[2] / 17 February 2026
Written in	C
Operating system	Cross-platform
Type	Data compression
License	zlib License
Website	zlib.net
Repository	github.com/madler/zlib

zlib is also a crucial component of many software platforms, including Linux, macOS, and iOS. It has also been used in gaming consoles such as the PlayStation 4, PlayStation 3, Wii U, Wii, Xbox One and Xbox 360.^{[citation needed]}

The first public version of zlib, 0.9, was released on 1 May 1995 and was originally intended for use with the libpng image library. It is free software, distributed under the zlib License.

RFC 1950 specifies the zlib data format.^[6]

Data format

zlib (data format)
Internet media type	`application/zlib`^[7]
Size limitation	Unlimited
Developed by	Jean-Loup Gailly, Mark Adler^[8]
Type of format	Data compression
Compression	Lossless
Container for	Deflate stream
Standard	RFC 1950

The zlib data format consists of a 2-byte header, an optional 4-byte preset-dictionary identifier (`DICTID`), Deflate-compressed data, and a 4-byte trailer.

Multi-byte values are stored in big-endian order.^[9] Bit 0 is the least significant bit in a byte.^[10]

Offset (bytes)	Field^[11]	Size (bytes)	Description
0	CMF	1	This byte is divided into two 4-bit fields. Bits 0 to 3: `CM` (compression method); must be 8 (Deflate). Bits 4 to 7: `CINFO` (compression info); the base-2 logarithm of the LZ77 window size, minus eight. The maximum allowed value is 7 (a 32 KiB window); 6 indicates 16 KiB, 5 indicates 8 KiB, and so on.
1	FLG	1	Flags. Bits 0 to 4: `FCHECK`; chosen such that `CMF × 256 + FLG` is a multiple of 31. Bit 5: `FDICT`; see `DICTID` below. Bits 6 to 7: `FLEVEL` (compression level); fastest (0), fast (1), default (2) or maximum (3). Not needed for decompression, but indicates whether recompression might be worthwhile.
2	DICTID	0 or 4	Adler-32 checksum of the preset DEFLATE dictionary used during compression. Present if the FDICT flag is set. The decompressor can use this checksum to determine which dictionary has been used by the compressor (e.g. `deflateSetDictionary` in `zlib.h`).
Varies	Compressed data	Varies	Deflate stream.
Varies	ADLER32	4	Adler-32 checksum of the uncompressed data (excluding any dictionary data).
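
The field layout above can be checked against real streams. The following is a minimal sketch using Python's standard `zlib` module; the helper `parse_zlib_header`, the sample payload, and the `PRESET` dictionary are illustrative names and values, not part of zlib itself.

```python
import zlib


def parse_zlib_header(stream: bytes) -> dict:
    """Decode CMF, FLG and (if present) DICTID from the start of a zlib stream."""
    if len(stream) < 2:
        raise ValueError("a zlib stream starts with a 2-byte header")
    cmf, flg = stream[0], stream[1]
    # FCHECK is chosen so that CMF * 256 + FLG is a multiple of 31.
    if (cmf * 256 + flg) % 31 != 0:
        raise ValueError("FCHECK mismatch: not a valid zlib header")
    fdict = bool(flg & 0x20)  # bit 5 of FLG
    if fdict and len(stream) < 6:
        raise ValueError("FDICT is set but DICTID is truncated")
    return {
        "cm": cmf & 0x0F,                      # bits 0 to 3 of CMF
        "window_size": 1 << ((cmf >> 4) + 8),  # CINFO is log2(window) - 8
        "fdict": fdict,
        "flevel": flg >> 6,                    # bits 6 to 7 of FLG
        # DICTID is a big-endian Adler-32 of the preset dictionary.
        "dictid": int.from_bytes(stream[2:6], "big") if fdict else None,
    }


payload = b"hello, hello, hello, zlib"

# A stream without a preset dictionary.
plain = zlib.compress(payload)
header = parse_zlib_header(plain)
trailer = int.from_bytes(plain[-4:], "big")  # ADLER32 of the uncompressed data

# A stream with a (hypothetical) preset dictionary: FDICT is set, DICTID follows.
PRESET = b"hello, zlib"
comp = zlib.compressobj(zdict=PRESET)
with_dict = comp.compress(payload) + comp.flush()
dict_header = parse_zlib_header(with_dict)
dict_trailer = int.from_bytes(with_dict[-4:], "big")
```

In both cases the trailer covers only the uncompressed payload, so it should equal `zlib.adler32(payload)` whether or not a dictionary was used.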

Library

Encapsulation

Deflate compression typically uses the zlib data format or the gzip file format, which add a header and a trailer to the compressed data. This provides stream identification and error detection that are not provided by a Deflate stream.

The zlib wrapper adds at least 6 bytes of overhead (a 2-byte header and a 4-byte trailer), whereas the gzip wrapper adds at least 18 bytes, as the latter also stores filesystem metadata.
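
As a rough illustration of the three encapsulations, the sketch below compresses the same input as a raw Deflate stream, a zlib stream, and a gzip stream using Python's standard `zlib` module, where the `wbits` argument selects the wrapper. The helper name `deflate` and the sample data are assumptions made for this example.

```python
import zlib


def deflate(data: bytes, wbits: int) -> bytes:
    """Compress data in one shot; wbits selects the wrapper around Deflate."""
    comp = zlib.compressobj(level=6, wbits=wbits)
    return comp.compress(data) + comp.flush()


data = b"encapsulation " * 64

raw = deflate(data, -15)         # negative wbits: raw Deflate, no header or trailer
zlib_stream = deflate(data, 15)  # 9..15: zlib header and Adler-32 trailer
gzip_stream = deflate(data, 31)  # 16 + 15: gzip header and CRC-32/length trailer

# Overhead added by each wrapper relative to the raw Deflate stream.
zlib_overhead = len(zlib_stream) - len(raw)
gzip_overhead = len(gzip_stream) - len(raw)

# Each stream must be decompressed with a matching wbits value.
from_raw = zlib.decompress(raw, -15)
from_zlib = zlib.decompress(zlib_stream, 15)
from_gzip = zlib.decompress(gzip_stream, 31)
```

With no optional gzip header fields set, the overheads are expected to be the 6 and 18 bytes quoted above.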

Algorithm

As of September 2018, zlib only supports one algorithm, called DEFLATE, which uses a combination of a variation of LZ77 (Lempel–Ziv 1977) and Huffman coding.^[12] This algorithm provides good compression on a wide variety of data with minimal use of system resources. This is also the algorithm used in the Zip archive format. The header makes allowance for other algorithms, but none are currently implemented.

Resource use

zlib provides facilities for control of processor and memory use. A compression level value may be supplied that trades speed for compression. There are also facilities for conserving memory, useful in restricted memory environments, such as some embedded systems.
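
A minimal sketch of these controls, using Python's standard `zlib` binding: `level` trades speed for compression, while `memLevel` and `wbits` bound the memory used by the compressor. The helper `compress_with` and the generated sample data are assumptions for illustration only.

```python
import zlib

# Synthetic, mildly repetitive sample data (illustrative only).
data = b"".join(b"record %d: status=ok\n" % i for i in range(2000))


def compress_with(level: int, mem_level: int = 8, wbits: int = 15) -> bytes:
    """Compress `data` with the given speed/ratio and memory settings."""
    comp = zlib.compressobj(level=level, wbits=wbits, memLevel=mem_level)
    return comp.compress(data) + comp.flush()


fastest = compress_with(zlib.Z_BEST_SPEED)         # level 1
smallest = compress_with(zlib.Z_BEST_COMPRESSION)  # level 9
# Smallest internal state and a 512-byte window, for constrained environments.
low_memory = compress_with(zlib.Z_DEFAULT_COMPRESSION, mem_level=1, wbits=9)

sizes = {
    "fastest": len(fastest),
    "smallest": len(smallest),
    "low_memory": len(low_memory),
}
```

All three outputs are ordinary zlib streams and decompress with the default settings; only the time, memory, and resulting size differ.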

Strategy

The compression can be optimized for specific types of data. If one is using the library to always compress specific types of data, then using a specific strategy may improve compression and performance. For example, if the data contain long lengths of repeated bytes, the run-length encoding (RLE) strategy may give good results at a higher speed. For general data, the default strategy is preferred.
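
The strategy is chosen when the compressor is created. The sketch below, which assumes a Python version whose `zlib` module exposes `Z_RLE`, compresses data made of long byte runs with both the default and the RLE strategy; the helper name and the sample data are hypothetical.

```python
import zlib


def compress_with_strategy(data: bytes, strategy: int) -> bytes:
    """Compress data at the default level using the given strategy constant."""
    comp = zlib.compressobj(level=6, strategy=strategy)
    return comp.compress(data) + comp.flush()


# Sample input consisting of long runs of a single repeated byte.
runs = b"".join(bytes([value]) * 500 for value in range(64))

default = compress_with_strategy(runs, zlib.Z_DEFAULT_STRATEGY)
rle = compress_with_strategy(runs, zlib.Z_RLE)

# The strategy only affects the compressor; decompression is unchanged.
restored_default = zlib.decompress(default)
restored_rle = zlib.decompress(rle)
```

Which strategy wins on size or speed depends on the data, so it is worth measuring both on representative input before committing to one.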

Error handling

Errors in compressed data can be detected. If "full-flush" points are written to the compressed stream, corrupt data can be skipped and decompression resynchronized at the next flush point, although the corrupt data itself is not recovered. Full-flush points are useful for large data streams on unreliable channels, where some data loss is unimportant, such as in some multimedia applications. However, creating many flush points can affect the speed as well as the amount (ratio) of compression.
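
The effect of a full-flush point can be sketched with Python's standard `zlib` module. The C library offers `inflateSync` to search for a flush point; the Python binding does not expose it, so this example simply remembers where the flush point was written (an assumption made to keep the sketch short) and restarts decompression there as a raw Deflate stream.

```python
import zlib

comp = zlib.compressobj()

# First segment, terminated by a full-flush point. A full flush ends on a byte
# boundary and resets the compressor's history, so later data never refers back.
part1 = comp.compress(b"first segment ") + comp.flush(zlib.Z_FULL_FLUSH)

# Second segment plus the end of the stream (final block and Adler-32 trailer).
part2 = comp.compress(b"second segment") + comp.flush()

# Simulate losing part1 entirely: restart at the flush point with a raw
# Deflate decompressor, since the zlib header was in the lost part.
resync = zlib.decompressobj(-15)
recovered = resync.decompress(part2)

# The 4-byte Adler-32 trailer is left over; it covers the whole stream,
# so it cannot validate the partially recovered data.
leftover = resync.unused_data
```

Only the data after the flush point is recovered, which matches the caveat above: the stream resynchronizes, but the lost segment is gone.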

Data length

There is no limit to the length of data that can be compressed or decompressed. Repeated calls to the library allow an unlimited number of blocks of data to be handled. Some ancillary code (counters) may suffer from overflow for long data streams, but this does not affect the actual compression or decompression.

When compressing a long (or infinite) data stream, it is advisable to write regular full-flush points.
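
A minimal sketch of processing a stream of arbitrary length in bounded memory, with periodic full-flush points, using Python's standard `zlib` module. The function names, the `flush_every` parameter, and the sample chunks are assumptions for this example.

```python
import zlib
from typing import Iterable, Iterator


def compress_stream(chunks: Iterable[bytes], flush_every: int = 4) -> Iterator[bytes]:
    """Compress chunks incrementally, writing a full-flush point every few chunks."""
    comp = zlib.compressobj()
    for index, chunk in enumerate(chunks, start=1):
        out = comp.compress(chunk)
        if index % flush_every == 0:
            out += comp.flush(zlib.Z_FULL_FLUSH)
        if out:
            yield out
    yield comp.flush()  # final block and Adler-32 trailer


def decompress_stream(pieces: Iterable[bytes]) -> Iterator[bytes]:
    """Decompress pieces incrementally, without holding the whole stream in memory."""
    decomp = zlib.decompressobj()
    for piece in pieces:
        out = decomp.decompress(piece)
        if out:
            yield out
    tail = decomp.flush()
    if tail:
        yield tail


chunks = [(b"chunk %d " % i) * 100 for i in range(10)]
restored = b"".join(decompress_stream(compress_stream(chunks)))
```

Because both sides are generators, the same code works for input that is produced lazily, such as data read from a socket or a very large file.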

Applications

Today, zlib is something of a de facto standard, to the point that zlib and DEFLATE are often used interchangeably in standards documents, with thousands of applications relying on it for compression, either directly or indirectly.^[13] These include:

The Linux kernel, where zlib is used to implement compressed network protocols, compressed file systems, and to decompress the kernel image at boot time.
GNU Binutils and GNU Debugger (GDB)
libpng, the reference implementation for the PNG image format, which specifies DEFLATE as the stream compression for its bitmap data.
libwww, an API for web applications like web browsers.
The Apache HTTP Server, which uses zlib to implement HTTP/1.1 content compression.
Similarly, the cURL library uses zlib to decompress HTTP responses.^[14]
The OpenSSH client and server, which rely on zlib to perform the optional compression offered by the Secure Shell protocol.
The OpenSSL and GnuTLS security libraries, which can optionally use zlib to compress TLS connections.
The FFmpeg multimedia library, which uses zlib to read and write the DEFLATE-compressed parts of stream formats, such as Matroska.
The rsync remote file synchronizer, which uses zlib to implement optional protocol compression.
The dpkg and RPM package managers, which use zlib to unpack files from compressed software packages.
The Apache Subversion and CVS version control systems, which use zlib to compress traffic to and from remote repositories.
The Apache ORC column-oriented data storage format, which uses zlib as its default compression method.^[15]
The Git version control system uses zlib to store the contents of its data objects (blobs, trees, commits and tags).
The PostgreSQL RDBMS uses zlib for its custom dump format (pg_dump -Fc) for database backups.
The class System.IO.Compression.DeflateStream of the Microsoft .NET Framework 2.0 and higher.^[16]
The "deflate" utility in TORNADO as part of VxWorks Operating System made by Wind River Systems uses zlib to compress boot ROM images.
zlib-flate,^[17] a raw zlib compression program that is part of qpdf.^[18]
The MySQL RDBMS uses zlib for compression in InnoDB tables.^[19]

zlib is also used in many embedded devices, such as the Apple iPhone and Sony PlayStation 3, because the code is portable, liberally licensed, and has a relatively small memory footprint.

Forks

A commonly used library built on an old codebase, zlib is also frequently forked by third parties that claim improvements to this library:

Intel has a high-performance fork of zlib.^[20]
Cloudflare maintains a high-performance fork with "massive" improvements.^[21]
zlib-ng is a zlib replacement fork for "next generation" systems. It removes workaround code for compilers that do not support ANSI C, integrates Cloudflare and Intel optimizations, adds hardware acceleration (SIMD and intrinsic functions), and uses code sanitizers, fuzzing, and code coverage to help find bugs.^[22]
