Lzip

 [lzip logo] [ English | Español | Français | Italiano ]

Introduction

Lzip is a lossless data compressor based on the LZMA algorithm, with very safe integrity checking and a user interface similar to the one of gzip or bzip2. Lzip decompresses almost as fast as gzip and compresses better than bzip2, which makes it well suited for software distribution and data archiving.

Lzip uses the same well-defined exit status values used by bzip2, which makes it safer when used in pipes or scripts than compressors returning ambiguous warning values, like gzip.

If you ever need to recover data from a damaged lzip file, try the lziprecover program. Lziprecover makes lzip files resistant to bit-flip, one of the most common forms of data corruption, and its recovery capabilities contribute to make of the lzip format one of the best options for long-term data archiving.

In this lzip benchmark page you can find some tests showing how well lzip can replace gzip and bzip2 for software distribution and data archiving.

The lzip file format (.lz) is an improved successor of the unsafe and deprecated lzma-alone file format (.lzma). Software tools that decompress both formats or can convert lzma-alone files to lzip format are listed below, in the "Related projects" and "Links" sections.

Lzip replaces every file given in the command line with a compressed version of itself, with the name "original_name.lz". Each compressed file has the same modification date, permissions, and, when possible, ownership as the corresponding original, so that these properties can be correctly restored at decompression time. Lzip is able to read from some types of non regular files if the "--stdout" option is specified.

If no file names are specified, lzip compresses (or decompresses) from standard input to standard output. In this case, lzip will decline to write compressed output to a terminal, as this would be entirely incomprehensible and therefore pointless.

Lzip will correctly decompress a file which is the concatenation of two or more compressed files. The result is the concatenation of the corresponding uncompressed files. Integrity testing of concatenated compressed files is also supported.

Lzip can produce multi-member files and safely recover, with lziprecover, the undamaged members in case of file damage. Lzip can also split the compressed output in volumes of a given size, even when reading from standard input. This allows the direct creation of multivolume compressed tar archives.

Lzip will automatically use the smallest possible dictionary size without exceeding the given limit. It is important to appreciate that the decompression memory requirement is affected at compression time by the choice of dictionary size limit.

As a self-check for your protection, lzip stores in the member trailer the 32-bit CRC of the original data, the size of the original data and the size of the member. These values, together with the value remaining in the range decoder and the end-of-stream marker, provide a very safe 4 factor integrity checking which guarantees that the decompressed version of the data is identical to the original. This guards against corruption of the compressed data, and against undetected bugs in lzip (hopefully very unlikely). The chances of data corruption going undetected are microscopic. Be aware, though, that the check occurs upon decompression, so it can only tell you that something is wrong. It can't help you recover the original uncompressed data.

Lzip implements a simplified version of the LZMA (Lempel-Ziv-Markov chain-Algorithm) algorithm. The high compression of LZMA comes from combining two basic, well-proven compression ideas: sliding dictionaries (LZ77/78) and markov models (the thing used by every compression algorithm that uses a range encoder or similar order-0 entropy coder as its last stage) with segregation of contexts according to what the bits are used for.

The ideas embodied in lzip are due to (at least) the following people: Abraham Lempel and Jacob Ziv (for the LZ algorithm), Andrey Markov (for the definition of Markov chains), G.N.N. Martin (for the definition of range encoding), Igor Pavlov (for putting all the above together in LZMA), and Julian Seward (for bzip2's CLI and the idea of unzcrash).

Related projects

Plzip - A multi-threaded compressor using the lzip file format.

Lzlib - A compression library for the lzip file format written in C.

Lunzip - A small decompressor for lzip files written in C.

Clzip - A C implementation of lzip for systems lacking a C++ compiler.

Lziprecover - A data recovery tool and decompressor for lzip files.

Zutils - Replacement for zcat, zdiff, zgrep, etc, that understands bzip2, gzip and lzip formats.

Pdlzip - A limited, "public domain" C implementation of the lzip data compressor, intended for those who can't distribute GPL licensed Free Software. Pdlzip is also able to decompress legacy lzma-alone (.lzma) files.

Lzd - An educational decompressor for lzip files.

Documentation

The manual is available in the info system of the GNU Operating System. Use info to access the top level info page. Use info lzip to access the lzip section directly.

An online manual for lzip can be found here.

Download

The latest released version of lzip can be found here. You may also subscribe to Freecode and receive an email every time a new version is released.

Jonatan Yong kindly provided a Windows32 port of lzip that can also be downloaded from the Savannah download link just above.

How to Get Help

For general discussion of bugs in lzip the mailing list lzip-bug@nongnu.org is the most appropriate forum. Please send messages as plain text. Please do not send messages encoded as HTML nor encoded as base64 MIME nor included as multiple formats. Please include a descriptive subject line. If all of the subject are "bug in lzip" it is impossible to differentiate them.

An archive of the bug report mailing list is available at http://lists.nongnu.org/mailman/listinfo/lzip-bug.

How to Help

To contact the author, either to report a bug or to contribute fixes or improvements, send mail to lzip-bug@nongnu.org. Please send messages as plain text. If posting patches they should be in unified diff format against the latest version. They should include a text description.

See also the lzip project page at Savannah.

Links

Atool - A command line archive manager that understands lzip files.

GNU Automake - A Makefile generator able to create lzip-compressed tarballs.

Documentation as an indicator of code quality - A different review of lzip.

Dragora GNU/Linux - A GNU/Linux distribution using lzip in its package system.

Easylzma - C library and tools for lzip and lzma-alone file formats.

File Roller - An archive manager for GNOME that understands lzip files.

Lesspipe.sh - View the contents of lzipped files with the pager less.

Littleutils - Convert your files to lzip format.

Man-db - An implementation of the Unix man command able to read lzipped pages.

GNU Midnight Commander - A visual file manager that understands lzip files.

RPM - RPM Package Manager that uses lzip to compress packages.

GNU Tar, Star - Automatically create and extract lzip-compressed tar archives.

GNU Texinfo - The GNU Documentation System understands lzip-compressed manuals.

Z - A simple, safe and convenient front-end for bzip2, gzip and lzip.

Download lzip for AIX, ALT Linux, Amiga, Arch Linux, Debian, Exherbo, Fedora, FreeBSD, Gentoo, HP-UX, Mac (fink), Mac (softpedia), NetBSD, NixOS, OS/2, Slackware, Solaris (OpenCSW), Ubuntu, Windows (Cygwin).

Licensing

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

Valid HTML 4.01!


Copyright © 2013 Antonio Diaz Diaz
Lzip logo Copyright © 2013 Sonia Diaz Pacheco

You are free to copy, modify and distribute all or part of this article without limitation.

Updated: 2013/06/19