xref: /freebsd/usr.bin/compress/compress.1 (revision 4e9e907d636ef7f776d89d98bd778f740ee69cdd)
19b50d902SRodney W. Grimes.\" Copyright (c) 1986, 1990, 1993
29b50d902SRodney W. Grimes.\"	The Regents of the University of California.  All rights reserved.
39b50d902SRodney W. Grimes.\"
49b50d902SRodney W. Grimes.\" This code is derived from software contributed to Berkeley by
59b50d902SRodney W. Grimes.\" James A. Woods, derived from original work by Spencer Thomas
69b50d902SRodney W. Grimes.\" and Joseph Orost.
79b50d902SRodney W. Grimes.\"
89b50d902SRodney W. Grimes.\" Redistribution and use in source and binary forms, with or without
99b50d902SRodney W. Grimes.\" modification, are permitted provided that the following conditions
109b50d902SRodney W. Grimes.\" are met:
119b50d902SRodney W. Grimes.\" 1. Redistributions of source code must retain the above copyright
129b50d902SRodney W. Grimes.\"    notice, this list of conditions and the following disclaimer.
139b50d902SRodney W. Grimes.\" 2. Redistributions in binary form must reproduce the above copyright
149b50d902SRodney W. Grimes.\"    notice, this list of conditions and the following disclaimer in the
159b50d902SRodney W. Grimes.\"    documentation and/or other materials provided with the distribution.
169b50d902SRodney W. Grimes.\" 3. All advertising materials mentioning features or use of this software
179b50d902SRodney W. Grimes.\"    must display the following acknowledgement:
189b50d902SRodney W. Grimes.\"	This product includes software developed by the University of
199b50d902SRodney W. Grimes.\"	California, Berkeley and its contributors.
209b50d902SRodney W. Grimes.\" 4. Neither the name of the University nor the names of its contributors
219b50d902SRodney W. Grimes.\"    may be used to endorse or promote products derived from this software
229b50d902SRodney W. Grimes.\"    without specific prior written permission.
239b50d902SRodney W. Grimes.\"
249b50d902SRodney W. Grimes.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
259b50d902SRodney W. Grimes.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
269b50d902SRodney W. Grimes.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
279b50d902SRodney W. Grimes.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
289b50d902SRodney W. Grimes.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
299b50d902SRodney W. Grimes.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
309b50d902SRodney W. Grimes.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
319b50d902SRodney W. Grimes.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
329b50d902SRodney W. Grimes.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
339b50d902SRodney W. Grimes.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
349b50d902SRodney W. Grimes.\" SUCH DAMAGE.
359b50d902SRodney W. Grimes.\"
369b50d902SRodney W. Grimes.\"     @(#)compress.1	8.2 (Berkeley) 4/18/94
37c3aac50fSPeter Wemm.\" $FreeBSD$
389b50d902SRodney W. Grimes.\"
39f5ba2b90STim J. Robbins.Dd May 17, 2002
409b50d902SRodney W. Grimes.Dt COMPRESS 1
4162500372SRuslan Ermilov.Os
429b50d902SRodney W. Grimes.Sh NAME
439b50d902SRodney W. Grimes.Nm compress ,
4451f98e58SRuslan Ermilov.Nm uncompress
459b50d902SRodney W. Grimes.Nd compress and expand data
469b50d902SRodney W. Grimes.Sh SYNOPSIS
478fe908efSRuslan Ermilov.Nm
4825b20fc0SGary W. Swearingen.Op Fl fv
499b50d902SRodney W. Grimes.Op Fl b Ar bits
509b50d902SRodney W. Grimes.Op Ar
5125b20fc0SGary W. Swearingen.Nm
5225b20fc0SGary W. Swearingen.Fl c
5325b20fc0SGary W. Swearingen.Op Fl b Ar bits
5425b20fc0SGary W. Swearingen.Op Ar file
559b50d902SRodney W. Grimes.Nm uncompress
5625b20fc0SGary W. Swearingen.Op Fl f
579b50d902SRodney W. Grimes.Op Ar
5825b20fc0SGary W. Swearingen.Nm uncompress
5925b20fc0SGary W. Swearingen.Fl c
6025b20fc0SGary W. Swearingen.Op Ar file
619b50d902SRodney W. Grimes.Sh DESCRIPTION
623898680cSPhilippe CharnierThe
633898680cSPhilippe Charnier.Nm
6425b20fc0SGary W. Swearingenutility reduces the size of files using adaptive Lempel-Ziv coding.
659b50d902SRodney W. GrimesEach
669b50d902SRodney W. Grimes.Ar file
679b50d902SRodney W. Grimesis renamed to the same name plus the extension
684e9e907dSRuslan Ermilov.Pa .Z .
6925b20fc0SGary W. SwearingenA
7025b20fc0SGary W. Swearingen.Ar file
7125b20fc0SGary W. Swearingenargument with a
724e9e907dSRuslan Ermilov.Pa .Z
7325b20fc0SGary W. Swearingenextension will be ignored except it will cause an
7425b20fc0SGary W. Swearingenerror exit after other arguments are processed.
759b50d902SRodney W. GrimesIf compression would not reduce the size of a
769b50d902SRodney W. Grimes.Ar file ,
779b50d902SRodney W. Grimesthe file is ignored.
789b50d902SRodney W. Grimes.Pp
793898680cSPhilippe CharnierThe
803898680cSPhilippe Charnier.Nm uncompress
8125b20fc0SGary W. Swearingenutility restores compressed files to their original form, renaming the
829b50d902SRodney W. Grimesfiles by deleting the
834e9e907dSRuslan Ermilov.Pa .Z
8425b20fc0SGary W. Swearingenextensions.
8525b20fc0SGary W. SwearingenA file specification need not include the file's
864e9e907dSRuslan Ermilov.Pa .Z
879b50d902SRodney W. Grimesextension.
8825b20fc0SGary W. SwearingenIf a file's name in its file system does not have a
894e9e907dSRuslan Ermilov.Pa .Z
9025b20fc0SGary W. Swearingenextension, it will not be uncompressed and it will cause
9125b20fc0SGary W. Swearingenan error exit after other arguments are processed.
929b50d902SRodney W. Grimes.Pp
939b50d902SRodney W. GrimesIf renaming the files would cause files to be overwritten and the standard
949b50d902SRodney W. Grimesinput device is a terminal, the user is prompted (on the standard error
959b50d902SRodney W. Grimesoutput) for confirmation.
969b50d902SRodney W. GrimesIf prompting is not possible or confirmation is not received, the files
979b50d902SRodney W. Grimesare not overwritten.
989b50d902SRodney W. Grimes.Pp
9925b20fc0SGary W. SwearingenAs many of the modification time, access time, file flags, file mode,
10025b20fc0SGary W. Swearingenuser ID, and group ID as allowed by permissions are retained in the
10125b20fc0SGary W. Swearingennew file.
10225b20fc0SGary W. Swearingen.Pp
103b4771590STim J. RobbinsIf no files are specified or a
104b4771590STim J. Robbins.Ar file
105b4771590STim J. Robbinsargument is a single dash
1063971fc8cSRuslan Ermilov.Pq Sq Fl ,
107b4771590STim J. Robbinsthe standard input is compressed or uncompressed to the standard output.
1089b50d902SRodney W. GrimesIf either the input and output files are not regular files, the checks for
1099b50d902SRodney W. Grimesreduction in size and file overwriting are not performed, the input file is
11025b20fc0SGary W. Swearingennot removed, and the attributes of the input file are not retained
11125b20fc0SGary W. Swearingenin the output file.
1129b50d902SRodney W. Grimes.Pp
1139b50d902SRodney W. GrimesThe options are as follows:
11425b20fc0SGary W. Swearingen.Bl -tag -width ".Fl b Ar bits"
11525b20fc0SGary W. Swearingen.It Fl b Ar bits
11625b20fc0SGary W. SwearingenThe code size (see below) is limited to
11725b20fc0SGary W. Swearingen.Ar bits ,
11825b20fc0SGary W. Swearingenwhich must be in the range 9..16.
11925b20fc0SGary W. SwearingenThe default is 16.
1209b50d902SRodney W. Grimes.It Fl c
1219b50d902SRodney W. GrimesCompressed or uncompressed output is written to the standard output.
1229b50d902SRodney W. GrimesNo files are modified.
12325b20fc0SGary W. SwearingenThe
12425b20fc0SGary W. Swearingen.Fl v
12525b20fc0SGary W. Swearingenoption is ignored.
12625b20fc0SGary W. SwearingenCompression is attempted even if the results will be larger than the
12725b20fc0SGary W. Swearingenoriginal.
1289b50d902SRodney W. Grimes.It Fl f
12925b20fc0SGary W. SwearingenFiles are overwritten without prompting for confirmation.
13025b20fc0SGary W. SwearingenAlso, for
13125b20fc0SGary W. Swearingen.Nm compress ,
13225b20fc0SGary W. Swearingenfiles are compressed even if they are not actually reduced in size.
1339b50d902SRodney W. Grimes.It Fl v
1349b50d902SRodney W. GrimesPrint the percentage reduction of each file.
13525b20fc0SGary W. SwearingenIgnored by
13625b20fc0SGary W. Swearingen.Nm uncompress
13725b20fc0SGary W. Swearingenor if the
13825b20fc0SGary W. Swearingen.Fl c
13925b20fc0SGary W. Swearingenoption is also used.
1409b50d902SRodney W. Grimes.El
1419b50d902SRodney W. Grimes.Pp
1423898680cSPhilippe CharnierThe
1433898680cSPhilippe Charnier.Nm
1443898680cSPhilippe Charnierutility uses a modified Lempel-Ziv algorithm.
1459b50d902SRodney W. GrimesCommon substrings in the file are first replaced by 9-bit codes 257 and up.
1469b50d902SRodney W. GrimesWhen code 512 is reached, the algorithm switches to 10-bit codes and
1479b50d902SRodney W. Grimescontinues to use more bits until the
1489b50d902SRodney W. Grimeslimit specified by the
1499b50d902SRodney W. Grimes.Fl b
15025b20fc0SGary W. Swearingenoption or its default is reached.
1519b50d902SRodney W. Grimes.Pp
15225b20fc0SGary W. SwearingenAfter the limit is reached,
153fae643c5SPhilippe Charnier.Nm
1549b50d902SRodney W. Grimesperiodically checks the compression ratio.
1559b50d902SRodney W. GrimesIf it is increasing,
156fae643c5SPhilippe Charnier.Nm
1579b50d902SRodney W. Grimescontinues to use the existing code dictionary.
1589b50d902SRodney W. GrimesHowever, if the compression ratio decreases,
159fae643c5SPhilippe Charnier.Nm
1606a3e8b0aSRuslan Ermilovdiscards the table of substrings and rebuilds it from scratch.
1616a3e8b0aSRuslan ErmilovThis allows
1629b50d902SRodney W. Grimesthe algorithm to adapt to the next "block" of the file.
1639b50d902SRodney W. Grimes.Pp
1649b50d902SRodney W. GrimesThe
1659b50d902SRodney W. Grimes.Fl b
16625b20fc0SGary W. Swearingenoption is unavailable for
16725537080SPhilippe Charnier.Nm uncompress
1689b50d902SRodney W. Grimessince the
1699b50d902SRodney W. Grimes.Ar bits
1709b50d902SRodney W. Grimesparameter specified during compression
1719b50d902SRodney W. Grimesis encoded within the output, along with
1729b50d902SRodney W. Grimesa magic number to ensure that neither decompression of random data nor
1739b50d902SRodney W. Grimesrecompression of compressed data is attempted.
1749b50d902SRodney W. Grimes.Pp
1759b50d902SRodney W. GrimesThe amount of compression obtained depends on the size of the
1769b50d902SRodney W. Grimesinput, the number of
1779b50d902SRodney W. Grimes.Ar bits
1789b50d902SRodney W. Grimesper code, and the distribution of common substrings.
1799b50d902SRodney W. GrimesTypically, text such as source code or English is reduced by 50\-60%.
1809b50d902SRodney W. GrimesCompression is generally much better than that achieved by Huffman
1819b50d902SRodney W. Grimescoding (as used in the historical command pack), or adaptive Huffman
1829b50d902SRodney W. Grimescoding (as used in the historical command compact), and takes less
1839b50d902SRodney W. Grimestime to compute.
184a866e170SRuslan Ermilov.Sh EXIT STATUS
1853e4d070bSTom Rhodes.Ex -std compress uncompress
1866c1b63bbSTim J. Robbins.Pp
1876c1b63bbSTim J. RobbinsThe
1886c1b63bbSTim J. Robbins.Nm compress
18925b20fc0SGary W. Swearingenutility exits 2 if attempting to compress a file would not reduce its size
1906c1b63bbSTim J. Robbinsand the
1916c1b63bbSTim J. Robbins.Fl f
19225b20fc0SGary W. Swearingenoption was not specified and if no other error occurs.
1939b50d902SRodney W. Grimes.Sh SEE ALSO
1943e4d070bSTom Rhodes.Xr gunzip 1 ,
1953e4d070bSTom Rhodes.Xr gzexe 1 ,
1963e4d070bSTom Rhodes.Xr gzip 1 ,
1973e4d070bSTom Rhodes.Xr zcat 1 ,
1983e4d070bSTom Rhodes.Xr zmore 1 ,
1993e4d070bSTom Rhodes.Xr znew 1
2009b50d902SRodney W. Grimes.Rs
2019b50d902SRodney W. Grimes.%A Welch, Terry A.
2029b50d902SRodney W. Grimes.%D June, 1984
2039b50d902SRodney W. Grimes.%T "A Technique for High Performance Data Compression"
2049b50d902SRodney W. Grimes.%J "IEEE Computer"
2059b50d902SRodney W. Grimes.%V 17:6
2069b50d902SRodney W. Grimes.%P pp. 8-19
2079b50d902SRodney W. Grimes.Re
208f5ba2b90STim J. Robbins.Sh STANDARDS
209f5ba2b90STim J. RobbinsThe
210f5ba2b90STim J. Robbins.Nm compress
211f5ba2b90STim J. Robbinsand
212f5ba2b90STim J. Robbins.Nm uncompress
213f5ba2b90STim J. Robbinsutilities conform to
214f5ba2b90STim J. Robbins.St -p1003.1-2001 .
2159b50d902SRodney W. Grimes.Sh HISTORY
2169b50d902SRodney W. GrimesThe
2179b50d902SRodney W. Grimes.Nm
2189b50d902SRodney W. Grimescommand appeared in
2199b50d902SRodney W. Grimes.Bx 4.3 .
22025b20fc0SGary W. Swearingen.Sh BUGS
22125b20fc0SGary W. SwearingenSome of these might be considered otherwise-undocumented features.
22225b20fc0SGary W. Swearingen.Pp
22325b20fc0SGary W. Swearingen.Nm compress :
22425b20fc0SGary W. SwearingenIf the utility does not compress a file because doing so would not
2254e9e907dSRuslan Ermilovreduce its size, and a file of the same name except with an
2264e9e907dSRuslan Ermilov.Pa .Z
22725b20fc0SGary W. Swearingenextension exists, the named file is not really ignored as stated above;
22825b20fc0SGary W. Swearingenit causes a prompt to confirm the overwriting of the file with the extension.
22925b20fc0SGary W. SwearingenIf the operation is confirmed, that file is deleted.
23025b20fc0SGary W. Swearingen.Pp
23125b20fc0SGary W. Swearingen.Nm uncompress :
23225b20fc0SGary W. SwearingenIf an empty file is compressed (using
23325b20fc0SGary W. Swearingen.Fl f ) ,
23425b20fc0SGary W. Swearingenthe resulting
2354e9e907dSRuslan Ermilov.Pa .Z
23625b20fc0SGary W. Swearingenfile is also empty.
23725b20fc0SGary W. SwearingenThat seems right, but if
23825b20fc0SGary W. Swearingen.Nm uncompress
23925b20fc0SGary W. Swearingenis then used on that file, an error will occur.
24025b20fc0SGary W. Swearingen.Pp
24125b20fc0SGary W. SwearingenBoth utilities: If a
24225b20fc0SGary W. Swearingen.Sq Fl
24325b20fc0SGary W. Swearingenargument is used and the utility prompts the user, the standard input
24425b20fc0SGary W. Swearingenis taken as the user's reply to the prompt.
24525b20fc0SGary W. Swearingen.Pp
24625b20fc0SGary W. SwearingenBoth utilities:
24725b20fc0SGary W. SwearingenIf the specified file does not exist, but a similarly-named one with (for
24825b20fc0SGary W. Swearingen.Nm compress )
24925b20fc0SGary W. Swearingenor without (for
25025b20fc0SGary W. Swearingen.Nm uncompress )
25125b20fc0SGary W. Swearingena
2524e9e907dSRuslan Ermilov.Pa .Z
25325b20fc0SGary W. Swearingenextension does exist, the utility will waste the user's time by not
25425b20fc0SGary W. Swearingenimmediately emitting an error message about the missing file and
25525b20fc0SGary W. Swearingencontinuing.
25625b20fc0SGary W. SwearingenInstead, it first asks for confirmation to overwrite
2574e9e907dSRuslan Ermilovthe existing file and then does not overwrite it.
258