Skip to main content
Glossary Term

Data compression

Lossless Compression - Lossless compression is the process of encoding information using fewer bits without losing any information. - Most real-world data exhibits statistical redundancy, which allows for lossless compression. - Run-length encoding is a basic example of lossless compression, where repeated strings of data are substituted with table entries. - The Lempel-Ziv (LZ) compression methods, such as LZW, are popular algorithms for lossless storage. - Probabilistic models, like prediction by partial matching and the Burrows-Wheeler transform, are used in modern lossless compressors. Lossy Compression - Lossy compression methods accept some loss of information to save storage space. - Perceptual differences in how humans perceive data are exploited in lossy compression schemes. - Transform coding, particularly the discrete cosine transform (DCT), is widely used in lossy compression. - Lossy compression is extensively used in multimedia formats for images, video, and audio. - Psychoacoustics and psychovisuals are used in lossy compression for sound, images, and video. Compression and Decompression Processes - Data compression is performed by an encoder, while decompression is performed by a decoder. - Compression reduces the resources required to store and transmit data. - The design of compression schemes involves trade-offs between compression degree, distortion introduced, and computational resources required. - Lossy compression methods use specialized techniques, such as psychoacoustics and speech coding, for specific types of data. - Lossy compression can cause generation loss. Theoretical Basis for Compression - Information theory and Shannon's source coding theorem provide the theoretical basis for compression. - Algorithmic information theory is used for lossless compression, while rate-distortion theory is used for lossy compression. - Claude Shannon is credited with creating the foundations of compression through his fundamental papers in the late 1940s and early 1950s. - Coding theory and statistics are also associated with compression. - Compression is based on the principles of reducing redundancy and exploiting statistical patterns in data. Compression Formats and Applications - DEFLATE is a popular compression format optimized for decompression speed and compression ratio. - LZW algorithm is widely used in compression systems, including GIF images and programs like PKZIP. - Arithmetic coding, a more modern technique, can achieve superior compression compared to other methods like Huffman coding. - Compression formats like HEVC, MPEG, and JPEG are used in multimedia applications for video and image compression. - Lossy compression is used in digital cameras, DVDs, Blu-ray, streaming video, and internet telephony.