Glossary Term
Data compression
Lossless Compression
- Lossless compression is the process of encoding information using fewer bits without losing any information.
- Most real-world data exhibits statistical redundancy, which allows for lossless compression.
- Run-length encoding is a basic example of lossless compression, where repeated strings of data are substituted with table entries.
- The Lempel-Ziv (LZ) compression methods, such as LZW, are popular algorithms for lossless storage.
- Probabilistic models, like prediction by partial matching and the Burrows-Wheeler transform, are used in modern lossless compressors.
Lossy Compression
- Lossy compression methods accept some loss of information to save storage space.
- Perceptual differences in how humans perceive data are exploited in lossy compression schemes.
- Transform coding, particularly the discrete cosine transform (DCT), is widely used in lossy compression.
- Lossy compression is extensively used in multimedia formats for images, video, and audio.
- Psychoacoustics and psychovisuals are used in lossy compression for sound, images, and video.
Compression and Decompression Processes
- Data compression is performed by an encoder, while decompression is performed by a decoder.
- Compression reduces the resources required to store and transmit data.
- The design of compression schemes involves trade-offs between compression degree, distortion introduced, and computational resources required.
- Lossy compression methods use specialized techniques, such as psychoacoustics and speech coding, for specific types of data.
- Lossy compression can cause generation loss.
Theoretical Basis for Compression
- Information theory and Shannon's source coding theorem provide the theoretical basis for compression.
- Algorithmic information theory is used for lossless compression, while rate-distortion theory is used for lossy compression.
- Claude Shannon is credited with creating the foundations of compression through his fundamental papers in the late 1940s and early 1950s.
- Coding theory and statistics are also associated with compression.
- Compression is based on the principles of reducing redundancy and exploiting statistical patterns in data.
Compression Formats and Applications
- DEFLATE is a popular compression format optimized for decompression speed and compression ratio.
- LZW algorithm is widely used in compression systems, including GIF images and programs like PKZIP.
- Arithmetic coding, a more modern technique, can achieve superior compression compared to other methods like Huffman coding.
- Compression formats like HEVC, MPEG, and JPEG are used in multimedia applications for video and image compression.
- Lossy compression is used in digital cameras, DVDs, Blu-ray, streaming video, and internet telephony.