- published: 11 Sep 2009
- views: 53699
An archive format is the file format of an archive file. The archive format is determined by the file archiver. Some archive formats are well-defined by their authors and have become conventions supported by multiple vendors and/or open-source communities.
Archive formats support features such as file concatenation, data compression, encryption, file spanning, parity/Cyclic redundancy check, checksum, self-extraction, self-installation, volume and directory structure information, package notes/description, and other meta-data.
Note: a comprehensive List of archive formats and Comparison of archive formats is available.
Unix operating systems utilize the tar file format, ar, and shar to concatenate files. These archive formats can then be compressed into gzip format.
On Windows platforms, the most widely-used archive format is ZIP; other formats are CAB, RAR (file format), and ACE. Windows Installer is a high-level archive format for distribution of software.
A file format is a particular way that information is encoded for storage in a computer file, as files need a way to be represented as bits when stored on a disc drive or other digital storage medium. File formats can be divided into two types: proprietary and open formats.[contradictory]
Some file formats are designed for very particular types of data: PNG files, for example, store bitmapped images using lossless data compression. Other file formats, however, are designed for storage of several different types of data: the Ogg format can act as a container for different types of multimedia, including any combination of audio and video, with or without text (such as subtitles), and metadata. A text file can contain any stream of characters, including possible control characters, and is encoded in one of various character encoding schemes. Some file formats, such as HTML, scalable vector graphics, and the source code of computer software are text files with defined syntaxes that allow them to be used for specific purposes.
An archive file is a file that is composed of one or more files along with metadata that can include source volume and medium information, file directory structure, error detection and recovery information, file comments, and usually employs some form of lossless compression. Archive files may be encrypted in part or as a whole. Archive files are used to collect multiple data files together into a single file for easier portability and storage, or simply to compress files to use less storage space.
Computer archive files are created by file archiver software, optical disc authoring software, or disk image software that uses an archive format determined by that software. The file extension or file header of the archive file are indicators of the file format used.
Archive files are sometimes accompanied by separate parity archive (PAR) files that allow for additional error detection and recovery, particularly in recovery of missing files in a multi-file archive.
In operating systems that distinguish different file types by filename extensions, archive extensions include .zip, .rar, .7z, and .tar, (of these examples, .tar is the only non-compressed format). Java also introduced a whole family of archive extensions such as .jar and .war ('j' is was Java and 'w' is for web). They are used to exchange entire byte-code deployment. Sometimes they are also used to exchange source codes and other text/html/xml files. By default they are all compressed.