Comprehensive guides to document, image, and data file formats
Formats for documents, reports, and printable content.
PDF, short for Portable Document Format, is a file format developed to present documents consistently across every device, operating system, and software application.
DOCX is the default document format for Microsoft Word and the most widely used word-processing file type in the world.
XLSX is the default spreadsheet format for Microsoft Excel and the most widely adopted file type for tabular data, calculations, and data analysis in business, science, and finance.
PPTX is the default presentation format for Microsoft PowerPoint and the dominant file type for slide-based visual communication in business, education, and public speaking.
ODT is the word-processing file format defined by the OpenDocument Format (ODF) standard, an open, vendor-neutral specification maintained by an OASIS technical committee and published as ISO/IEC 26300.
RTF, or Rich Text Format, is a document file format created by Microsoft that stores formatted text along with basic structural elements like tables, images, and font specifications using plain-text control codes.
EPUB is the most widely adopted open standard for reflowable digital books and publications.
Formats for photographs, graphics, icons, and illustrations.
JPEG (commonly written JPG because of the three-character file extension convention of early operating systems) is the most widely used lossy image compression format in the world.
PNG is a lossless raster image format designed as a patent-free replacement for GIF.
WebP is a modern image format developed by Google that provides both lossy and lossless compression in a single container, along with animation support and alpha-channel transparency.
SVG is an XML-based vector image format for two-dimensional graphics, developed and maintained by the World Wide Web Consortium (W3C).
GIF is a bitmap image format that has endured for nearly four decades primarily because of its animation capability.
TIFF is a flexible raster image format widely used in professional photography, publishing, medical imaging, geospatial analysis, and archival digitization.
HEIC (High Efficiency Image Container) is a modern image format based on the HEIF (High Efficiency Image File Format) standard that uses HEVC (H.265) compression.
BMP (Bitmap) is one of the oldest raster image formats still in use, developed by Microsoft for the Windows operating system.
ICO is the icon file format used by Microsoft Windows to store small images that represent applications, files, folders, and shortcuts in the operating system's graphical user interface.
PSD is the native file format of Adobe Photoshop, the world's most widely used professional image editing software.
EPS (Encapsulated PostScript) is a graphics file format based on the PostScript page description language, designed to encapsulate a single page of vector and/or raster artwork in a self-contained file that can be embedded within other documents.
Formats for structured data, configuration, and data exchange.
TXT, or plain text, is the most fundamental digital document format: a file containing nothing but a sequence of characters with no formatting, styling, or structural metadata.
CSV is the simplest and most universally supported format for tabular data exchange.
JSON (JavaScript Object Notation) is a lightweight data interchange format that has become the dominant standard for transmitting structured data between web servers and clients, between microservices, and between applications and their configuration files.
YAML is a human-friendly data serialization language designed to be more readable than JSON and less verbose than XML for configuration files, data exchange, and structured document authoring.
XML (Extensible Markup Language) is a markup language and data serialization format that provides a flexible, self-describing structure for representing hierarchical information.
HTML is the standard markup language for creating web pages and web applications.
Markdown is a lightweight markup language that uses simple, intuitive plain-text formatting syntax to create structured documents that can be converted to HTML and other formats.
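To make the difference between two of the data formats above concrete, here is a minimal sketch that writes the same records as CSV and as JSON using only the Python standard library. The sample records and the helper names `to_csv` and `to_json` are hypothetical, chosen purely for illustration.

```python
import csv
import io
import json

# Hypothetical sample records, used only for illustration.
records = [
    {"format": "CSV", "kind": "tabular"},
    {"format": "JSON", "kind": "hierarchical"},
]


def to_csv(rows):
    """Serialize a list of flat dicts as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(rows[0].keys()),
        lineterminator="\n",  # keep line endings predictable across platforms
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize the same rows as an indented JSON array of objects."""
    return json.dumps(rows, indent=2)


csv_text = to_csv(records)
json_text = to_json(records)

print(csv_text)
print(json_text)
```

CSV keeps only rows and columns, so every value comes back as a string when the file is read again; JSON preserves nesting and basic types, at the cost of repeating the field names in every object.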