0% found this document useful (0 votes)
35 views

Common Document File Formats: de Facto

There are many incompatible document file formats. Some open standards for document formats include DocBook, XHTML, OpenDocument, and Office Open XML. In 1993, the ITU-T tried to establish a single standard called Open Document Architecture, but it did not succeed. PDF and HTML have become de facto standards for documents that users can view but not edit. The proprietary .doc format from Microsoft has also become widespread, though it is not fully supported by all word processors.

Uploaded by

Thái Là Thái
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views

Common Document File Formats: de Facto

There are many incompatible document file formats. Some open standards for document formats include DocBook, XHTML, OpenDocument, and Office Open XML. In 1993, the ITU-T tried to establish a single standard called Open Document Architecture, but it did not succeed. PDF and HTML have become de facto standards for documents that users can view but not edit. The proprietary .doc format from Microsoft has also become widespread, though it is not fully supported by all word processors.

Uploaded by

Thái Là Thái
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

A document file format is a text or binary file format for storing documents on a storage media, especially

for use by computers. There currently exist a multitude of incompatible document file formats.

Examples of XML-based open standards are DocBook, XHTML, and, more recently,
the ISO/IEC standards OpenDocument (ISO 26300:2006) and Office Open XML (ISO 29500:2008).

In 1993, the ITU-T tried to establish a standard for document file formats, known as the Open Document
Architecture (ODA) which was supposed to replace all competing document file formats. It is described in
ITU-T documents T.411 through T.421, which are equivalent to ISO 8613. It did not succeed.

Page description languages such as PostScript and PDF have become the de facto standard for
documents that a typical user should only be able to create and read, not edit. In 2001, a series
of ISO/IEC standards for PDF began to be published, including the specification for PDF itself, ISO-32000.

HTML is the most used and open international standard and it is also used as document file format. It has
also become ISO/IEC standard (ISO 15445:2000).

The default binary file format used by Microsoft Word (.doc) has become widespread de facto standard for
office documents, but it is a proprietary format and is not always fully supported by other word processors.

Common document file formats[edit]


 ASCII, UTF-8 — plain text formats
 Amigaguide
 .doc for Microsoft Word — Structural binary format developed by Microsoft (specifications
available since 2008 under the Open Specification Promise)[1][2]
 DjVu — file format designed primarily to store scanned documents[3]
 DocBook — an XML format for technical documentation
 HTML (.html, .htm), (open standard, ISO from 2000), in combination with possible image
files referred to.
 FictionBook (.fb2) — open XML-based e-book format
 Markdown (.md) — markup language for creating formatted text using plain text
 Office Open XML — .docx (XML-based standard for office documents)
 OpenDocument — .odt (XML-based standard for office documents)
 OpenOffice.org XML — .sxw (open, XML-based format for office documents)
 OXPS — Open XML Paper Specification (Windows 8.1 and above, older version is XPS used
in Windows 7)
 PalmDoc — handheld document format
 .pages for Pages
 PDF — Open standard for document exchange. ISO standards
include PDF/X (eXchange), PDF/A (Archive), PDF/E (Engineering), ISO
32000 (PDF), PDF/UA (Accessibility) and PDF/VT (Variable data and transactional printing).
PDF is readable on almost every platform with free or open source readers. Open source PDF
creators are also available.
 PostScript — .ps
 Rich Text Format (RTF) — meta data format being developed by Microsoft since 1987 for
Microsoft products and cross-platform document interchange[4][5][6][7][8]
 SYmbolic LinK (SYLK)
 Scalable Vector Graphics (SVG) - Graphics format primarily for vector-based images.
 TeX — Open-source typesetting program and format. First successful mathematical notation
language.
 TEI — XML format for digital publication
 Troff
 Uniform Office Format — Chinese standard
 WordPerfect (.wpd, .wp, .wp7, .doc) (Note: possible confusion with Word format extension)

You might also like