Re-organizing json documentation. Needs more work.

author Peter Geoghegan <[email protected]>

Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)

committer Peter Geoghegan <[email protected]>

Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)
author Peter Geoghegan <[email protected]>
Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)
committer Peter Geoghegan <[email protected]>
Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)
diff --git a/doc/src/sgml/datatype.sgml b/doc/src/sgml/datatype.sgml

index 0e1865baedce593ab17d5dfc9cef687d030b9ece..cc458b4753f43554bfde8f50a406efb5582e45b2 100644 (file)
--- a/doc/src/sgml/datatype.sgml
+++ b/doc/src/sgml/datatype.sgml
@@ -4226,72 +4226,7 @@ SET xmloption TO { DOCUMENT | CONTENT };
     </sect2>
    </sect1>
  
-  <sect1 id="datatype-json">
-   <title><acronym>JSON</> Types</title>
-
-   <indexterm zone="datatype-json">
-    <primary>JSON</primary>
-   </indexterm>
-
-   <indexterm zone="datatype-json">
-    <primary>JSONB</primary>
-   </indexterm>
-
-   <para>
-    JSON data types are for storing JSON (JavaScript Object Notation)
-    data, as specified in
-    <ulink url="http://rfc7159.net/rfc7159">RFC 7159</ulink>. Such
-    data can also be stored as <type>text</type>, but the JSON data
-    types have the advantage of enforcing that each stored value is a
-    valid JSON value. There are also related support functions
-    available; see <xref linkend="functions-json">.
-   </para>
-
-   <para>
-    There are two JSON data types: <type>json</type> and
-    <type>jsonb</type>.  Both accept <emphasis>almost</emphasis>
-    identical sets of values as input. The difference is primarily a
-    matter of efficiency.  The <type>json</type> data type stores an
-    exact copy of the the input text, which processing functions must
-    continually reparse, while the <type>jsonb</type> is stored in a
-    decomposed binary format that makes it slightly less efficient to
-    input due to added serialization overhead, but significantly
-    faster to process, since it never needs reparsing.
-    <type>jsonb</type> also supports advanced <acronym>GiST</acronym>
-    and <acronym>GIN</acronym> indexing, which is a further
-    significant advantage.
-   </para>
-
-   <para>
-    The other difference between the types is that the
-    <type>json</type> type is guaranteed to contain an exact copy of
-    the input, including preservation of semantically insignificant
-    white space, and the order of keys within JSON objects. Also,
-    because the exact text is kept, if a JSON object within the value
-    contains the same key more than once, all the key/value pairs are
-    kept. In that case, the processing functions consider the last
-    value as the operative one. By contrast, <type>jsonb</type> does
-    not preserve white space, does not preserve the order of object
-    keys, and does not keep duplicate object keys. Only the last value
-    for a key specified in the input is kept.
-   </para>
-
-   <para>
-    In general, most applications will prefer to store JSON data as
-    <type>jsonb</type>, unless there are quite specialised needs.
-   </para>
-
-   <para>
-    <productname>PostgreSQL</productname> allows only one server encoding
-    per database.  It is therefore not possible for the JSON types to conform
-    rigidly to the specification unless the server encoding is UTF-8. Attempts
-    to directly include characters which cannot be represented in the server
-    encoding will fail; conversely, characters which can be represented in
-    the server encoding but not in UTF-8 will be allowed.
-    <literal>\uXXXX</literal> escapes are allowed regardless of the server
-    encoding, and are checked only for syntactic correctness.
-   </para>
-  </sect1>
+  &json;
  
    &array;
  
diff --git a/doc/src/sgml/filelist.sgml b/doc/src/sgml/filelist.sgml

index 0e863ee064e11f7fb40494b4576506e68068cab8..ecd8060a44118067233c527703267bbac83b2344 100644 (file)
--- a/doc/src/sgml/filelist.sgml
+++ b/doc/src/sgml/filelist.sgml
@@ -22,6 +22,7 @@
  <!ENTITY dml        SYSTEM "dml.sgml">
  <!ENTITY func       SYSTEM "func.sgml">
  <!ENTITY indices    SYSTEM "indices.sgml">
+<!ENTITY json       SYSTEM "json.sgml">
  <!ENTITY mvcc       SYSTEM "mvcc.sgml">
  <!ENTITY perform    SYSTEM "perform.sgml">
  <!ENTITY queries    SYSTEM "queries.sgml">
diff --git a/doc/src/sgml/func.sgml b/doc/src/sgml/func.sgml

index 7d0dd3e49f1d1bd5d28ddcd05a1840e1e81c0281..f136b154434ba48578f47fa4a7e9fc5bb28cde9a 100644 (file)
--- a/doc/src/sgml/func.sgml
+++ b/doc/src/sgml/func.sgml
@@ -10065,12 +10065,13 @@ table2-mapping
    </indexterm>
  
     <para>
-   <xref linkend="functions-json-op-table"> shows the operators that are
-   available for use with JSON (see <xref linkend="datatype-json">) data.
+   <xref linkend="functions-json-op-table"> shows the operators that
+   are available for use with the two JSON datatypes (see <xref
+   linkend="datatype-json">) data.
    </para>
  
    <table id="functions-json-op-table">
-     <title>JSON Operators</title>
+     <title><type>json</> and <type>jsonb</> Operators</title>
       <tgroup cols="4">
        <thead>
         <row>
@@ -10107,13 +10108,13 @@ table2-mapping
         </row>
         <row>
          <entry><literal>#&gt;</literal></entry>
-        <entry>array of text</entry>
+        <entry>text[]</entry>
          <entry>Get JSON object at specified path</entry>
          <entry><literal>'{"a":[1,2,3],"b":[4,5,6]}'::json#&gt;'{a,2}'</literal></entry>
         </row>
         <row>
          <entry><literal>#&gt;&gt;</literal></entry>
-        <entry>array of text</entry>
+        <entry>text[]</entry>
          <entry>Get JSON object at specified path as text</entry>
          <entry><literal>'{"a":[1,2,3],"b":[4,5,6]}'::json#&gt;&gt;'{a,2}'</literal></entry>
         </row>
@@ -10124,11 +10125,11 @@ table2-mapping
    <note>
     <para>
      There are parallel variants of these operators for both the
-    <type>json</type> type and the <type>jsonb</type> type.  In
-    addition to those operators common to both types, a further set of
-    operators exists for <type>jsonb</type>, listed below.  These
-    additional operators are principally intended to enable advanced
-    indexing, but have independent value.
+    <type>json</type> and <type>jsonb</type> types.  In addition to
+    those operators common to both types, a further set of operators
+    exists for <type>jsonb</type> (which comprise various
+    <acronym>GiST</acronym> and <acronym>GIN</acronym> operator
+    classes for indexing).
     </para>
    </note>
    <para>
@@ -10136,68 +10137,12 @@ table2-mapping
      The <type>jsonb</type> type uses a binary data representation,
      with <emphasis>nested</> types, and supports advanced
      <acronym>GiST</acronym> and <acronym>GIN</acronym> indexing.
-    Primitive types described by <acronym>RFC</> 7159 are effectively
-    mapped onto native <productname>PostgreSQL</productname> types.
-    Therefore, there are some very minor additional constraints on
-    what constitutes valid <type>jsonb</type> that do not apply to the
-    <type>json</type> type, or to JSON in the abstract, pertaining to
-    the upper limits on what can be represented by the underlying type
-    system.  These implementation-defined restrictions are permitted
-    by <acronym>RFC</> 7159.
-  </para>
-  <para>
-    Note that currently, the <type>jsonb</type>-only operators (which
-    comprise various <acronym>GiST</acronym> and
-    <acronym>GIN</acronym> operator classes for indexing) do not
-    permit searching for keys that are not at the least-nested level.
-    However, they can be used to efficiently search among more than
-    one possible key, or more than one possible key/value pair within
-    a single <type>jsonb</type> datum/document, among a large number
-    of such <quote>documents</> within a column in a table (i.e. among
-    many rows).
    </para>
    <para>
-    Comparisons and related operations are <emphasis>type-wise</>, in
-    that the underlying <productname>PostgreSQL</productname> datatype
-    comparators are invoked recursively, much like a traditional
-    composite type.
+   <xref linkend="json-indexing"> describes how these operators can be
+   used to effectively index <type>jsonb</>.
    </para>
  
-  <table id="functions-jsonb-type-mapping-table">
-     <title>PostgreSQL type to RFC-7159/JSON primitive type mapping</title>
-     <tgroup cols="3">
-      <thead>
-       <row>
-        <entry><productname>PostgreSQL</productname> type</entry>
-        <entry>RFC-7159/JSON primitive type</entry>
-        <entry>Notes</entry>
-       </row>
-      </thead>
-      <tbody>
-       <row>
-        <entry><literal>text</literal></entry>
-        <entry><literal>string</literal></entry>
-        <entry>See general notes on encoding and JSON below</entry>
-       </row>
-       <row>
-        <entry><literal>numeric</literal></entry>
-        <entry><literal>number</literal></entry>
-        <entry><literal>NaN</literal> and <literal>infinity</literal> values are specially disallowed</entry>
-       </row>
-       <row>
-        <entry><literal>boolean</literal></entry>
-        <entry><literal>boolean</literal></entry>
-        <entry>Only lowercase <literal>true</literal> and <literal>false</literal> values are accepted</entry>
-       </row>
-       <row>
-        <entry><literal>unknown</literal></entry>
-        <entry><literal>null</literal></entry>
-        <entry>SQL <literal>NULL</literal> is orthogonal to the <productname>PostgreSQL</productname> type system</entry>
-       </row>
-      </tbody>
-     </tgroup>
-   </table>
-
    <table id="functions-jsonb-op-table">
       <title>Additonal JSONB Operators</title>
       <tgroup cols="4">
@@ -10225,62 +10170,31 @@ table2-mapping
         <row>
          <entry><literal>&lt;@</literal></entry>
          <entry>jsonb</entry>
-        <entry>Does the jsonb have contained within it this jsonb object?</entry>
+        <entry>Does the jsonb have contained within it this jsonb?</entry>
          <entry><literal>'{"b":2}'::jsonb &lt;@ '{"a":1, "b":2}'::jsonb</literal></entry>
         </row>
         <row>
          <entry><literal>?</literal></entry>
          <entry>text</entry>
-        <entry>Does this key exist?</entry>
+        <entry>Does this key/element exist?</entry>
          <entry><literal>'{"a":1, "b":2}'::jsonb ? 'b'</literal></entry>
         </row>
         <row>
          <entry><literal>?|</literal></entry>
          <entry>text[]</entry>
-        <entry>Do any of these keys exist?</entry>
-        <entry><literal>'{"a":1, "b":2}'::jsonb ?| array['b', 'c']</literal></entry>
+        <entry>Do any of these keys/elements exist?</entry>
+        <entry><literal>'{"a":1, "b":2, "c":3}'::jsonb ?| array['b', 'c']</literal></entry>
         </row>
         <row>
          <entry><literal>?&amp;</literal></entry>
          <entry>text[]</entry>
-        <entry>Do all of these keys exist?</entry>
-        <entry><literal>'{"a":1, "b":2}'::jsonb ?&amp; array['a', 'b']</literal></entry>
+        <entry>Do all of these keys/elements exist?</entry>
+        <entry><literal>'["a", "b"]'::jsonb ?&amp; array['a', 'b']</literal></entry>
         </row>
        </tbody>
       </tgroup>
     </table>
  
-  <para>
-    <type>jsonb</> has GiST and GIN index support for the
-    <literal>@&gt;</>, <literal>?</>, <literal>?&amp;</> and
-    <literal>?|</> operators. For example:
-  </para>
-  <programlisting>
-CREATE INDEX hidx ON testhstore USING GIST (h);
-
-CREATE INDEX hidx ON testhstore USING GIN (h);
-  </programlisting>
-  <para>
-    The non-default GIN operator class <literal>jsonb_hash_ops</>
-    supports indexing the <literal>@&gt;</> operator only:
-  </para>
-  <programlisting>
-CREATE INDEX hidx ON testhstore USING GIN (h jsonb_hash_ops);
-  </programlisting>
-  <para>
-    Although only the <literal>@&gt;</> operator is made indexable, a
-    <literal>jsonb_hash_ops</literal> operator class GIN index has
-    some notable advantages over an equivalent GIN index of the
-    default GIN operator class for <type>jsonb</type>.  Search
-    operations typically perform appreciably better, and the on-disk
-    size of a <literal>jsonb_hash_ops</literal> operator class GIN
-    index can be much smaller.
-  </para>
-  <para>
-    <type>jsonb</> also supports <type>btree</> and <type>hash</>
-    indexes.
-  </para>
-
    <!--
       The release notes contain a reference to "functions-json-table". Since
       that table is now split in two, the id has been parked here so we don't
diff --git a/doc/src/sgml/json.sgml b/doc/src/sgml/json.sgml

new file mode 100644 (file)

index 0000000..26fa9ca
--- /dev/null
+++ b/doc/src/sgml/json.sgml
@@ -0,0 +1,238 @@
+<!-- doc/src/sgml/json.sgml -->
+
+<sect1 id="datatype-json">
+ <title><acronym>JSON</> Types</title>
+
+ <indexterm zone="datatype-json">
+  <primary>JSON</primary>
+ </indexterm>
+
+ <indexterm zone="datatype-json">
+  <primary>JSONB</primary>
+ </indexterm>
+
+ <para>
+  JSON data types are for storing JSON (JavaScript Object Notation)
+  data, as specified in <ulink url="http://rfc7159.net/rfc7159">RFC
+  7159</ulink>. Such data can also be stored as <type>text</type>, but
+  both JSON data types have the advantage of enforcing that each
+  stored value is a valid JSON value.  There are also related support
+  functions available; see <xref linkend="functions-json">.
+ </para>
+
+ <para>
+  There are two JSON data types: <type>json</> and <type>jsonb</>.
+  Both accept <emphasis>almost</emphasis> identical sets of values as
+  input. The difference is primarily a matter of efficiency.  The
+  <type>json</> data type stores an exact copy of the the input text,
+  which processing functions must continually reparse, while
+  <type>jsonb</> data is stored in a decomposed binary format that
+  makes it slightly less efficient to input due to added serialization
+  overhead, but significantly faster to process, since it never needs
+  reparsing.  <type>jsonb</> also supports advanced
+  <acronym>GiST</acronym> and <acronym>GIN</acronym> indexing, which
+  is a further significant advantage.
+ </para>
+
+ <para>
+  The other difference between the types is that the <type>json</>
+  type is guaranteed to contain an exact copy of the input, including
+  preservation of semantically insignificant white space, and the
+  order of keys within JSON objects (although <type>jsonb</> will
+  preserve trailing zeros within a JSON number). Also, because the
+  exact text is kept, if a JSON object within the value contains the
+  same key more than once, all the key/value pairs are kept.  In that
+  case, the processing functions consider the last value as the
+  operative one.  By contrast, <type>jsonb</> does not preserve white
+  space, does not preserve the order of object keys, and does not keep
+  duplicate object keys.  Only the last value for a key specified in
+  the input is kept.
+ </para>
+
+ <para>
+  In general, most applications will prefer to store JSON data as
+  <type>jsonb</>, unless there are quite specialized needs.
+ </para>
+
+ <para>
+  <productname>PostgreSQL</productname> allows only one server
+  encoding per database.  It is therefore not possible for the JSON
+  types to conform rigidly to the specification unless the server
+  encoding is UTF-8. Attempts to directly include characters which
+  cannot be represented in the server encoding will fail; conversely,
+  characters which can be represented in the server encoding but not
+  in UTF-8 will be allowed.  <literal>\uXXXX</literal> escapes are
+  allowed regardless of the server encoding, and are checked only for
+  syntactic correctness.
+ </para>
+
+ <sect2 id="json-types">
+  <title>Mapping RFC-7159/JSON primitive types to <productname>PostgreSQL</productname> types</title>
+  <para>
+    <type>jsonb</type> comparisons and related operations are
+    <emphasis>type-wise</>, in that the underlying
+    <productname>PostgreSQL</productname> datatype comparators are
+    invoked recursively, much like a traditional composite type.
+  </para>
+
+  <para>
+    Primitive types described by <acronym>RFC</> 7159 are effectively
+    mapped onto native <productname>PostgreSQL</productname> types.
+    Therefore, there are some very minor additional constraints on
+    what constitutes valid <type>jsonb</type> that do not apply to the
+    <type>json</type> type, or to JSON in the abstract, pertaining to
+    the upper limits on what can be represented by the underlying type
+    system.  These implementation-defined restrictions are permitted
+    by <acronym>RFC</> 7159.
+  </para>
+
+  <table id="json-type-mapping-table">
+     <title>PostgreSQL type to RFC-7159/JSON primitive type mapping</title>
+     <tgroup cols="3">
+      <thead>
+       <row>
+        <entry><productname>PostgreSQL</productname> type</entry>
+        <entry>RFC-7159/JSON primitive type</entry>
+        <entry>Notes</entry>
+       </row>
+      </thead>
+      <tbody>
+       <row>
+        <entry><literal>text</literal></entry>
+        <entry><literal>string</literal></entry>
+        <entry>See general introductory notes on encoding and JSON</entry>
+       </row>
+       <row>
+        <entry><literal>numeric</literal></entry>
+        <entry><literal>number</literal></entry>
+        <entry><literal>NaN</literal> and <literal>infinity</literal> values are specially disallowed</entry>
+       </row>
+       <row>
+        <entry><literal>boolean</literal></entry>
+        <entry><literal>boolean</literal></entry>
+        <entry>Only lowercase <literal>true</literal> and <literal>false</literal> values are accepted</entry>
+       </row>
+       <row>
+        <entry><literal>unknown</literal></entry>
+        <entry><literal>null</literal></entry>
+        <entry>SQL <literal>NULL</literal> is orthogonal to the <productname>PostgreSQL</productname> type system.  NULL semantics do not apply.</entry>
+       </row>
+      </tbody>
+     </tgroup>
+   </table>
+
+  <para>
+   The functions <literal>lower_inf</literal>
+   and <literal>upper_inf</literal> test for infinite lower
+   and upper bounds of a range, respectively.
+  </para>
+ </sect2>
+
+ <sect2 id="json-querying">
+  <title>Querying <type>jsonb</type> documents effectively</title>
+  <para>
+   Representing data as JSON can be considerably more flexible than
+   the traditional relational data model. It is quite possible for
+   both approaches to co-exist and complement each other within the
+   same application.  However, even for an application requiring
+   maximal flexility, it is still desirable for JSON documents to have
+   a somewhat fixed structure.  This structure is not typically
+   enforced (although enforcing some business rules declaratively is
+   possible), but makes it easier to write queries that usefully
+   summarize a set of documents in a table.
+  </para>
+  <para>
+   <type>jsonb</> data is subject to the same concurrency control
+   considerations as any other datatype when stored in a table.
+   Although storing very large documents is practicable, in order to
+   ensure correct behavior the lock manager naturally acquires
+   row-level locks as rows are updated.  Clearly <type>jsonb</>
+   documents ought to kept at a manageable size in order to decrease
+   lock contention among updating transactions.
+  </para>
+ </sect2>
+ <sect2 id="json-hybrid-document-relational">
+  <title>Combining relational and document models</title>
+  <para>
+   Make documents smaller, and join together tables with multiple
+   atomic <type>jsonb</> documents.  Materialize primary key values
+   for joins.
+  </para>
+ </sect2>
+ <sect2 id="json-indexing">
+  <title>Indexing</title>
+  <indexterm>
+    <primary>jsonb</primary>
+    <secondary>indexes on</secondary>
+  </indexterm>
+  <para>
+    Note that currently, the <type>jsonb</>-only operators do not
+    permit searching for keys or elements that are not at the
+    least-nested level (except in the limited sense that JSON object
+    key/value pairs can all be indexed at that same nesting level).
+    However, they can be used to efficiently search among more than
+    one possible key, or more than one possible key/value pair within
+    a single <type>jsonb</> datum/document, among a large number of
+    such <quote>documents</> within a column in a table (i.e. among
+    many rows).
+  </para>
+  <para>
+    <type>jsonb</> has GiST and GIN index support for the
+    <literal>@&gt;</>, <literal>?</>, <literal>?&amp;</> and
+    <literal>?|</> operators. For example:
+  </para>
+  <programlisting>
+CREATE INDEX hidx ON testhstore USING GIST (h);
+
+CREATE INDEX hidx ON testhstore USING GIN (h);
+  </programlisting>
+  <para>
+    The non-default GIN operator class <literal>jsonb_hash_ops</>
+    supports indexing the <literal>@&gt;</> operator only:
+  </para>
+  <programlisting>
+CREATE INDEX hidx ON testhstore USING GIN (h jsonb_hash_ops);
+  </programlisting>
+  <para>
+    Although only the <literal>@&gt;</> operator is made indexable, a
+    <literal>jsonb_hash_ops</literal> operator class GIN index has
+    some notable advantages over an equivalent GIN index of the
+    default GIN operator class for <type>jsonb</type>.  Search
+    operations typically perform appreciably better, and the on-disk
+    size of a <literal>jsonb_hash_ops</literal> operator class GIN
+    index can be much smaller.
+  </para>
+  <para>
+   A GiST or GIN index can accelerate queries involving these range operators:
+   <literal>=</>,
+   <literal>@&gt;</>
+   <literal>?</>
+   <literal>?|</>
+   <literal>?&amp;</>
+   (see <xref linkend="functions-jsonb-op-table"> for more information).
+  </para>
+
+  <para>
+    <type>jsonb</> also supports <type>btree</> and <type>hash</>
+    indexes.  Ordering between <type>jsonb</> datums is:
+
+    <synopsis>
+      <replaceable>Object</replaceable> > <replaceable>Array</replaceable> > <replaceable>Boolean</replaceable> > <replaceable>Number</replaceable> > <replaceable>String</replaceable> > <literal>Null</literal>
+
+      <literal>true</literal> > <literal>false</literal>
+
+      <replaceable>Object with n keys</replaceable> > <replaceable>object with n - 1 keys</replaceable>
+
+      <replaceable>Array with n elements</replaceable> > <replaceable>array with n - 1 elements</replaceable>
+
+      Objects with equal numbers of keys compared key-1 value-1 key-2
+      value-2 ...
+
+      Arrays with equal numbers of elements compared element-1,
+      element-2 ...
+
+      Strings compared lexically (always with default database collation).
+    </synopsis>
+  </para>
+ </sect2>
+</sect1>
author	Peter Geoghegan <[email protected]>
	Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)
committer	Peter Geoghegan <[email protected]>
	Sun, 9 Mar 2014 10:19:46 +0000 (03:19 -0700)
doc/src/sgml/datatype.sgml		patch \| blob \| blame \| history
doc/src/sgml/filelist.sgml		patch \| blob \| blame \| history
doc/src/sgml/func.sgml		patch \| blob \| blame \| history
doc/src/sgml/json.sgml	[new file with mode: 0644]	patch \| blob