mirror of
https://github.com/postgres/postgres.git
synced 2025-12-22 17:42:17 +03:00
Implement genuine serializable isolation level.
Until now, our Serializable mode has in fact been what's called Snapshot Isolation, which allows some anomalies that could not occur in any serialized ordering of the transactions. This patch fixes that using a method called Serializable Snapshot Isolation, based on research papers by Michael J. Cahill (see README-SSI for full references). In Serializable Snapshot Isolation, transactions run like they do in Snapshot Isolation, but a predicate lock manager observes the reads and writes performed and aborts transactions if it detects that an anomaly might occur. This method produces some false positives, ie. it sometimes aborts transactions even though there is no anomaly. To track reads we implement predicate locking, see storage/lmgr/predicate.c. Whenever a tuple is read, a predicate lock is acquired on the tuple. Shared memory is finite, so when a transaction takes many tuple-level locks on a page, the locks are promoted to a single page-level lock, and further to a single relation level lock if necessary. To lock key values with no matching tuple, a sequential scan always takes a relation-level lock, and an index scan acquires a page-level lock that covers the search key, whether or not there are any matching keys at the moment. A predicate lock doesn't conflict with any regular locks or with another predicate locks in the normal sense. They're only used by the predicate lock manager to detect the danger of anomalies. Only serializable transactions participate in predicate locking, so there should be no extra overhead for for other transactions. Predicate locks can't be released at commit, but must be remembered until all the transactions that overlapped with it have completed. That means that we need to remember an unbounded amount of predicate locks, so we apply a lossy but conservative method of tracking locks for committed transactions. If we run short of shared memory, we overflow to a new "pg_serial" SLRU pool. We don't currently allow Serializable transactions in Hot Standby mode. That would be hard, because even read-only transactions can cause anomalies that wouldn't otherwise occur. Serializable isolation mode now means the new fully serializable level. Repeatable Read gives you the old Snapshot Isolation level that we have always had. Kevin Grittner and Dan Ports, reviewed by Jeff Davis, Heikki Linnakangas and Anssi Kääriäinen
This commit is contained in:
@@ -27,6 +27,7 @@ BEGIN [ WORK | TRANSACTION ] [ <replaceable class="parameter">transaction_mode</
|
||||
|
||||
ISOLATION LEVEL { SERIALIZABLE | REPEATABLE READ | READ COMMITTED | READ UNCOMMITTED }
|
||||
READ WRITE | READ ONLY
|
||||
[ NOT ] DEFERRABLE
|
||||
</synopsis>
|
||||
</refsynopsisdiv>
|
||||
|
||||
@@ -57,7 +58,7 @@ BEGIN [ WORK | TRANSACTION ] [ <replaceable class="parameter">transaction_mode</
|
||||
</para>
|
||||
|
||||
<para>
|
||||
If the isolation level or read/write mode is specified, the new
|
||||
If the isolation level, read/write mode, or deferrable mode is specified, the new
|
||||
transaction has those characteristics, as if
|
||||
<xref linkend="sql-set-transaction">
|
||||
was executed.
|
||||
@@ -135,6 +136,12 @@ BEGIN;
|
||||
contains additional compatibility information.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
The <literal>DEFERRABLE</literal>
|
||||
<replaceable class="parameter">transaction_mode</replaceable>
|
||||
is a <productname>PostgreSQL</productname> language extension.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
Incidentally, the <literal>BEGIN</literal> key word is used for a
|
||||
different purpose in embedded SQL. You are advised to be careful
|
||||
|
||||
@@ -67,10 +67,12 @@ LOCK [ TABLE ] [ ONLY ] <replaceable class="PARAMETER">name</replaceable> [, ...
|
||||
</para>
|
||||
|
||||
<para>
|
||||
To achieve a similar effect when running a transaction at the Serializable
|
||||
To achieve a similar effect when running a transaction at the
|
||||
<literal>REPEATABLE READ</> or <literal>SERIALIZABLE</>
|
||||
isolation level, you have to execute the <command>LOCK TABLE</> statement
|
||||
before executing any <command>SELECT</> or data modification statement.
|
||||
A serializable transaction's view of data will be frozen when its first
|
||||
A <literal>REPEATABLE READ</> or <literal>SERIALIZABLE</> transaction's
|
||||
view of data will be frozen when its first
|
||||
<command>SELECT</> or data modification statement begins. A <command>LOCK
|
||||
TABLE</> later in the transaction will still prevent concurrent writes
|
||||
— but it won't ensure that what the transaction reads corresponds to
|
||||
|
||||
@@ -646,6 +646,41 @@ PostgreSQL documentation
|
||||
</listitem>
|
||||
</varlistentry>
|
||||
|
||||
<varlistentry>
|
||||
<term><option>--serializable-deferrable</option></term>
|
||||
<listitem>
|
||||
<para>
|
||||
Use a <literal>serializable</literal> transaction for the dump, to
|
||||
ensure that the snapshot used is consistent with later database
|
||||
states; but do this by waiting for a point in the transaction stream
|
||||
at which no anomalies can be present, so that there isn't a risk of
|
||||
the dump failing or causing other transactions to roll back with a
|
||||
<literal>serialization_failure</literal>. See <xref linkend="mvcc">
|
||||
for more information about transaction isolation and concurrency
|
||||
control.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
This option is not beneficial for a dump which is intended only for
|
||||
disaster recovery. It could be useful for a dump used to load a
|
||||
copy of the database for reporting or other read-only load sharing
|
||||
while the original database continues to be updated. Without it the
|
||||
dump may reflect a state which is not consistent with any serial
|
||||
execution of the transactions eventually committed. For example, if
|
||||
batch processing techniques are used, a batch may show as closed in
|
||||
the dump without all of the items which are in the batch appearing.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
This option will make no difference if there are no read-write
|
||||
transactions active when pg_dump is started. If read-write
|
||||
transactions are active, the start of the dump may be delayed for an
|
||||
indeterminate length of time. Once running, performance with or
|
||||
without the switch is the same.
|
||||
</para>
|
||||
</listitem>
|
||||
</varlistentry>
|
||||
|
||||
<varlistentry>
|
||||
<term><option>--no-tablespaces</option></term>
|
||||
<listitem>
|
||||
|
||||
@@ -1144,7 +1144,7 @@ FOR SHARE [ OF <replaceable class="parameter">table_name</replaceable> [, ...] ]
|
||||
has already locked a selected row or rows, <command>SELECT FOR
|
||||
UPDATE</command> will wait for the other transaction to complete,
|
||||
and will then lock and return the updated row (or no row, if the
|
||||
row was deleted). Within a <literal>SERIALIZABLE</> transaction,
|
||||
row was deleted). Within a <literal>REPEATABLE READ</> or <literal>SERIALIZABLE</> transaction,
|
||||
however, an error will be thrown if a row to be locked has changed
|
||||
since the transaction started. For further discussion see <xref
|
||||
linkend="mvcc">.
|
||||
|
||||
@@ -15,6 +15,21 @@
|
||||
<primary>SET TRANSACTION</primary>
|
||||
</indexterm>
|
||||
|
||||
<indexterm>
|
||||
<primary>transaction isolation level</primary>
|
||||
<secondary>setting</secondary>
|
||||
</indexterm>
|
||||
|
||||
<indexterm>
|
||||
<primary>read-only transaction</primary>
|
||||
<secondary>setting</secondary>
|
||||
</indexterm>
|
||||
|
||||
<indexterm>
|
||||
<primary>deferrable transaction</primary>
|
||||
<secondary>setting</secondary>
|
||||
</indexterm>
|
||||
|
||||
<refsynopsisdiv>
|
||||
<synopsis>
|
||||
SET TRANSACTION <replaceable class="parameter">transaction_mode</replaceable> [, ...]
|
||||
@@ -24,6 +39,7 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
|
||||
ISOLATION LEVEL { SERIALIZABLE | REPEATABLE READ | READ COMMITTED | READ UNCOMMITTED }
|
||||
READ WRITE | READ ONLY
|
||||
[ NOT ] DEFERRABLE
|
||||
</synopsis>
|
||||
</refsynopsisdiv>
|
||||
|
||||
@@ -42,8 +58,8 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
|
||||
<para>
|
||||
The available transaction characteristics are the transaction
|
||||
isolation level and the transaction access mode (read/write or
|
||||
read-only).
|
||||
isolation level, the transaction access mode (read/write or
|
||||
read-only), and the deferrable mode.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
@@ -62,7 +78,7 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
</varlistentry>
|
||||
|
||||
<varlistentry>
|
||||
<term><literal>SERIALIZABLE</literal></term>
|
||||
<term><literal>REPEATABLE READ</literal></term>
|
||||
<listitem>
|
||||
<para>
|
||||
All statements of the current transaction can only see rows committed
|
||||
@@ -71,14 +87,27 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
</para>
|
||||
</listitem>
|
||||
</varlistentry>
|
||||
|
||||
<varlistentry>
|
||||
<term><literal>SERIALIZABLE</literal></term>
|
||||
<listitem>
|
||||
<para>
|
||||
All statements of the current transaction can only see rows committed
|
||||
before the first query or data-modification statement was executed in
|
||||
this transaction. If a pattern of reads and writes among concurrent
|
||||
serializable transactions would create a situation which could not
|
||||
have occurred for any serial (one-at-a-time) execution of those
|
||||
transactions, one of them will be rolled back with a
|
||||
<literal>serialization_failure</literal> <literal>SQLSTATE</literal>.
|
||||
</para>
|
||||
</listitem>
|
||||
</varlistentry>
|
||||
</variablelist>
|
||||
|
||||
The SQL standard defines two additional levels, <literal>READ
|
||||
UNCOMMITTED</literal> and <literal>REPEATABLE READ</literal>.
|
||||
The SQL standard defines one additional level, <literal>READ
|
||||
UNCOMMITTED</literal>.
|
||||
In <productname>PostgreSQL</productname> <literal>READ
|
||||
UNCOMMITTED</literal> is treated as
|
||||
<literal>READ COMMITTED</literal>, while <literal>REPEATABLE
|
||||
READ</literal> is treated as <literal>SERIALIZABLE</literal>.
|
||||
UNCOMMITTED</literal> is treated as <literal>READ COMMITTED</literal>.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
@@ -127,8 +156,9 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
|
||||
<para>
|
||||
The session default transaction modes can also be set by setting the
|
||||
configuration parameters <xref linkend="guc-default-transaction-isolation">
|
||||
and <xref linkend="guc-default-transaction-read-only">.
|
||||
configuration parameters <xref linkend="guc-default-transaction-isolation">,
|
||||
<xref linkend="guc-default-transaction-read-only">, and
|
||||
<xref linkend="guc-default-transaction-deferrable">.
|
||||
(In fact <command>SET SESSION CHARACTERISTICS</command> is just a
|
||||
verbose equivalent for setting these variables with <command>SET</>.)
|
||||
This means the defaults can be set in the configuration file, via
|
||||
@@ -146,9 +176,7 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
isolation level in the standard. In
|
||||
<productname>PostgreSQL</productname> the default is ordinarily
|
||||
<literal>READ COMMITTED</literal>, but you can change it as
|
||||
mentioned above. Because of lack of predicate locking, the
|
||||
<literal>SERIALIZABLE</literal> level is not truly
|
||||
serializable. See <xref linkend="mvcc"> for details.
|
||||
mentioned above.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
@@ -158,6 +186,12 @@ SET SESSION CHARACTERISTICS AS TRANSACTION <replaceable class="parameter">transa
|
||||
not implemented in the <productname>PostgreSQL</productname> server.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
The <literal>DEFERRABLE</literal>
|
||||
<replaceable class="parameter">transaction_mode</replaceable>
|
||||
is a <productname>PostgreSQL</productname> language extension.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
The SQL standard requires commas between successive <replaceable
|
||||
class="parameter">transaction_modes</replaceable>, but for historical
|
||||
|
||||
@@ -27,6 +27,7 @@ START TRANSACTION [ <replaceable class="parameter">transaction_mode</replaceable
|
||||
|
||||
ISOLATION LEVEL { SERIALIZABLE | REPEATABLE READ | READ COMMITTED | READ UNCOMMITTED }
|
||||
READ WRITE | READ ONLY
|
||||
[ NOT ] DEFERRABLE
|
||||
</synopsis>
|
||||
</refsynopsisdiv>
|
||||
|
||||
@@ -34,8 +35,8 @@ START TRANSACTION [ <replaceable class="parameter">transaction_mode</replaceable
|
||||
<title>Description</title>
|
||||
|
||||
<para>
|
||||
This command begins a new transaction block. If the isolation level or
|
||||
read/write mode is specified, the new transaction has those
|
||||
This command begins a new transaction block. If the isolation level,
|
||||
read/write mode, or deferrable mode is specified, the new transaction has those
|
||||
characteristics, as if <xref linkend="sql-set-transaction"> was executed. This is the same
|
||||
as the <xref linkend="sql-begin"> command.
|
||||
</para>
|
||||
@@ -64,6 +65,12 @@ START TRANSACTION [ <replaceable class="parameter">transaction_mode</replaceable
|
||||
as a convenience.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
The <literal>DEFERRABLE</literal>
|
||||
<replaceable class="parameter">transaction_mode</replaceable>
|
||||
is a <productname>PostgreSQL</productname> language extension.
|
||||
</para>
|
||||
|
||||
<para>
|
||||
The SQL standard requires commas between successive <replaceable
|
||||
class="parameter">transaction_modes</replaceable>, but for historical
|
||||
|
||||
Reference in New Issue
Block a user