1
0
mirror of https://github.com/postgres/postgres.git synced 2025-10-25 13:17:41 +03:00

Clarify type resolution behavior for domain types.

The user documentation was vague and not entirely accurate about how
we treat domain inputs for ambiguous operators/functions.  Clarify
that, and add an example and some commentary.  Per a recent question
from Adam Mackler.

It's acted like this ever since we added domains, so back-patch
to all supported branches.
This commit is contained in:
Tom Lane
2014-08-10 16:13:25 -04:00
parent bbe826f21b
commit 83412c7b9f

View File

@@ -185,7 +185,9 @@ That is, if a query is well-formed and the types already match, then the query s
without spending extra time in the parser and without introducing unnecessary implicit conversion without spending extra time in the parser and without introducing unnecessary implicit conversion
calls in the query. calls in the query.
</para> </para>
</listitem>
<listitem>
<para> <para>
Additionally, if a query usually requires an implicit conversion for a function, and Additionally, if a query usually requires an implicit conversion for a function, and
if then the user defines a new function with the correct argument types, the parser if then the user defines a new function with the correct argument types, the parser
@@ -209,7 +211,7 @@ should use this new function and no longer do implicit conversion to use the old
The specific operator that is referenced by an operator expression The specific operator that is referenced by an operator expression
is determined using the following procedure. is determined using the following procedure.
Note that this procedure is indirectly affected Note that this procedure is indirectly affected
by the precedence of the involved operators, since that will determine by the precedence of the operators involved, since that will determine
which sub-expressions are taken to be the inputs of which operators. which sub-expressions are taken to be the inputs of which operators.
See <xref linkend="sql-precedence"> for more information. See <xref linkend="sql-precedence"> for more information.
</para> </para>
@@ -217,7 +219,7 @@ should use this new function and no longer do implicit conversion to use the old
<procedure> <procedure>
<title>Operator Type Resolution</title> <title>Operator Type Resolution</title>
<step performance="required"> <step id="op-resol-select" performance="required">
<para> <para>
Select the operators to be considered from the Select the operators to be considered from the
<classname>pg_operator</classname> system catalog. If a non-schema-qualified <classname>pg_operator</classname> system catalog. If a non-schema-qualified
@@ -240,7 +242,7 @@ search path position.
</substeps> </substeps>
</step> </step>
<step performance="required"> <step id="op-resol-exact-match" performance="required">
<para> <para>
Check for an operator accepting exactly the input argument types. Check for an operator accepting exactly the input argument types.
If one exists (there can be only one exact match in the set of If one exists (there can be only one exact match in the set of
@@ -248,7 +250,7 @@ operators considered), use it.
</para> </para>
<substeps> <substeps>
<step performance="optional"> <step id="op-resol-exact-unknown" performance="optional">
<para> <para>
If one argument of a binary operator invocation is of the <type>unknown</type> type, If one argument of a binary operator invocation is of the <type>unknown</type> type,
then assume it is the same type as the other argument for this check. then assume it is the same type as the other argument for this check.
@@ -256,10 +258,17 @@ Invocations involving two <type>unknown</type> inputs, or a unary operator
with an <type>unknown</type> input, will never find a match at this step. with an <type>unknown</type> input, will never find a match at this step.
</para> </para>
</step> </step>
<step id="op-resol-exact-domain" performance="optional">
<para>
If one argument of a binary operator invocation is of the <type>unknown</type>
type and the other is of a domain type, next check to see if there is an
operator accepting exactly the domain's base type on both sides; if so, use it.
</para>
</step>
</substeps> </substeps>
</step> </step>
<step performance="required"> <step id="op-resol-best-match" performance="required">
<para> <para>
Look for the best match. Look for the best match.
</para> </para>
@@ -275,9 +284,15 @@ candidate remains, use it; else continue to the next step.
</step> </step>
<step performance="required"> <step performance="required">
<para> <para>
If any input argument is of a domain type, treat it as being of the
domain's base type for all subsequent steps. This ensures that domains
act like their base types for purposes of ambiguous-operator resolution.
</para>
</step>
<step performance="required">
<para>
Run through all candidates and keep those with the most exact matches Run through all candidates and keep those with the most exact matches
on input types. (Domains are considered the same as their base type on input types. Keep all candidates if none have exact matches.
for this purpose.) Keep all candidates if none have exact matches.
If only one candidate remains, use it; else continue to the next step. If only one candidate remains, use it; else continue to the next step.
</para> </para>
</step> </step>
@@ -376,7 +391,7 @@ be interpreted as type <type>text</type>.
</para> </para>
<para> <para>
Here is a concatenation on unspecified types: Here is a concatenation of two values of unspecified types:
<screen> <screen>
SELECT 'abc' || 'def' AS "unspecified"; SELECT 'abc' || 'def' AS "unspecified";
@@ -394,7 +409,7 @@ and finds that there are candidates accepting both string-category and
bit-string-category inputs. Since string category is preferred when available, bit-string-category inputs. Since string category is preferred when available,
that category is selected, and then the that category is selected, and then the
preferred type for strings, <type>text</type>, is used as the specific preferred type for strings, <type>text</type>, is used as the specific
type to resolve the unknown literals as. type to resolve the unknown-type literals as.
</para> </para>
</example> </example>
@@ -450,6 +465,45 @@ SELECT ~ CAST('20' AS int8) AS "negation";
</para> </para>
</example> </example>
<example>
<title>Custom Operator on a Domain Type</title>
<para>
Users sometimes try to declare operators applying just to a domain type.
This is possible but is not nearly as useful as it might seem, because the
operator resolution rules are designed to select operators applying to the
domain's base type. As an example consider
<screen>
CREATE DOMAIN mytext AS text CHECK(...);
CREATE FUNCTION mytext_eq_text (mytext, text) RETURNS boolean AS ...;
CREATE OPERATOR = (procedure=mytext_eq_text, leftarg=mytext, rightarg=text);
CREATE TABLE mytable (val mytext);
SELECT * FROM mytable WHERE val = 'foo';
</screen>
This query will not use the custom operator. The parser will first see if
there is a <type>mytext</> <literal>=</> <type>mytext</> operator
(<xref linkend="op-resol-exact-unknown">), which there is not;
then it will consider the domain's base type <type>text</>, and see if
there is a <type>text</> <literal>=</> <type>text</> operator
(<xref linkend="op-resol-exact-domain">), which there is;
so it resolves the <type>unknown</>-type literal as <type>text</> and
uses the <type>text</> <literal>=</> <type>text</> operator.
The only way to get the custom operator to be used is to explicitly cast
the literal:
<screen>
SELECT * FROM mytable WHERE val = text 'foo';
</screen>
so that the <type>mytext</> <literal>=</> <type>text</> operator is found
immediately according to the exact-match rule. If the best-match rules
are reached, they actively discriminate against operators on domain types.
If they did not, such an operator would create too many ambiguous-operator
failures, because the casting rules always consider a domain as castable
to or from its base type, and so the domain operator would be considered
usable in all the same cases as a similarly-named operator on the base type.
</para>
</example>
</sect1> </sect1>
<sect1 id="typeconv-func"> <sect1 id="typeconv-func">
@@ -565,9 +619,15 @@ candidate remains, use it; else continue to the next step.
</step> </step>
<step performance="required"> <step performance="required">
<para> <para>
If any input argument is of a domain type, treat it as being of the
domain's base type for all subsequent steps. This ensures that domains
act like their base types for purposes of ambiguous-function resolution.
</para>
</step>
<step performance="required">
<para>
Run through all candidates and keep those with the most exact matches Run through all candidates and keep those with the most exact matches
on input types. (Domains are considered the same as their base type on input types. Keep all candidates if none have exact matches.
for this purpose.) Keep all candidates if none have exact matches.
If only one candidate remains, use it; else continue to the next step. If only one candidate remains, use it; else continue to the next step.
</para> </para>
</step> </step>
@@ -858,8 +918,23 @@ and Related Constructs</title>
<step performance="required"> <step performance="required">
<para> <para>
If all inputs are of the same type, and it is not <type>unknown</type>, If all inputs are of the same type, and it is not <type>unknown</type>,
resolve as that type. Otherwise, replace any domain types in the list with resolve as that type.
their underlying base types. </para>
</step>
<step performance="required">
<para>
If any input is of a domain type, treat it as being of the
domain's base type for all subsequent steps.
<footnote>
<para>
Somewhat like the treatment of domain inputs for operators and
functions, this behavior allows a domain type to be preserved through
a <literal>UNION</> or similar construct, so long as the user is
careful to ensure that all inputs are implicitly or explicitly of that
exact type. Otherwise the domain's base type will be preferred.
</para>
</footnote>
</para> </para>
</step> </step>