1
0
mirror of https://github.com/postgres/postgres.git synced 2025-07-28 23:42:10 +03:00

Have text search thesaurus files use "?" for stop words.

Throw an error for actual stop words, rather than a warning.  This fixes
problems with cache reloading causing warning messages.

Re-enable stop words in regression tests;  was disabled by Tom.

Document "?" as API change.
This commit is contained in:
Bruce Momjian
2007-11-10 15:39:34 +00:00
parent 82748bc253
commit d009992ba3
4 changed files with 56 additions and 51 deletions

View File

@ -1,4 +1,4 @@
<!-- $PostgreSQL: pgsql/doc/src/sgml/textsearch.sgml,v 1.30 2007/11/05 15:55:53 mha Exp $ -->
<!-- $PostgreSQL: pgsql/doc/src/sgml/textsearch.sgml,v 1.31 2007/11/10 15:39:34 momjian Exp $ -->
<chapter id="textsearch">
<title id="textsearch-title">Full Text Search</title>
@ -2258,20 +2258,17 @@ more sample word(s) : more indexed word(s)
</para>
<para>
Stop words recognized by the subdictionary are replaced by a <quote>stop
word placeholder</quote> to record their position. To illustrate this,
consider these phrases:
Specific stop words recognized by the subdictionary cannot be
specified; instead use <literal>?</> to mark the location where any
stop word can appear. For example, assuming that <literal>a</> and
<literal>the</> are stop words according to the subdictionary:
<programlisting>
a one the two : swsw
the one a two : swsw2
? one ? two : swsw
</programlisting>
Assuming that <literal>a</> and <literal>the</> are stop words according
to the subdictionary, these two phrases are identical to the thesaurus:
they both look like <replaceable>stopword</> <literal>one</>
<replaceable>stopword</> <literal>two</>. Input matching this pattern
will be replaced by <literal>swsw2</>, according to the tie-breaking rule.
matches <literal>a one the two</> and <literal>the one a two</>;
both would be replaced by <literal>swsw</>.
</para>
<para>
@ -3576,6 +3573,12 @@ Parser: "pg_catalog.default"
</para>
</listitem>
<listitem>
<para>
Thesaurus files now use <literal>?</> for stop words.
</para>
</listitem>
<listitem>
<para>
What else?