postgres

mirror of https://github.com/postgres/postgres.git synced 2025-08-31 17:02:12 +03:00

Files

Tom Lane 0d32342501 Teach the regular expression functions to do case-insensitive matching and

locale-dependent character classification properly when the database encoding
is UTF8.

The previous coding worked okay in single-byte encodings, or in any case for
ASCII characters, but failed entirely on multibyte characters.  The fix
assumes that the <wctype.h> functions use Unicode code points as the wchar
representation for Unicode, ie, wchar matches pg_wchar.

This is only a partial solution, since we're still stupid about non-ASCII
characters in multibyte encodings other than UTF8.  The practical effect
of that is limited, however, since those cases are generally Far Eastern
glyphs for which concepts like case-folding don't apply anyway.  Certainly
all or nearly all of the field reports of problems have been about UTF8.
A more general solution would require switching to the platform's wchar
representation for all regex operations; which is possible but would have
substantial disadvantages.  Let's try this and see if it's sufficient in
practice.

2009-12-01 21:00:24 +00:00

Replace regular expression package with Henry Spencer's latest version

2003-02-05 17:41:33 +00:00

Makefile

Refactor backend makefiles to remove lots of duplicate code

2008-02-19 10:30:09 +00:00

re_syntax.n

Replace regular expression package with Henry Spencer's latest version

2003-02-05 17:41:33 +00:00

regc_color.c

Sync our regex code with upstream changes since last time we did this, which

2008-02-14 17:33:37 +00:00