1
0
mirror of https://github.com/MariaDB/server.git synced 2025-07-27 18:02:13 +03:00

MDEV-33734 Improve the sequence increment inequality testing

We add an extra condition that makes the inequality testing in
SEQUENCE::increment_value() mathematically watertight, and we cast to
and from unsigned in potential underflow and overflow addition and
subtractions to avoid undefined behaviour.

Let's start by distinguishing between c++ expressions and mathematical
expressions. by c++ expression I mean an expression with the outcome
determined by the compiler/runtime. by mathematical expression I mean
an expression whose value is mathematically determined. So a c++
expression -9223372036854775806 - 1000 at worst can evaluate to any
value due to underflow. A mathematical expression -9223372036854775806
- 1000 evaluates to -9223372036854776806.

The problem boils down to how to write a c++ expression equivalent to
an mathematical expression x + y < z where x and z can take any values
of long long int, and y < 0 is also a long long int. Ideally we want
to avoid underflow, but I'm not sure how this can be done.

The correct c++ form should be (x + y < z || x < z - y || x < z).
Let M=9223372036854775808 i.e. LONGLONG_MAX + 1. We have

-M < x < M - 1
-M < y < 0
-M < z < M - 1

Let's consider the case where x + y < z is true as a mathematical
expression.

If the first disjunct underflows, i.e. the mathematical expression x
+ y < -M. If the arbitrary value resulting from the underflow causes
the c++ expression to hold too, then we are done. Otherwise we move
onto the next expression x < z - y. If there's no overflow in z
- y then we are done. If there's overflow i.e. z - y > M - 1,
and the c++ expression evals to false, then we are onto x < z.
There's no over or underflow here, and it will eval to true. To see
this, note that

x + y < -M means x < -M - y < -M - (-M) = 0
z - y > M - 1 means z > y + M - 1 > - M + M - 1 = -1
so x < z.

Now let's consider the case where x + y < z is false as a mathematical
expression.

The first disjunct will not underflow in this case, so we move to (x <
z - y). This will not overflow. To see this, note that

x + y >= z means z - y <= x < M - 1

So it evals to false too. And the third disjunct x < z also evals to
false because x >= z - y > z.

I suspect that in either case the expression x < z does not determine
the final value of the disjunction in the vast majority cases, which
is why we leave it as the final one in case of the rare cases of both
an underflow and an overflow happening.

Here's an example of both underflow and overflow happening and the
added inequality x < z saves the day:

x = - M / 2
y = - M / 2 - 1
z = M / 2

x + y evals to M - 1 which is > z
z - y evals to - M + 1 which is < x

We can do the same to test x + y > z where the increment y is positive:

(x > z - y || x + y > z || x > z)

And the same analysis applies to unsigned cases.
This commit is contained in:
Yuchen Pei
2024-03-26 14:37:17 +11:00
parent 9b02b7c77a
commit 7bec41d25d
6 changed files with 51 additions and 29 deletions

View File

@ -743,8 +743,11 @@ void sequence_definition::adjust_values(longlong next_value)
global_system_variables.auto_increment_increment);
/*
Ensure that next_free_value has the right offset, so that we
can generate a serie by just adding real_increment.
Ensure that next_free_value has the right offset, so that we can
generate a serie by just adding real_increment. The goal is to
adjust next_free_value upwards such that
next_free_value % real_increment == offset
*/
off= next_free_value % real_increment;
if (off < 0)
@ -760,15 +763,18 @@ void sequence_definition::adjust_values(longlong next_value)
need to cast to_add.
*/
if ((is_unsigned &&
(ulonglong) next_free_value > (ulonglong) max_value - to_add) ||
(is_unsigned &&
(ulonglong) next_free_value + to_add > (ulonglong) max_value) ||
(!is_unsigned && next_free_value > max_value - to_add) ||
(!is_unsigned && next_free_value + to_add > max_value))
((ulonglong) next_free_value > (ulonglong) max_value - to_add ||
(ulonglong) next_free_value + to_add > (ulonglong) max_value ||
(ulonglong) next_free_value > (ulonglong) max_value)) ||
(!is_unsigned &&
(next_free_value > (longlong) ((ulonglong) max_value - to_add) ||
(longlong) ((ulonglong) next_free_value + to_add) > max_value ||
next_free_value > max_value)))
next_free_value= max_value+1;
else
{
next_free_value+= to_add;
next_free_value=
(longlong) ((ulonglong) next_free_value + (ulonglong) to_add);
if (is_unsigned)
DBUG_ASSERT((ulonglong) next_free_value % real_increment ==
(ulonglong) offset);
@ -898,7 +904,7 @@ longlong SEQUENCE::next_value(TABLE *table, bool second_round, int *error)
next_free_value= increment_value(next_free_value, real_increment);
if (within_bound(res_value, reserved_until, reserved_until,
real_increment > 0))
real_increment > 0))
{
write_unlock(table);
DBUG_RETURN(res_value);