mirror of
https://github.com/postgres/postgres.git
synced 2025-09-02 04:21:28 +03:00
Teach planner and executor about monotonic window funcs
Window functions such as row_number() always return a value higher than the previously returned value for tuples in any given window partition. Traditionally queries such as; SELECT * FROM ( SELECT *, row_number() over (order by c) rn FROM t ) t WHERE rn <= 10; were executed fairly inefficiently. Neither the query planner nor the executor knew that once rn made it to 11 that nothing further would match the outer query's WHERE clause. It would blindly continue until all tuples were exhausted from the subquery. Here we implement means to make the above execute more efficiently. This is done by way of adding a pg_proc.prosupport function to various of the built-in window functions and adding supporting code to allow the support function to inform the planner if the window function is monotonically increasing, monotonically decreasing, both or neither. The planner is then able to make use of that information and possibly allow the executor to short-circuit execution by way of adding a "run condition" to the WindowAgg to allow it to determine if some of its execution work can be skipped. This "run condition" is not like a normal filter. These run conditions are only built using quals comparing values to monotonic window functions. For monotonic increasing functions, quals making use of the btree operators for <, <= and = can be used (assuming the window function column is on the left). You can see here that once such a condition becomes false that a monotonic increasing function could never make it subsequently true again. For monotonically decreasing functions the >, >= and = btree operators for the given type can be used for run conditions. The best-case situation for this is when there is a single WindowAgg node without a PARTITION BY clause. Here when the run condition becomes false the WindowAgg node can simply return NULL. No more tuples will ever match the run condition. It's a little more complex when there is a PARTITION BY clause. In this case, we cannot return NULL as we must still process other partitions. To speed this case up we pull tuples from the outer plan to check if they're from the same partition and simply discard them if they are. When we find a tuple belonging to another partition we start processing as normal again until the run condition becomes false or we run out of tuples to process. When there are multiple WindowAgg nodes to evaluate then this complicates the situation. For intermediate WindowAggs we must ensure we always return all tuples to the calling node. Any filtering done could lead to incorrect results in WindowAgg nodes above. For all intermediate nodes, we can still save some work when the run condition becomes false. We've no need to evaluate the WindowFuncs anymore. Other WindowAgg nodes cannot reference the value of these and these tuples will not appear in the final result anyway. The savings here are small in comparison to what can be saved in the top-level WingowAgg, but still worthwhile. Intermediate WindowAgg nodes never filter out tuples, but here we change WindowAgg so that the top-level WindowAgg filters out tuples that don't match the intermediate WindowAgg node's run condition. Such filters appear in the "Filter" clause in EXPLAIN for the top-level WindowAgg node. Here we add prosupport functions to allow the above to work for; row_number(), rank(), dense_rank(), count(*) and count(expr). It appears technically possible to do the same for min() and max(), however, it seems unlikely to be useful enough, so that's not done here. Bump catversion Author: David Rowley Reviewed-by: Andy Fan, Zhihong Yu Discussion: https://postgr.es/m/CAApHDvqvp3At8++yF8ij06sdcoo1S_b2YoaT9D4Nf+MObzsrLQ@mail.gmail.com
This commit is contained in:
@@ -1988,6 +1988,14 @@ ExplainNode(PlanState *planstate, List *ancestors,
|
||||
show_instrumentation_count("Rows Removed by Filter", 1,
|
||||
planstate, es);
|
||||
break;
|
||||
case T_WindowAgg:
|
||||
show_upper_qual(plan->qual, "Filter", planstate, ancestors, es);
|
||||
if (plan->qual)
|
||||
show_instrumentation_count("Rows Removed by Filter", 1,
|
||||
planstate, es);
|
||||
show_upper_qual(((WindowAgg *) plan)->runConditionOrig,
|
||||
"Run Condition", planstate, ancestors, es);
|
||||
break;
|
||||
case T_Group:
|
||||
show_group_keys(castNode(GroupState, planstate), ancestors, es);
|
||||
show_upper_qual(plan->qual, "Filter", planstate, ancestors, es);
|
||||
|
@@ -1248,6 +1248,20 @@ spool_tuples(WindowAggState *winstate, int64 pos)
|
||||
if (winstate->partition_spooled)
|
||||
return; /* whole partition done already */
|
||||
|
||||
/*
|
||||
* When in pass-through mode we can just exhaust all tuples in the current
|
||||
* partition. We don't need these tuples for any further window function
|
||||
* evaluation, however, we do need to keep them around if we're not the
|
||||
* top-level window as another WindowAgg node above must see these.
|
||||
*/
|
||||
if (winstate->status != WINDOWAGG_RUN)
|
||||
{
|
||||
Assert(winstate->status == WINDOWAGG_PASSTHROUGH ||
|
||||
winstate->status == WINDOWAGG_PASSTHROUGH_STRICT);
|
||||
|
||||
pos = -1;
|
||||
}
|
||||
|
||||
/*
|
||||
* If the tuplestore has spilled to disk, alternate reading and writing
|
||||
* becomes quite expensive due to frequent buffer flushes. It's cheaper
|
||||
@@ -1256,7 +1270,7 @@ spool_tuples(WindowAggState *winstate, int64 pos)
|
||||
* XXX this is a horrid kluge --- it'd be better to fix the performance
|
||||
* problem inside tuplestore. FIXME
|
||||
*/
|
||||
if (!tuplestore_in_memory(winstate->buffer))
|
||||
else if (!tuplestore_in_memory(winstate->buffer))
|
||||
pos = -1;
|
||||
|
||||
outerPlan = outerPlanState(winstate);
|
||||
@@ -1295,9 +1309,16 @@ spool_tuples(WindowAggState *winstate, int64 pos)
|
||||
}
|
||||
}
|
||||
|
||||
/* Still in partition, so save it into the tuplestore */
|
||||
tuplestore_puttupleslot(winstate->buffer, outerslot);
|
||||
winstate->spooled_rows++;
|
||||
/*
|
||||
* Remember the tuple unless we're the top-level window and we're in
|
||||
* pass-through mode.
|
||||
*/
|
||||
if (winstate->status != WINDOWAGG_PASSTHROUGH_STRICT)
|
||||
{
|
||||
/* Still in partition, so save it into the tuplestore */
|
||||
tuplestore_puttupleslot(winstate->buffer, outerslot);
|
||||
winstate->spooled_rows++;
|
||||
}
|
||||
}
|
||||
|
||||
MemoryContextSwitchTo(oldcontext);
|
||||
@@ -2023,13 +2044,14 @@ static TupleTableSlot *
|
||||
ExecWindowAgg(PlanState *pstate)
|
||||
{
|
||||
WindowAggState *winstate = castNode(WindowAggState, pstate);
|
||||
TupleTableSlot *slot;
|
||||
ExprContext *econtext;
|
||||
int i;
|
||||
int numfuncs;
|
||||
|
||||
CHECK_FOR_INTERRUPTS();
|
||||
|
||||
if (winstate->all_done)
|
||||
if (winstate->status == WINDOWAGG_DONE)
|
||||
return NULL;
|
||||
|
||||
/*
|
||||
@@ -2099,143 +2121,224 @@ ExecWindowAgg(PlanState *pstate)
|
||||
winstate->all_first = false;
|
||||
}
|
||||
|
||||
if (winstate->buffer == NULL)
|
||||
/* We need to loop as the runCondition or qual may filter out tuples */
|
||||
for (;;)
|
||||
{
|
||||
/* Initialize for first partition and set current row = 0 */
|
||||
begin_partition(winstate);
|
||||
/* If there are no input rows, we'll detect that and exit below */
|
||||
}
|
||||
else
|
||||
{
|
||||
/* Advance current row within partition */
|
||||
winstate->currentpos++;
|
||||
/* This might mean that the frame moves, too */
|
||||
winstate->framehead_valid = false;
|
||||
winstate->frametail_valid = false;
|
||||
/* we don't need to invalidate grouptail here; see below */
|
||||
}
|
||||
|
||||
/*
|
||||
* Spool all tuples up to and including the current row, if we haven't
|
||||
* already
|
||||
*/
|
||||
spool_tuples(winstate, winstate->currentpos);
|
||||
|
||||
/* Move to the next partition if we reached the end of this partition */
|
||||
if (winstate->partition_spooled &&
|
||||
winstate->currentpos >= winstate->spooled_rows)
|
||||
{
|
||||
release_partition(winstate);
|
||||
|
||||
if (winstate->more_partitions)
|
||||
if (winstate->buffer == NULL)
|
||||
{
|
||||
/* Initialize for first partition and set current row = 0 */
|
||||
begin_partition(winstate);
|
||||
Assert(winstate->spooled_rows > 0);
|
||||
/* If there are no input rows, we'll detect that and exit below */
|
||||
}
|
||||
else
|
||||
{
|
||||
winstate->all_done = true;
|
||||
return NULL;
|
||||
/* Advance current row within partition */
|
||||
winstate->currentpos++;
|
||||
/* This might mean that the frame moves, too */
|
||||
winstate->framehead_valid = false;
|
||||
winstate->frametail_valid = false;
|
||||
/* we don't need to invalidate grouptail here; see below */
|
||||
}
|
||||
}
|
||||
|
||||
/* final output execution is in ps_ExprContext */
|
||||
econtext = winstate->ss.ps.ps_ExprContext;
|
||||
/*
|
||||
* Spool all tuples up to and including the current row, if we haven't
|
||||
* already
|
||||
*/
|
||||
spool_tuples(winstate, winstate->currentpos);
|
||||
|
||||
/* Clear the per-output-tuple context for current row */
|
||||
ResetExprContext(econtext);
|
||||
|
||||
/*
|
||||
* Read the current row from the tuplestore, and save in ScanTupleSlot.
|
||||
* (We can't rely on the outerplan's output slot because we may have to
|
||||
* read beyond the current row. Also, we have to actually copy the row
|
||||
* out of the tuplestore, since window function evaluation might cause the
|
||||
* tuplestore to dump its state to disk.)
|
||||
*
|
||||
* In GROUPS mode, or when tracking a group-oriented exclusion clause, we
|
||||
* must also detect entering a new peer group and update associated state
|
||||
* when that happens. We use temp_slot_2 to temporarily hold the previous
|
||||
* row for this purpose.
|
||||
*
|
||||
* Current row must be in the tuplestore, since we spooled it above.
|
||||
*/
|
||||
tuplestore_select_read_pointer(winstate->buffer, winstate->current_ptr);
|
||||
if ((winstate->frameOptions & (FRAMEOPTION_GROUPS |
|
||||
FRAMEOPTION_EXCLUDE_GROUP |
|
||||
FRAMEOPTION_EXCLUDE_TIES)) &&
|
||||
winstate->currentpos > 0)
|
||||
{
|
||||
ExecCopySlot(winstate->temp_slot_2, winstate->ss.ss_ScanTupleSlot);
|
||||
if (!tuplestore_gettupleslot(winstate->buffer, true, true,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
elog(ERROR, "unexpected end of tuplestore");
|
||||
if (!are_peers(winstate, winstate->temp_slot_2,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
/* Move to the next partition if we reached the end of this partition */
|
||||
if (winstate->partition_spooled &&
|
||||
winstate->currentpos >= winstate->spooled_rows)
|
||||
{
|
||||
winstate->currentgroup++;
|
||||
winstate->groupheadpos = winstate->currentpos;
|
||||
winstate->grouptail_valid = false;
|
||||
release_partition(winstate);
|
||||
|
||||
if (winstate->more_partitions)
|
||||
{
|
||||
begin_partition(winstate);
|
||||
Assert(winstate->spooled_rows > 0);
|
||||
|
||||
/* Come out of pass-through mode when changing partition */
|
||||
winstate->status = WINDOWAGG_RUN;
|
||||
}
|
||||
else
|
||||
{
|
||||
/* No further partitions? We're done */
|
||||
winstate->status = WINDOWAGG_DONE;
|
||||
return NULL;
|
||||
}
|
||||
}
|
||||
ExecClearTuple(winstate->temp_slot_2);
|
||||
}
|
||||
else
|
||||
{
|
||||
if (!tuplestore_gettupleslot(winstate->buffer, true, true,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
elog(ERROR, "unexpected end of tuplestore");
|
||||
|
||||
/* final output execution is in ps_ExprContext */
|
||||
econtext = winstate->ss.ps.ps_ExprContext;
|
||||
|
||||
/* Clear the per-output-tuple context for current row */
|
||||
ResetExprContext(econtext);
|
||||
|
||||
/*
|
||||
* Read the current row from the tuplestore, and save in
|
||||
* ScanTupleSlot. (We can't rely on the outerplan's output slot
|
||||
* because we may have to read beyond the current row. Also, we have
|
||||
* to actually copy the row out of the tuplestore, since window
|
||||
* function evaluation might cause the tuplestore to dump its state to
|
||||
* disk.)
|
||||
*
|
||||
* In GROUPS mode, or when tracking a group-oriented exclusion clause,
|
||||
* we must also detect entering a new peer group and update associated
|
||||
* state when that happens. We use temp_slot_2 to temporarily hold
|
||||
* the previous row for this purpose.
|
||||
*
|
||||
* Current row must be in the tuplestore, since we spooled it above.
|
||||
*/
|
||||
tuplestore_select_read_pointer(winstate->buffer, winstate->current_ptr);
|
||||
if ((winstate->frameOptions & (FRAMEOPTION_GROUPS |
|
||||
FRAMEOPTION_EXCLUDE_GROUP |
|
||||
FRAMEOPTION_EXCLUDE_TIES)) &&
|
||||
winstate->currentpos > 0)
|
||||
{
|
||||
ExecCopySlot(winstate->temp_slot_2, winstate->ss.ss_ScanTupleSlot);
|
||||
if (!tuplestore_gettupleslot(winstate->buffer, true, true,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
elog(ERROR, "unexpected end of tuplestore");
|
||||
if (!are_peers(winstate, winstate->temp_slot_2,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
{
|
||||
winstate->currentgroup++;
|
||||
winstate->groupheadpos = winstate->currentpos;
|
||||
winstate->grouptail_valid = false;
|
||||
}
|
||||
ExecClearTuple(winstate->temp_slot_2);
|
||||
}
|
||||
else
|
||||
{
|
||||
if (!tuplestore_gettupleslot(winstate->buffer, true, true,
|
||||
winstate->ss.ss_ScanTupleSlot))
|
||||
elog(ERROR, "unexpected end of tuplestore");
|
||||
}
|
||||
|
||||
/* don't evaluate the window functions when we're in pass-through mode */
|
||||
if (winstate->status == WINDOWAGG_RUN)
|
||||
{
|
||||
/*
|
||||
* Evaluate true window functions
|
||||
*/
|
||||
numfuncs = winstate->numfuncs;
|
||||
for (i = 0; i < numfuncs; i++)
|
||||
{
|
||||
WindowStatePerFunc perfuncstate = &(winstate->perfunc[i]);
|
||||
|
||||
if (perfuncstate->plain_agg)
|
||||
continue;
|
||||
eval_windowfunction(winstate, perfuncstate,
|
||||
&(econtext->ecxt_aggvalues[perfuncstate->wfuncstate->wfuncno]),
|
||||
&(econtext->ecxt_aggnulls[perfuncstate->wfuncstate->wfuncno]));
|
||||
}
|
||||
|
||||
/*
|
||||
* Evaluate aggregates
|
||||
*/
|
||||
if (winstate->numaggs > 0)
|
||||
eval_windowaggregates(winstate);
|
||||
}
|
||||
|
||||
/*
|
||||
* If we have created auxiliary read pointers for the frame or group
|
||||
* boundaries, force them to be kept up-to-date, because we don't know
|
||||
* whether the window function(s) will do anything that requires that.
|
||||
* Failing to advance the pointers would result in being unable to
|
||||
* trim data from the tuplestore, which is bad. (If we could know in
|
||||
* advance whether the window functions will use frame boundary info,
|
||||
* we could skip creating these pointers in the first place ... but
|
||||
* unfortunately the window function API doesn't require that.)
|
||||
*/
|
||||
if (winstate->framehead_ptr >= 0)
|
||||
update_frameheadpos(winstate);
|
||||
if (winstate->frametail_ptr >= 0)
|
||||
update_frametailpos(winstate);
|
||||
if (winstate->grouptail_ptr >= 0)
|
||||
update_grouptailpos(winstate);
|
||||
|
||||
/*
|
||||
* Truncate any no-longer-needed rows from the tuplestore.
|
||||
*/
|
||||
tuplestore_trim(winstate->buffer);
|
||||
|
||||
/*
|
||||
* Form and return a projection tuple using the windowfunc results and
|
||||
* the current row. Setting ecxt_outertuple arranges that any Vars
|
||||
* will be evaluated with respect to that row.
|
||||
*/
|
||||
econtext->ecxt_outertuple = winstate->ss.ss_ScanTupleSlot;
|
||||
|
||||
slot = ExecProject(winstate->ss.ps.ps_ProjInfo);
|
||||
|
||||
if (winstate->status == WINDOWAGG_RUN)
|
||||
{
|
||||
econtext->ecxt_scantuple = slot;
|
||||
|
||||
/*
|
||||
* Now evaluate the run condition to see if we need to go into
|
||||
* pass-through mode, or maybe stop completely.
|
||||
*/
|
||||
if (!ExecQual(winstate->runcondition, econtext))
|
||||
{
|
||||
/*
|
||||
* Determine which mode to move into. If there is no
|
||||
* PARTITION BY clause and we're the top-level WindowAgg then
|
||||
* we're done. This tuple and any future tuples cannot
|
||||
* possibly match the runcondition. However, when there is a
|
||||
* PARTITION BY clause or we're not the top-level window we
|
||||
* can't just stop as we need to either process other
|
||||
* partitions or ensure WindowAgg nodes above us receive all
|
||||
* of the tuples they need to process their WindowFuncs.
|
||||
*/
|
||||
if (winstate->use_pass_through)
|
||||
{
|
||||
/*
|
||||
* STRICT pass-through mode is required for the top window
|
||||
* when there is a PARTITION BY clause. Otherwise we must
|
||||
* ensure we store tuples that don't match the
|
||||
* runcondition so they're available to WindowAggs above.
|
||||
*/
|
||||
if (winstate->top_window)
|
||||
{
|
||||
winstate->status = WINDOWAGG_PASSTHROUGH_STRICT;
|
||||
continue;
|
||||
}
|
||||
else
|
||||
winstate->status = WINDOWAGG_PASSTHROUGH;
|
||||
}
|
||||
else
|
||||
{
|
||||
/*
|
||||
* Pass-through not required. We can just return NULL.
|
||||
* Nothing else will match the runcondition.
|
||||
*/
|
||||
winstate->status = WINDOWAGG_DONE;
|
||||
return NULL;
|
||||
}
|
||||
}
|
||||
|
||||
/*
|
||||
* Filter out any tuples we don't need in the top-level WindowAgg.
|
||||
*/
|
||||
if (!ExecQual(winstate->ss.ps.qual, econtext))
|
||||
{
|
||||
InstrCountFiltered1(winstate, 1);
|
||||
continue;
|
||||
}
|
||||
|
||||
break;
|
||||
}
|
||||
|
||||
/*
|
||||
* When not in WINDOWAGG_RUN mode, we must still return this tuple if
|
||||
* we're anything apart from the top window.
|
||||
*/
|
||||
else if (!winstate->top_window)
|
||||
break;
|
||||
}
|
||||
|
||||
/*
|
||||
* Evaluate true window functions
|
||||
*/
|
||||
numfuncs = winstate->numfuncs;
|
||||
for (i = 0; i < numfuncs; i++)
|
||||
{
|
||||
WindowStatePerFunc perfuncstate = &(winstate->perfunc[i]);
|
||||
|
||||
if (perfuncstate->plain_agg)
|
||||
continue;
|
||||
eval_windowfunction(winstate, perfuncstate,
|
||||
&(econtext->ecxt_aggvalues[perfuncstate->wfuncstate->wfuncno]),
|
||||
&(econtext->ecxt_aggnulls[perfuncstate->wfuncstate->wfuncno]));
|
||||
}
|
||||
|
||||
/*
|
||||
* Evaluate aggregates
|
||||
*/
|
||||
if (winstate->numaggs > 0)
|
||||
eval_windowaggregates(winstate);
|
||||
|
||||
/*
|
||||
* If we have created auxiliary read pointers for the frame or group
|
||||
* boundaries, force them to be kept up-to-date, because we don't know
|
||||
* whether the window function(s) will do anything that requires that.
|
||||
* Failing to advance the pointers would result in being unable to trim
|
||||
* data from the tuplestore, which is bad. (If we could know in advance
|
||||
* whether the window functions will use frame boundary info, we could
|
||||
* skip creating these pointers in the first place ... but unfortunately
|
||||
* the window function API doesn't require that.)
|
||||
*/
|
||||
if (winstate->framehead_ptr >= 0)
|
||||
update_frameheadpos(winstate);
|
||||
if (winstate->frametail_ptr >= 0)
|
||||
update_frametailpos(winstate);
|
||||
if (winstate->grouptail_ptr >= 0)
|
||||
update_grouptailpos(winstate);
|
||||
|
||||
/*
|
||||
* Truncate any no-longer-needed rows from the tuplestore.
|
||||
*/
|
||||
tuplestore_trim(winstate->buffer);
|
||||
|
||||
/*
|
||||
* Form and return a projection tuple using the windowfunc results and the
|
||||
* current row. Setting ecxt_outertuple arranges that any Vars will be
|
||||
* evaluated with respect to that row.
|
||||
*/
|
||||
econtext->ecxt_outertuple = winstate->ss.ss_ScanTupleSlot;
|
||||
|
||||
return ExecProject(winstate->ss.ps.ps_ProjInfo);
|
||||
return slot;
|
||||
}
|
||||
|
||||
/* -----------------
|
||||
@@ -2300,12 +2403,32 @@ ExecInitWindowAgg(WindowAgg *node, EState *estate, int eflags)
|
||||
"WindowAgg Aggregates",
|
||||
ALLOCSET_DEFAULT_SIZES);
|
||||
|
||||
/* Only the top-level WindowAgg may have a qual */
|
||||
Assert(node->plan.qual == NIL || node->topWindow);
|
||||
|
||||
/* Initialize the qual */
|
||||
winstate->ss.ps.qual = ExecInitQual(node->plan.qual,
|
||||
(PlanState *) winstate);
|
||||
|
||||
/*
|
||||
* WindowAgg nodes never have quals, since they can only occur at the
|
||||
* logical top level of a query (ie, after any WHERE or HAVING filters)
|
||||
* Setup the run condition, if we received one from the query planner.
|
||||
* When set, this may allow us to move into pass-through mode so that we
|
||||
* don't have to perform any further evaluation of WindowFuncs in the
|
||||
* current partition or possibly stop returning tuples altogether when all
|
||||
* tuples are in the same partition.
|
||||
*/
|
||||
Assert(node->plan.qual == NIL);
|
||||
winstate->ss.ps.qual = NULL;
|
||||
winstate->runcondition = ExecInitQual(node->runCondition,
|
||||
(PlanState *) winstate);
|
||||
|
||||
/*
|
||||
* When we're not the top-level WindowAgg node or we are but have a
|
||||
* PARTITION BY clause we must move into one of the WINDOWAGG_PASSTHROUGH*
|
||||
* modes when the runCondition becomes false.
|
||||
*/
|
||||
winstate->use_pass_through = !node->topWindow || node->partNumCols > 0;
|
||||
|
||||
/* remember if we're the top-window or we are below the top-window */
|
||||
winstate->top_window = node->topWindow;
|
||||
|
||||
/*
|
||||
* initialize child nodes
|
||||
@@ -2500,6 +2623,9 @@ ExecInitWindowAgg(WindowAgg *node, EState *estate, int eflags)
|
||||
winstate->agg_winobj = agg_winobj;
|
||||
}
|
||||
|
||||
/* Set the status to running */
|
||||
winstate->status = WINDOWAGG_RUN;
|
||||
|
||||
/* copy frame options to state node for easy access */
|
||||
winstate->frameOptions = frameOptions;
|
||||
|
||||
@@ -2579,7 +2705,7 @@ ExecReScanWindowAgg(WindowAggState *node)
|
||||
PlanState *outerPlan = outerPlanState(node);
|
||||
ExprContext *econtext = node->ss.ps.ps_ExprContext;
|
||||
|
||||
node->all_done = false;
|
||||
node->status = WINDOWAGG_RUN;
|
||||
node->all_first = true;
|
||||
|
||||
/* release tuplestore et al */
|
||||
|
@@ -1104,11 +1104,14 @@ _copyWindowAgg(const WindowAgg *from)
|
||||
COPY_SCALAR_FIELD(frameOptions);
|
||||
COPY_NODE_FIELD(startOffset);
|
||||
COPY_NODE_FIELD(endOffset);
|
||||
COPY_NODE_FIELD(runCondition);
|
||||
COPY_NODE_FIELD(runConditionOrig);
|
||||
COPY_SCALAR_FIELD(startInRangeFunc);
|
||||
COPY_SCALAR_FIELD(endInRangeFunc);
|
||||
COPY_SCALAR_FIELD(inRangeColl);
|
||||
COPY_SCALAR_FIELD(inRangeAsc);
|
||||
COPY_SCALAR_FIELD(inRangeNullsFirst);
|
||||
COPY_SCALAR_FIELD(topWindow);
|
||||
|
||||
return newnode;
|
||||
}
|
||||
@@ -3061,6 +3064,7 @@ _copyWindowClause(const WindowClause *from)
|
||||
COPY_SCALAR_FIELD(frameOptions);
|
||||
COPY_NODE_FIELD(startOffset);
|
||||
COPY_NODE_FIELD(endOffset);
|
||||
COPY_NODE_FIELD(runCondition);
|
||||
COPY_SCALAR_FIELD(startInRangeFunc);
|
||||
COPY_SCALAR_FIELD(endInRangeFunc);
|
||||
COPY_SCALAR_FIELD(inRangeColl);
|
||||
|
@@ -3234,6 +3234,7 @@ _equalWindowClause(const WindowClause *a, const WindowClause *b)
|
||||
COMPARE_SCALAR_FIELD(frameOptions);
|
||||
COMPARE_NODE_FIELD(startOffset);
|
||||
COMPARE_NODE_FIELD(endOffset);
|
||||
COMPARE_NODE_FIELD(runCondition);
|
||||
COMPARE_SCALAR_FIELD(startInRangeFunc);
|
||||
COMPARE_SCALAR_FIELD(endInRangeFunc);
|
||||
COMPARE_SCALAR_FIELD(inRangeColl);
|
||||
|
@@ -829,11 +829,14 @@ _outWindowAgg(StringInfo str, const WindowAgg *node)
|
||||
WRITE_INT_FIELD(frameOptions);
|
||||
WRITE_NODE_FIELD(startOffset);
|
||||
WRITE_NODE_FIELD(endOffset);
|
||||
WRITE_NODE_FIELD(runCondition);
|
||||
WRITE_NODE_FIELD(runConditionOrig);
|
||||
WRITE_OID_FIELD(startInRangeFunc);
|
||||
WRITE_OID_FIELD(endInRangeFunc);
|
||||
WRITE_OID_FIELD(inRangeColl);
|
||||
WRITE_BOOL_FIELD(inRangeAsc);
|
||||
WRITE_BOOL_FIELD(inRangeNullsFirst);
|
||||
WRITE_BOOL_FIELD(topWindow);
|
||||
}
|
||||
|
||||
static void
|
||||
@@ -2283,6 +2286,8 @@ _outWindowAggPath(StringInfo str, const WindowAggPath *node)
|
||||
|
||||
WRITE_NODE_FIELD(subpath);
|
||||
WRITE_NODE_FIELD(winclause);
|
||||
WRITE_NODE_FIELD(qual);
|
||||
WRITE_BOOL_FIELD(topwindow);
|
||||
}
|
||||
|
||||
static void
|
||||
@@ -3293,6 +3298,7 @@ _outWindowClause(StringInfo str, const WindowClause *node)
|
||||
WRITE_INT_FIELD(frameOptions);
|
||||
WRITE_NODE_FIELD(startOffset);
|
||||
WRITE_NODE_FIELD(endOffset);
|
||||
WRITE_NODE_FIELD(runCondition);
|
||||
WRITE_OID_FIELD(startInRangeFunc);
|
||||
WRITE_OID_FIELD(endInRangeFunc);
|
||||
WRITE_OID_FIELD(inRangeColl);
|
||||
|
@@ -384,6 +384,7 @@ _readWindowClause(void)
|
||||
READ_INT_FIELD(frameOptions);
|
||||
READ_NODE_FIELD(startOffset);
|
||||
READ_NODE_FIELD(endOffset);
|
||||
READ_NODE_FIELD(runCondition);
|
||||
READ_OID_FIELD(startInRangeFunc);
|
||||
READ_OID_FIELD(endInRangeFunc);
|
||||
READ_OID_FIELD(inRangeColl);
|
||||
@@ -2576,11 +2577,14 @@ _readWindowAgg(void)
|
||||
READ_INT_FIELD(frameOptions);
|
||||
READ_NODE_FIELD(startOffset);
|
||||
READ_NODE_FIELD(endOffset);
|
||||
READ_NODE_FIELD(runCondition);
|
||||
READ_NODE_FIELD(runConditionOrig);
|
||||
READ_OID_FIELD(startInRangeFunc);
|
||||
READ_OID_FIELD(endInRangeFunc);
|
||||
READ_OID_FIELD(inRangeColl);
|
||||
READ_BOOL_FIELD(inRangeAsc);
|
||||
READ_BOOL_FIELD(inRangeNullsFirst);
|
||||
READ_BOOL_FIELD(topWindow);
|
||||
|
||||
READ_DONE();
|
||||
}
|
||||
|
@@ -27,6 +27,7 @@
|
||||
#include "miscadmin.h"
|
||||
#include "nodes/makefuncs.h"
|
||||
#include "nodes/nodeFuncs.h"
|
||||
#include "nodes/supportnodes.h"
|
||||
#ifdef OPTIMIZER_DEBUG
|
||||
#include "nodes/print.h"
|
||||
#endif
|
||||
@@ -2157,6 +2158,269 @@ has_multiple_baserels(PlannerInfo *root)
|
||||
return false;
|
||||
}
|
||||
|
||||
/*
|
||||
* find_window_run_conditions
|
||||
* Determine if 'wfunc' is really a WindowFunc and call its prosupport
|
||||
* function to determine the function's monotonic properties. We then
|
||||
* see if 'opexpr' can be used to short-circuit execution.
|
||||
*
|
||||
* For example row_number() over (order by ...) always produces a value one
|
||||
* higher than the previous. If someone has a window function in a subquery
|
||||
* and has a WHERE clause in the outer query to filter rows <= 10, then we may
|
||||
* as well stop processing the windowagg once the row number reaches 11. Here
|
||||
* we check if 'opexpr' might help us to stop doing needless extra processing
|
||||
* in WindowAgg nodes.
|
||||
*
|
||||
* '*keep_original' is set to true if the caller should also use 'opexpr' for
|
||||
* its original purpose. This is set to false if the caller can assume that
|
||||
* the run condition will handle all of the required filtering.
|
||||
*
|
||||
* Returns true if 'opexpr' was found to be useful and was added to the
|
||||
* WindowClauses runCondition. We also set *keep_original accordingly.
|
||||
* If the 'opexpr' cannot be used then we set *keep_original to true and
|
||||
* return false.
|
||||
*/
|
||||
static bool
|
||||
find_window_run_conditions(Query *subquery, RangeTblEntry *rte, Index rti,
|
||||
AttrNumber attno, WindowFunc *wfunc, OpExpr *opexpr,
|
||||
bool wfunc_left, bool *keep_original)
|
||||
{
|
||||
Oid prosupport;
|
||||
Expr *otherexpr;
|
||||
SupportRequestWFuncMonotonic req;
|
||||
SupportRequestWFuncMonotonic *res;
|
||||
WindowClause *wclause;
|
||||
List *opinfos;
|
||||
OpExpr *runopexpr;
|
||||
Oid runoperator;
|
||||
ListCell *lc;
|
||||
|
||||
*keep_original = true;
|
||||
|
||||
while (IsA(wfunc, RelabelType))
|
||||
wfunc = (WindowFunc *) ((RelabelType *) wfunc)->arg;
|
||||
|
||||
/* we can only work with window functions */
|
||||
if (!IsA(wfunc, WindowFunc))
|
||||
return false;
|
||||
|
||||
prosupport = get_func_support(wfunc->winfnoid);
|
||||
|
||||
/* Check if there's a support function for 'wfunc' */
|
||||
if (!OidIsValid(prosupport))
|
||||
return false;
|
||||
|
||||
/* get the Expr from the other side of the OpExpr */
|
||||
if (wfunc_left)
|
||||
otherexpr = lsecond(opexpr->args);
|
||||
else
|
||||
otherexpr = linitial(opexpr->args);
|
||||
|
||||
/*
|
||||
* The value being compared must not change during the evaluation of the
|
||||
* window partition.
|
||||
*/
|
||||
if (!is_pseudo_constant_clause((Node *) otherexpr))
|
||||
return false;
|
||||
|
||||
/* find the window clause belonging to the window function */
|
||||
wclause = (WindowClause *) list_nth(subquery->windowClause,
|
||||
wfunc->winref - 1);
|
||||
|
||||
req.type = T_SupportRequestWFuncMonotonic;
|
||||
req.window_func = wfunc;
|
||||
req.window_clause = wclause;
|
||||
|
||||
/* call the support function */
|
||||
res = (SupportRequestWFuncMonotonic *)
|
||||
DatumGetPointer(OidFunctionCall1(prosupport,
|
||||
PointerGetDatum(&req)));
|
||||
|
||||
/*
|
||||
* Nothing to do if the function is neither monotonically increasing nor
|
||||
* monotonically decreasing.
|
||||
*/
|
||||
if (res == NULL || res->monotonic == MONOTONICFUNC_NONE)
|
||||
return false;
|
||||
|
||||
runopexpr = NULL;
|
||||
runoperator = InvalidOid;
|
||||
opinfos = get_op_btree_interpretation(opexpr->opno);
|
||||
|
||||
foreach(lc, opinfos)
|
||||
{
|
||||
OpBtreeInterpretation *opinfo = (OpBtreeInterpretation *) lfirst(lc);
|
||||
int strategy = opinfo->strategy;
|
||||
|
||||
/* handle < / <= */
|
||||
if (strategy == BTLessStrategyNumber ||
|
||||
strategy == BTLessEqualStrategyNumber)
|
||||
{
|
||||
/*
|
||||
* < / <= is supported for monotonically increasing functions in
|
||||
* the form <wfunc> op <pseudoconst> and <pseudoconst> op <wfunc>
|
||||
* for monotonically decreasing functions.
|
||||
*/
|
||||
if ((wfunc_left && (res->monotonic & MONOTONICFUNC_INCREASING)) ||
|
||||
(!wfunc_left && (res->monotonic & MONOTONICFUNC_DECREASING)))
|
||||
{
|
||||
*keep_original = false;
|
||||
runopexpr = opexpr;
|
||||
runoperator = opexpr->opno;
|
||||
}
|
||||
break;
|
||||
}
|
||||
/* handle > / >= */
|
||||
else if (strategy == BTGreaterStrategyNumber ||
|
||||
strategy == BTGreaterEqualStrategyNumber)
|
||||
{
|
||||
/*
|
||||
* > / >= is supported for monotonically decreasing functions in
|
||||
* the form <wfunc> op <pseudoconst> and <pseudoconst> op <wfunc>
|
||||
* for monotonically increasing functions.
|
||||
*/
|
||||
if ((wfunc_left && (res->monotonic & MONOTONICFUNC_DECREASING)) ||
|
||||
(!wfunc_left && (res->monotonic & MONOTONICFUNC_INCREASING)))
|
||||
{
|
||||
*keep_original = false;
|
||||
runopexpr = opexpr;
|
||||
runoperator = opexpr->opno;
|
||||
}
|
||||
break;
|
||||
}
|
||||
/* handle = */
|
||||
else if (strategy == BTEqualStrategyNumber)
|
||||
{
|
||||
int16 newstrategy;
|
||||
|
||||
/*
|
||||
* When both monotonically increasing and decreasing then the
|
||||
* return value of the window function will be the same each time.
|
||||
* We can simply use 'opexpr' as the run condition without
|
||||
* modifying it.
|
||||
*/
|
||||
if ((res->monotonic & MONOTONICFUNC_BOTH) == MONOTONICFUNC_BOTH)
|
||||
{
|
||||
*keep_original = false;
|
||||
runopexpr = opexpr;
|
||||
break;
|
||||
}
|
||||
|
||||
/*
|
||||
* When monotonically increasing we make a qual with <wfunc> <=
|
||||
* <value> or <value> >= <wfunc> in order to filter out values
|
||||
* which are above the value in the equality condition. For
|
||||
* monotonically decreasing functions we want to filter values
|
||||
* below the value in the equality condition.
|
||||
*/
|
||||
if (res->monotonic & MONOTONICFUNC_INCREASING)
|
||||
newstrategy = wfunc_left ? BTLessEqualStrategyNumber : BTGreaterEqualStrategyNumber;
|
||||
else
|
||||
newstrategy = wfunc_left ? BTGreaterEqualStrategyNumber : BTLessEqualStrategyNumber;
|
||||
|
||||
/* We must keep the original equality qual */
|
||||
*keep_original = true;
|
||||
runopexpr = opexpr;
|
||||
|
||||
/* determine the operator to use for the runCondition qual */
|
||||
runoperator = get_opfamily_member(opinfo->opfamily_id,
|
||||
opinfo->oplefttype,
|
||||
opinfo->oprighttype,
|
||||
newstrategy);
|
||||
break;
|
||||
}
|
||||
}
|
||||
|
||||
if (runopexpr != NULL)
|
||||
{
|
||||
Expr *newexpr;
|
||||
|
||||
/*
|
||||
* Build the qual required for the run condition keeping the
|
||||
* WindowFunc on the same side as it was originally.
|
||||
*/
|
||||
if (wfunc_left)
|
||||
newexpr = make_opclause(runoperator,
|
||||
runopexpr->opresulttype,
|
||||
runopexpr->opretset, (Expr *) wfunc,
|
||||
otherexpr, runopexpr->opcollid,
|
||||
runopexpr->inputcollid);
|
||||
else
|
||||
newexpr = make_opclause(runoperator,
|
||||
runopexpr->opresulttype,
|
||||
runopexpr->opretset,
|
||||
otherexpr, (Expr *) wfunc,
|
||||
runopexpr->opcollid,
|
||||
runopexpr->inputcollid);
|
||||
|
||||
wclause->runCondition = lappend(wclause->runCondition, newexpr);
|
||||
|
||||
return true;
|
||||
}
|
||||
|
||||
/* unsupported OpExpr */
|
||||
return false;
|
||||
}
|
||||
|
||||
/*
|
||||
* check_and_push_window_quals
|
||||
* Check if 'clause' is a qual that can be pushed into a WindowFunc's
|
||||
* WindowClause as a 'runCondition' qual. These, when present, allow
|
||||
* some unnecessary work to be skipped during execution.
|
||||
*
|
||||
* Returns true if the caller still must keep the original qual or false if
|
||||
* the caller can safely ignore the original qual because the WindowAgg node
|
||||
* will use the runCondition to stop returning tuples.
|
||||
*/
|
||||
static bool
|
||||
check_and_push_window_quals(Query *subquery, RangeTblEntry *rte, Index rti,
|
||||
Node *clause)
|
||||
{
|
||||
OpExpr *opexpr = (OpExpr *) clause;
|
||||
bool keep_original = true;
|
||||
Var *var1;
|
||||
Var *var2;
|
||||
|
||||
/* We're only able to use OpExprs with 2 operands */
|
||||
if (!IsA(opexpr, OpExpr))
|
||||
return true;
|
||||
|
||||
if (list_length(opexpr->args) != 2)
|
||||
return true;
|
||||
|
||||
/*
|
||||
* Check for plain Vars that reference window functions in the subquery.
|
||||
* If we find any, we'll ask find_window_run_conditions() if 'opexpr' can
|
||||
* be used as part of the run condition.
|
||||
*/
|
||||
|
||||
/* Check the left side of the OpExpr */
|
||||
var1 = linitial(opexpr->args);
|
||||
if (IsA(var1, Var) && var1->varattno > 0)
|
||||
{
|
||||
TargetEntry *tle = list_nth(subquery->targetList, var1->varattno - 1);
|
||||
WindowFunc *wfunc = (WindowFunc *) tle->expr;
|
||||
|
||||
if (find_window_run_conditions(subquery, rte, rti, tle->resno, wfunc,
|
||||
opexpr, true, &keep_original))
|
||||
return keep_original;
|
||||
}
|
||||
|
||||
/* and check the right side */
|
||||
var2 = lsecond(opexpr->args);
|
||||
if (IsA(var2, Var) && var2->varattno > 0)
|
||||
{
|
||||
TargetEntry *tle = list_nth(subquery->targetList, var2->varattno - 1);
|
||||
WindowFunc *wfunc = (WindowFunc *) tle->expr;
|
||||
|
||||
if (find_window_run_conditions(subquery, rte, rti, tle->resno, wfunc,
|
||||
opexpr, false, &keep_original))
|
||||
return keep_original;
|
||||
}
|
||||
|
||||
return true;
|
||||
}
|
||||
|
||||
/*
|
||||
* set_subquery_pathlist
|
||||
* Generate SubqueryScan access paths for a subquery RTE
|
||||
@@ -2245,19 +2509,31 @@ set_subquery_pathlist(PlannerInfo *root, RelOptInfo *rel,
|
||||
foreach(l, rel->baserestrictinfo)
|
||||
{
|
||||
RestrictInfo *rinfo = (RestrictInfo *) lfirst(l);
|
||||
Node *clause = (Node *) rinfo->clause;
|
||||
|
||||
if (!rinfo->pseudoconstant &&
|
||||
qual_is_pushdown_safe(subquery, rti, rinfo, &safetyInfo))
|
||||
{
|
||||
Node *clause = (Node *) rinfo->clause;
|
||||
|
||||
/* Push it down */
|
||||
subquery_push_qual(subquery, rte, rti, clause);
|
||||
}
|
||||
else
|
||||
{
|
||||
/* Keep it in the upper query */
|
||||
upperrestrictlist = lappend(upperrestrictlist, rinfo);
|
||||
/*
|
||||
* Since we can't push the qual down into the subquery, check
|
||||
* if it happens to reference a window function. If so then
|
||||
* it might be useful to use for the WindowAgg's runCondition.
|
||||
*/
|
||||
if (!subquery->hasWindowFuncs ||
|
||||
check_and_push_window_quals(subquery, rte, rti, clause))
|
||||
{
|
||||
/*
|
||||
* subquery has no window funcs or the clause is not a
|
||||
* suitable window run condition qual or it is, but the
|
||||
* original must also be kept in the upper query.
|
||||
*/
|
||||
upperrestrictlist = lappend(upperrestrictlist, rinfo);
|
||||
}
|
||||
}
|
||||
}
|
||||
rel->baserestrictinfo = upperrestrictlist;
|
||||
|
@@ -288,6 +288,7 @@ static WindowAgg *make_windowagg(List *tlist, Index winref,
|
||||
int frameOptions, Node *startOffset, Node *endOffset,
|
||||
Oid startInRangeFunc, Oid endInRangeFunc,
|
||||
Oid inRangeColl, bool inRangeAsc, bool inRangeNullsFirst,
|
||||
List *runCondition, List *qual, bool topWindow,
|
||||
Plan *lefttree);
|
||||
static Group *make_group(List *tlist, List *qual, int numGroupCols,
|
||||
AttrNumber *grpColIdx, Oid *grpOperators, Oid *grpCollations,
|
||||
@@ -2672,6 +2673,9 @@ create_windowagg_plan(PlannerInfo *root, WindowAggPath *best_path)
|
||||
wc->inRangeColl,
|
||||
wc->inRangeAsc,
|
||||
wc->inRangeNullsFirst,
|
||||
wc->runCondition,
|
||||
best_path->qual,
|
||||
best_path->topwindow,
|
||||
subplan);
|
||||
|
||||
copy_generic_path_info(&plan->plan, (Path *) best_path);
|
||||
@@ -6558,7 +6562,7 @@ make_windowagg(List *tlist, Index winref,
|
||||
int frameOptions, Node *startOffset, Node *endOffset,
|
||||
Oid startInRangeFunc, Oid endInRangeFunc,
|
||||
Oid inRangeColl, bool inRangeAsc, bool inRangeNullsFirst,
|
||||
Plan *lefttree)
|
||||
List *runCondition, List *qual, bool topWindow, Plan *lefttree)
|
||||
{
|
||||
WindowAgg *node = makeNode(WindowAgg);
|
||||
Plan *plan = &node->plan;
|
||||
@@ -6575,17 +6579,20 @@ make_windowagg(List *tlist, Index winref,
|
||||
node->frameOptions = frameOptions;
|
||||
node->startOffset = startOffset;
|
||||
node->endOffset = endOffset;
|
||||
node->runCondition = runCondition;
|
||||
/* a duplicate of the above for EXPLAIN */
|
||||
node->runConditionOrig = runCondition;
|
||||
node->startInRangeFunc = startInRangeFunc;
|
||||
node->endInRangeFunc = endInRangeFunc;
|
||||
node->inRangeColl = inRangeColl;
|
||||
node->inRangeAsc = inRangeAsc;
|
||||
node->inRangeNullsFirst = inRangeNullsFirst;
|
||||
node->topWindow = topWindow;
|
||||
|
||||
plan->targetlist = tlist;
|
||||
plan->lefttree = lefttree;
|
||||
plan->righttree = NULL;
|
||||
/* WindowAgg nodes never have a qual clause */
|
||||
plan->qual = NIL;
|
||||
plan->qual = qual;
|
||||
|
||||
return node;
|
||||
}
|
||||
|
@@ -4190,6 +4190,7 @@ create_one_window_path(PlannerInfo *root,
|
||||
{
|
||||
PathTarget *window_target;
|
||||
ListCell *l;
|
||||
List *topqual = NIL;
|
||||
|
||||
/*
|
||||
* Since each window clause could require a different sort order, we stack
|
||||
@@ -4214,6 +4215,7 @@ create_one_window_path(PlannerInfo *root,
|
||||
List *window_pathkeys;
|
||||
int presorted_keys;
|
||||
bool is_sorted;
|
||||
bool topwindow;
|
||||
|
||||
window_pathkeys = make_pathkeys_for_window(root,
|
||||
wc,
|
||||
@@ -4277,10 +4279,21 @@ create_one_window_path(PlannerInfo *root,
|
||||
window_target = output_target;
|
||||
}
|
||||
|
||||
/* mark the final item in the list as the top-level window */
|
||||
topwindow = foreach_current_index(l) == list_length(activeWindows) - 1;
|
||||
|
||||
/*
|
||||
* Accumulate all of the runConditions from each intermediate
|
||||
* WindowClause. The top-level WindowAgg must pass these as a qual so
|
||||
* that it filters out unwanted tuples correctly.
|
||||
*/
|
||||
if (!topwindow)
|
||||
topqual = list_concat(topqual, wc->runCondition);
|
||||
|
||||
path = (Path *)
|
||||
create_windowagg_path(root, window_rel, path, window_target,
|
||||
wflists->windowFuncs[wc->winref],
|
||||
wc);
|
||||
wc, topwindow ? topqual : NIL, topwindow);
|
||||
}
|
||||
|
||||
add_path(window_rel, path);
|
||||
|
@@ -71,6 +71,13 @@ typedef struct
|
||||
double num_exec;
|
||||
} fix_upper_expr_context;
|
||||
|
||||
typedef struct
|
||||
{
|
||||
PlannerInfo *root;
|
||||
indexed_tlist *subplan_itlist;
|
||||
int newvarno;
|
||||
} fix_windowagg_cond_context;
|
||||
|
||||
/*
|
||||
* Selecting the best alternative in an AlternativeSubPlan expression requires
|
||||
* estimating how many times that expression will be evaluated. For an
|
||||
@@ -171,6 +178,9 @@ static List *set_returning_clause_references(PlannerInfo *root,
|
||||
Plan *topplan,
|
||||
Index resultRelation,
|
||||
int rtoffset);
|
||||
static List *set_windowagg_runcondition_references(PlannerInfo *root,
|
||||
List *runcondition,
|
||||
Plan *plan);
|
||||
|
||||
|
||||
/*****************************************************************************
|
||||
@@ -885,6 +895,18 @@ set_plan_refs(PlannerInfo *root, Plan *plan, int rtoffset)
|
||||
{
|
||||
WindowAgg *wplan = (WindowAgg *) plan;
|
||||
|
||||
/*
|
||||
* Adjust the WindowAgg's run conditions by swapping the
|
||||
* WindowFuncs references out to instead reference the Var in
|
||||
* the scan slot so that when the executor evaluates the
|
||||
* runCondition, it receives the WindowFunc's value from the
|
||||
* slot that the result has just been stored into rather than
|
||||
* evaluating the WindowFunc all over again.
|
||||
*/
|
||||
wplan->runCondition = set_windowagg_runcondition_references(root,
|
||||
wplan->runCondition,
|
||||
(Plan *) wplan);
|
||||
|
||||
set_upper_references(root, plan, rtoffset);
|
||||
|
||||
/*
|
||||
@@ -896,6 +918,14 @@ set_plan_refs(PlannerInfo *root, Plan *plan, int rtoffset)
|
||||
fix_scan_expr(root, wplan->startOffset, rtoffset, 1);
|
||||
wplan->endOffset =
|
||||
fix_scan_expr(root, wplan->endOffset, rtoffset, 1);
|
||||
wplan->runCondition = fix_scan_list(root,
|
||||
wplan->runCondition,
|
||||
rtoffset,
|
||||
NUM_EXEC_TLIST(plan));
|
||||
wplan->runConditionOrig = fix_scan_list(root,
|
||||
wplan->runConditionOrig,
|
||||
rtoffset,
|
||||
NUM_EXEC_TLIST(plan));
|
||||
}
|
||||
break;
|
||||
case T_Result:
|
||||
@@ -3064,6 +3094,78 @@ set_returning_clause_references(PlannerInfo *root,
|
||||
return rlist;
|
||||
}
|
||||
|
||||
/*
|
||||
* fix_windowagg_condition_expr_mutator
|
||||
* Mutator function for replacing WindowFuncs with the corresponding Var
|
||||
* in the targetlist which references that WindowFunc.
|
||||
*/
|
||||
static Node *
|
||||
fix_windowagg_condition_expr_mutator(Node *node,
|
||||
fix_windowagg_cond_context *context)
|
||||
{
|
||||
if (node == NULL)
|
||||
return NULL;
|
||||
|
||||
if (IsA(node, WindowFunc))
|
||||
{
|
||||
Var *newvar;
|
||||
|
||||
newvar = search_indexed_tlist_for_non_var((Expr *) node,
|
||||
context->subplan_itlist,
|
||||
context->newvarno);
|
||||
if (newvar)
|
||||
return (Node *) newvar;
|
||||
elog(ERROR, "WindowFunc not found in subplan target lists");
|
||||
}
|
||||
|
||||
return expression_tree_mutator(node,
|
||||
fix_windowagg_condition_expr_mutator,
|
||||
(void *) context);
|
||||
}
|
||||
|
||||
/*
|
||||
* fix_windowagg_condition_expr
|
||||
* Converts references in 'runcondition' so that any WindowFunc
|
||||
* references are swapped out for a Var which references the matching
|
||||
* WindowFunc in 'subplan_itlist'.
|
||||
*/
|
||||
static List *
|
||||
fix_windowagg_condition_expr(PlannerInfo *root,
|
||||
List *runcondition,
|
||||
indexed_tlist *subplan_itlist)
|
||||
{
|
||||
fix_windowagg_cond_context context;
|
||||
|
||||
context.root = root;
|
||||
context.subplan_itlist = subplan_itlist;
|
||||
context.newvarno = 0;
|
||||
|
||||
return (List *) fix_windowagg_condition_expr_mutator((Node *) runcondition,
|
||||
&context);
|
||||
}
|
||||
|
||||
/*
|
||||
* set_windowagg_runcondition_references
|
||||
* Converts references in 'runcondition' so that any WindowFunc
|
||||
* references are swapped out for a Var which references the matching
|
||||
* WindowFunc in 'plan' targetlist.
|
||||
*/
|
||||
static List *
|
||||
set_windowagg_runcondition_references(PlannerInfo *root,
|
||||
List *runcondition,
|
||||
Plan *plan)
|
||||
{
|
||||
List *newlist;
|
||||
indexed_tlist *itlist;
|
||||
|
||||
itlist = build_tlist_index(plan->targetlist);
|
||||
|
||||
newlist = fix_windowagg_condition_expr(root, runcondition, itlist);
|
||||
|
||||
pfree(itlist);
|
||||
|
||||
return newlist;
|
||||
}
|
||||
|
||||
/*****************************************************************************
|
||||
* QUERY DEPENDENCY MANAGEMENT
|
||||
|
@@ -3388,6 +3388,10 @@ create_minmaxagg_path(PlannerInfo *root,
|
||||
* 'target' is the PathTarget to be computed
|
||||
* 'windowFuncs' is a list of WindowFunc structs
|
||||
* 'winclause' is a WindowClause that is common to all the WindowFuncs
|
||||
* 'qual' WindowClause.runconditions from lower-level WindowAggPaths.
|
||||
* Must always be NIL when topwindow == false
|
||||
* 'topwindow' pass as true only for the top-level WindowAgg. False for all
|
||||
* intermediate WindowAggs.
|
||||
*
|
||||
* The input must be sorted according to the WindowClause's PARTITION keys
|
||||
* plus ORDER BY keys.
|
||||
@@ -3398,10 +3402,15 @@ create_windowagg_path(PlannerInfo *root,
|
||||
Path *subpath,
|
||||
PathTarget *target,
|
||||
List *windowFuncs,
|
||||
WindowClause *winclause)
|
||||
WindowClause *winclause,
|
||||
List *qual,
|
||||
bool topwindow)
|
||||
{
|
||||
WindowAggPath *pathnode = makeNode(WindowAggPath);
|
||||
|
||||
/* qual can only be set for the topwindow */
|
||||
Assert(qual == NIL || topwindow);
|
||||
|
||||
pathnode->path.pathtype = T_WindowAgg;
|
||||
pathnode->path.parent = rel;
|
||||
pathnode->path.pathtarget = target;
|
||||
@@ -3416,6 +3425,8 @@ create_windowagg_path(PlannerInfo *root,
|
||||
|
||||
pathnode->subpath = subpath;
|
||||
pathnode->winclause = winclause;
|
||||
pathnode->qual = qual;
|
||||
pathnode->topwindow = topwindow;
|
||||
|
||||
/*
|
||||
* For costing purposes, assume that there are no redundant partitioning
|
||||
|
@@ -24,6 +24,7 @@
|
||||
#include "nodes/supportnodes.h"
|
||||
#include "optimizer/optimizer.h"
|
||||
#include "utils/builtins.h"
|
||||
#include "utils/lsyscache.h"
|
||||
|
||||
|
||||
typedef struct
|
||||
@@ -818,6 +819,49 @@ int8dec_any(PG_FUNCTION_ARGS)
|
||||
return int8dec(fcinfo);
|
||||
}
|
||||
|
||||
/*
|
||||
* int8inc_support
|
||||
* prosupport function for int8inc() and int8inc_any()
|
||||
*/
|
||||
Datum
|
||||
int8inc_support(PG_FUNCTION_ARGS)
|
||||
{
|
||||
Node *rawreq = (Node *) PG_GETARG_POINTER(0);
|
||||
|
||||
if (IsA(rawreq, SupportRequestWFuncMonotonic))
|
||||
{
|
||||
SupportRequestWFuncMonotonic *req = (SupportRequestWFuncMonotonic *) rawreq;
|
||||
MonotonicFunction monotonic = MONOTONICFUNC_NONE;
|
||||
int frameOptions = req->window_clause->frameOptions;
|
||||
|
||||
/* No ORDER BY clause then all rows are peers */
|
||||
if (req->window_clause->orderClause == NIL)
|
||||
monotonic = MONOTONICFUNC_BOTH;
|
||||
else
|
||||
{
|
||||
/*
|
||||
* Otherwise take into account the frame options. When the frame
|
||||
* bound is the start of the window then the resulting value can
|
||||
* never decrease, therefore is monotonically increasing
|
||||
*/
|
||||
if (frameOptions & FRAMEOPTION_START_UNBOUNDED_PRECEDING)
|
||||
monotonic |= MONOTONICFUNC_INCREASING;
|
||||
|
||||
/*
|
||||
* Likewise, if the frame bound is the end of the window then the
|
||||
* resulting value can never decrease.
|
||||
*/
|
||||
if (frameOptions & FRAMEOPTION_END_UNBOUNDED_FOLLOWING)
|
||||
monotonic |= MONOTONICFUNC_DECREASING;
|
||||
}
|
||||
|
||||
req->monotonic = monotonic;
|
||||
PG_RETURN_POINTER(req);
|
||||
}
|
||||
|
||||
PG_RETURN_POINTER(NULL);
|
||||
}
|
||||
|
||||
|
||||
Datum
|
||||
int8larger(PG_FUNCTION_ARGS)
|
||||
|
@@ -13,6 +13,7 @@
|
||||
*/
|
||||
#include "postgres.h"
|
||||
|
||||
#include "nodes/supportnodes.h"
|
||||
#include "utils/builtins.h"
|
||||
#include "windowapi.h"
|
||||
|
||||
@@ -88,6 +89,26 @@ window_row_number(PG_FUNCTION_ARGS)
|
||||
PG_RETURN_INT64(curpos + 1);
|
||||
}
|
||||
|
||||
/*
|
||||
* window_row_number_support
|
||||
* prosupport function for window_row_number()
|
||||
*/
|
||||
Datum
|
||||
window_row_number_support(PG_FUNCTION_ARGS)
|
||||
{
|
||||
Node *rawreq = (Node *) PG_GETARG_POINTER(0);
|
||||
|
||||
if (IsA(rawreq, SupportRequestWFuncMonotonic))
|
||||
{
|
||||
SupportRequestWFuncMonotonic *req = (SupportRequestWFuncMonotonic *) rawreq;
|
||||
|
||||
/* row_number() is monotonically increasing */
|
||||
req->monotonic = MONOTONICFUNC_INCREASING;
|
||||
PG_RETURN_POINTER(req);
|
||||
}
|
||||
|
||||
PG_RETURN_POINTER(NULL);
|
||||
}
|
||||
|
||||
/*
|
||||
* rank
|
||||
@@ -110,6 +131,27 @@ window_rank(PG_FUNCTION_ARGS)
|
||||
PG_RETURN_INT64(context->rank);
|
||||
}
|
||||
|
||||
/*
|
||||
* window_rank_support
|
||||
* prosupport function for window_rank()
|
||||
*/
|
||||
Datum
|
||||
window_rank_support(PG_FUNCTION_ARGS)
|
||||
{
|
||||
Node *rawreq = (Node *) PG_GETARG_POINTER(0);
|
||||
|
||||
if (IsA(rawreq, SupportRequestWFuncMonotonic))
|
||||
{
|
||||
SupportRequestWFuncMonotonic *req = (SupportRequestWFuncMonotonic *) rawreq;
|
||||
|
||||
/* rank() is monotonically increasing */
|
||||
req->monotonic = MONOTONICFUNC_INCREASING;
|
||||
PG_RETURN_POINTER(req);
|
||||
}
|
||||
|
||||
PG_RETURN_POINTER(NULL);
|
||||
}
|
||||
|
||||
/*
|
||||
* dense_rank
|
||||
* Rank increases by 1 when key columns change.
|
||||
@@ -130,6 +172,27 @@ window_dense_rank(PG_FUNCTION_ARGS)
|
||||
PG_RETURN_INT64(context->rank);
|
||||
}
|
||||
|
||||
/*
|
||||
* window_dense_rank_support
|
||||
* prosupport function for window_dense_rank()
|
||||
*/
|
||||
Datum
|
||||
window_dense_rank_support(PG_FUNCTION_ARGS)
|
||||
{
|
||||
Node *rawreq = (Node *) PG_GETARG_POINTER(0);
|
||||
|
||||
if (IsA(rawreq, SupportRequestWFuncMonotonic))
|
||||
{
|
||||
SupportRequestWFuncMonotonic *req = (SupportRequestWFuncMonotonic *) rawreq;
|
||||
|
||||
/* dense_rank() is monotonically increasing */
|
||||
req->monotonic = MONOTONICFUNC_INCREASING;
|
||||
PG_RETURN_POINTER(req);
|
||||
}
|
||||
|
||||
PG_RETURN_POINTER(NULL);
|
||||
}
|
||||
|
||||
/*
|
||||
* percent_rank
|
||||
* return fraction between 0 and 1 inclusive,
|
||||
|
Reference in New Issue
Block a user