git.postgresql.org Git - postgres-xl.git/commit

projects / postgres-xl.git / commit

summary | shortlog | log | commit | commitdiff | tree
(parent: 03162cb) | patch

author	Tomas Vondra <[email protected]>
	Thu, 13 Jul 2017 16:22:30 +0000 (18:22 +0200)
committer	Tomas Vondra <[email protected]>
	Thu, 13 Jul 2017 16:35:06 +0000 (18:35 +0200)
commit	6d4a2588c8b2d548321d24177381b3520c4deee3
tree	8dcdfc0fadb6a9eae9578bf8408ee8bcc8d5bf55	tree
parent	03162cb93078de77532bf08498d96345fe14ea68	commit \| diff

Build extended stats on coordinators during ANALYZE

When running ANALYZE on a coordinator, we simply fetch the statistics
built on datanodes, and keep stats from a random datanode (assuming all
datanodes are similar in terms of data volume and data distribution).

This was only done for regular per-attribute stats, though, not for the
extended statistics added in PostgreSQL 10, causing various failures in
stats_ext tests due to missing statistics. This commit fixes this gap
by using the same approach as for simple statistics - we collect stats
from datanodes and keep the first result we receive for each statistic.

While working on this I realized this approach has some inherent issues,
particularly on columns that are distribution keys. As we keep stats
from a random node, we completely ignore MCV and histograms from the
remaining nodes. That may cause planning issues, but addressing it is
out of scope for this commit.

src/backend/commands/analyze.c		diff \| blob \| blame \| history
src/test/regress/expected/stats_ext.out		diff \| blob \| blame \| history

Official repo for Postgres-XL. Stable branch is XL9_5_STABLE. Current development is PG10 compatible. Controlled by Postgres-X2 Core Team.