[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Performance of CLUSTER



On 06/11/2012 09:25 AM, Mark Thornton wrote:

Certainly not --- the server only has 5GB of memory. Nevertheless I
don't expect quadratic behaviour for CLUSTER (n log n would be my
expected time).

And there it is. :)

Since that's the case, *DO NOT* create the symlink from pgsql_tmp to /dev/shm like I suggested before. You don't have enough memory for that, and it will likely cause problems. I need to stop assuming everyone has huge servers. I know low-end laptops have 4GB of RAM these days, but servers have longer shelf-lives, and VMs can play larger roles.

So here's the thing, and I should have honestly realized it the second I noted the >100x jump in execution time. All of your previous tables fit in memory. Nice, speedy, >100x faster than disk, memory. It's not that the table is only 10x larger than other tables in your examples, it's that the entire thing doesn't fit in memory.

Since it can't just read the table and assume it's in memory, reads have a chance to fetch from disk. Since it's also maintaining several temporary files for the new index and replacement table structures, it's fighting for random reads and writes during the whole process. That's in addition to any transaction log traffic and checkpoints since the process will span several.

Actually, your case is a good illustration of how memory and high-performance IO devices can reduce maintenance costs. If you played around with steadily increasing table sizes, I bet you could even find the exact row count and table size where the table no longer fits in PostgreSQL or OS cache, and suddenly takes 100x longer to process. That kind of steady table growth is often seen in databases, and admins sometimes see this without understanding why it happens.

--
Shaun Thomas
OptionsHouse | 141 W. Jackson Blvd. | Suite 500 | Chicago IL, 60604
312-444-8534
sthomas@xxxxxxxxxxxxxxxx

______________________________________________

See http://www.peak6.com/email_disclaimer/ for terms and conditions related to this email

--
Sent via pgsql-performance mailing list (pgsql-performance@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


[Postgresql General]     [Postgresql PHP]     [PHP Users]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Home]     [Yosemite]

Powered by Linux