On 08/23/2012 04:12 AM, Denis Kolesnik wrote:

Suppose a person with basic SQL knowledge wanted to learn, in practice, what the result of a query would be if they added the clause "limit 1" to it

Then they just got bitten by not learning enough and not testing their code well enough; they were probably programming by recipe and copy-and-paste, not by learning the platform they're working with.


http://www.postgresql.org/docs/9.1/static/sql-select.html#SQL-ORDERBY

"The ORDER BY clause causes the result rows to be sorted according to the specified expression(s). If two rows are equal according to the leftmost expression, they are compared according to the next expression and so on. If they are equal according to all specified expressions, they are returned in an implementation-dependent order."


It'd be really nice if every programming language and tool could be completely safe and easy, with no undefined, implementation-defined or inconsistent behaviour. Unfortunately, in the real world that doesn't happen because perfectly specified platforms are (a) really hard to actually write and (b) usually hard to optimise and thus slow.


Suppose a person with basic C knowledge wrote this (utterly wrong and dangerous, do not use for anything) program:

#include <stdio.h>
#include <string.h>
#include <malloc.h>
int main() {
        char * blah = (char*)malloc(10);
        strcpy(blah,"1234567890");
        printf("%s\n", blah);
}

This program has *at* *least* one bug. It will appear to work most of the time, but fail unpredictably, especially when used as part of a larger program rather than standalone. Whether and how it fails will depend on the platform, C library, kernel, compiler settings, and what happens to sit in the memory adjacent to the allocation.

Is the platform responsible when the user shoots themselves in the foot because they didn't learn about null termination of strings, buffer over-runs, the dangers of strcpy(), etc.? To me it's a bug in the user's code, not in the platform.

Sure, the platform could be easier to use. It could add lots of bounds checks, prohibit raw memory access, use garbage collection instead of explicit pointer-based memory management, etc. Then you'd have a new platform called Java, which is very useful - but not really something you can use to write tiny programs that take microseconds to run, or high-performance operating system kernels.

Even Java has plenty of traps and confusing characteristics. Anything to do with threads. finalize() methods. try {} catch {} finally {} constructs. Double-checked locking. Plenty more. That's in a language that was designed to be an easier and safer alternative to C.

Everything is a compromise, including the SQL language and its implementations. If Pg made underspecified sorts an error, lots of people would scream "bug!": pretty much every other database system lets you do this, so it'd be a portability problem - and it's genuinely useful behaviour for some purposes. If Pg's query planner always guaranteed stable sorts and always produced the same ordering, people wouldn't use Pg because it'd be too slow.

More importantly, PostgreSQL has no way of *knowing* for sure that the sort is underspecified. It can't know that the column you've specified isn't unique, or at least unique within the subset of data you're working with. It trusts you to know what you want.
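
If you need a deterministic result, the fix is on your side of the fence: fully specify the sort yourself. Sticking with the hypothetical items table above, adding a unique column as a tiebreaker is usually enough.

-- id is unique, so the ordering - and therefore the single row
-- returned by LIMIT 1 - is the same for every run and every plan.
SELECT * FROM items ORDER BY price, id LIMIT 1;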

The trick is to read the documentation, learn, and test your code well.

That's true of every language, even those that try to protect the programmer from their mistakes as much as possible.

--
Craig Ringer


--
Sent via pgsql-bugs mailing list (pgsql-bugs@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-bugs
