Re: [GENERAL] Selecting K random rows - efficiently!

cluster Wed, 24 Oct 2007 02:10:19 -0700

Another way to look at the problem is: How do I sample a subset of size K efficiently? A query like


   SAMPLE 1000 OF
   (SELECT * FROM mydata WHERE <some condition>)

should return 1000 random rows from the select statement, so that two consecutive evaluations of the query would return the same 1000 rows only with very small probability.

(Yes, I know that "SAMPLE 1000 OF" is not valid SQL)
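
For reference, the behaviour I am after can be written in valid SQL as ORDER BY random() LIMIT K. Below is a minimal sketch of that baseline, not an efficient solution: the whole filtered set is sorted before the limit applies. It uses Python's sqlite3 module as a stand-in for PostgreSQL so that it runs self-contained. The columns "id" and "value", the condition "value % 2 = 0", and the helper name sample_rows are assumptions made up for illustration.

```python
import sqlite3


def sample_rows(conn, k):
    """Return k rows drawn at random from the filtered set (hypothetical helper)."""
    # ORDER BY random() gives every qualifying row a random sort key, so the
    # full filtered set is sorted before LIMIT keeps the first k rows.
    query = (
        "SELECT id, value FROM mydata "
        "WHERE value % 2 = 0 "  # stand-in for <some condition>
        "ORDER BY random() LIMIT ?"
    )
    return conn.execute(query, (k,)).fetchall()


# In-memory table with made-up data, standing in for "mydata".
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE mydata (id INTEGER PRIMARY KEY, value INTEGER)")
conn.executemany(
    "INSERT INTO mydata (value) VALUES (?)", [(i,) for i in range(10000)]
)

first = sample_rows(conn, 1000)
second = sample_rows(conn, 1000)
```

The same SELECT should work unchanged in PostgreSQL, since random() and LIMIT exist there too. The cost grows with the size of the filtered set rather than with K, which is exactly what I would like to avoid.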

