Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
The limit clause (or the method DataFrame.limit
if you are using Spark)
is a better alternative if randomness is not critical.
PostgreSQL
SELECT id from table TABLESAMPLE BERNOULLI(10) WHERE …