hive has a feature for data sampling where you actually don't read the
entire table but sample of the table.
I suppose these parameters belong to those queries.
more you can read at
https://cwiki.apache.org/Hive/languagemanual-sampling.html
On Fri, Jan 25, 2013 at 4:42 AM, Wu, James C. wrote:
Hi James.
Basically if we have a table called table A which is mapped to a directory in
hive /data/a . And n is the number of the files under /data/a with each row
size s.
hive -e "select * from a limit 10"
To show the result very fast
hive.limit.optimize.limit.file < n
in this case will
Hi,
Do anyone know the meaning of these hive settings? The description of them are
not clear to me. If someone can give me an example of how they shall be used,
it would be great!
hive.limit.row.max.size
10
When trying a smaller subset of data for simple LIMIT, how much
size we need