Re: about hive limit optimization settings

2013-01-24 Thread Nitin Pawar
hive has a feature for data sampling where you actually don't read the entire table but sample of the table. I suppose these parameters belong to those queries. more you can read at https://cwiki.apache.org/Hive/languagemanual-sampling.html On Fri, Jan 25, 2013 at 4:42 AM, Wu, James C. wrote:

Re: about hive limit optimization settings

2013-01-24 Thread Abdelrhman Shettia
Hi James. Basically if we have a table called table A which is mapped to a directory in hive /data/a . And n is the number of the files under /data/a with each row size s. hive -e "select * from a limit 10" To show the result very fast hive.limit.optimize.limit.file < n in this case will

about hive limit optimization settings

2013-01-24 Thread Wu, James C.
Hi, Do anyone know the meaning of these hive settings? The description of them are not clear to me. If someone can give me an example of how they shall be used, it would be great! hive.limit.row.max.size 10 When trying a smaller subset of data for simple LIMIT, how much size we need