Hi,

We are exploring hive for a very large data warehouse (Up to 2 PB data size) 
and 
would like to get some information

1. What are your experiences on using hive for large data warehouses
2. What is biggest hive implementation that you have seen
3. How is the query performance with peta bytes of data
4. Details on configurations that you have used/seen (such as CPU numbers and 
capacity, Disk sizes, cost per node etc)

Any help on this will enable us to take better decision.

Thanks for your help,
Sheetal


      

Reply via email to