Hi all,

My question is: which is the better fit for MPP-style workloads, Spark or Impala? The motivating case is as follows:

1. Three big tables need to be joined (about 100 fields per table, more than 1 TB per table).
2. Besides the tables above, they will very likely also need to be joined with n other small or medium-sized tables.
3. All of the join operations will be random, and they need a very quick response.
I look forward to your authoritative help!

Best regards,
liuguodong