On Fri, Oct 21, 2011 at 9:22 AM, john smith <js1987.sm...@gmail.com> wrote:
> Hi list, > > I am also facing the same problem. My reducers hang at this position and it > takes hours to complete a single reduce task. Can any hive guru help us out > with this issue. > > Thanks, > jS > > 2011/10/21 bangbig <lizhongliangg...@163.com> > >> HI all, >> >> HIVE runs too slowly when it is doing such things(see the log below), what's >> the problem? because I'm joining two large table? >> >> it runs pretty fast at first. when the job finishes 95%, it begins to slow >> down. >> >> -------------------------------------------------- >> >> INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 forwarding 1044000000 >> rows >> 2011-10-21 16:55:57,427 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1045000000 rows >> 2011-10-21 16:55:57,545 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1046000000 rows >> 2011-10-21 16:55:57,686 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1047000000 rows >> 2011-10-21 16:55:57,806 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1048000000 rows >> 2011-10-21 16:55:57,926 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1049000000 rows >> 2011-10-21 16:55:58,045 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1050000000 rows >> 2011-10-21 16:55:58,164 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1051000000 rows >> 2011-10-21 16:55:58,284 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1052000000 rows >> 2011-10-21 16:55:58,405 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1053000000 rows >> 2011-10-21 16:55:58,525 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1054000000 rows >> 2011-10-21 16:55:58,644 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1055000000 rows >> 2011-10-21 16:55:58,764 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1056000000 rows >> 2011-10-21 16:55:58,883 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1057000000 rows >> 2011-10-21 16:55:59,003 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1058000000 rows >> 2011-10-21 16:55:59,122 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1059000000 rows >> 2011-10-21 16:55:59,242 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1060000000 rows >> 2011-10-21 16:55:59,361 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1061000000 rows >> 2011-10-21 16:55:59,482 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1062000000 rows >> 2011-10-21 16:55:59,601 INFO org.apache.hadoop.hive.ql.exec.JoinOperator: 4 >> forwarding 1063000000 rows >> >> >> >> > It is hard to say without seeing the query, the table definition, and the explain. Please send the query. Although I have a theory: This query is not good: select a,b from a,b where a.id=b.id It does a Cart join. This query is better. select a,b from a inner join b on (a.id=b.id) Consider setting in your hive-site.xml hive.mapred.mode=strict It can prevent you from running dangerous queries.