[ https://issues.apache.org/jira/browse/HIVE-7702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14105838#comment-14105838 ]
Brock Noland commented on HIVE-7702:
------------------------------------

Hi Chinna,

Thank you! Using git and the following command, I was able to compare the results against MR:

{noformat}
git status | awk '/new file:/ {print $NF}' | xargs -I {} sh -c 'diff {} $(echo {} | perl -pe "s@/spark@@g")'
{noformat}

Do you know if the differences are due to sorting order or correctness?

> Start running .q file tests on spark [Spark Branch]
> ---------------------------------------------------
>
>                 Key: HIVE-7702
>                 URL: https://issues.apache.org/jira/browse/HIVE-7702
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Brock Noland
>            Assignee: Chinna Rao Lalam
>         Attachments: HIVE-7702-spark.patch, HIVE-7702.1-spark.patch
>
>
> Spark can currently only support a few queries; however, there are some .q
> file tests which will pass today. The basic idea is that we should get some
> number of these (10-20) working so we can actually start testing the
> project.
> A good starting point might be the udf*, varchar*, or alter* tests:
> https://github.com/apache/hive/tree/spark/ql/src/test/queries/clientpositive
> To generate the output file for test XXX.q, you'd do:
> {noformat}
> mvn clean install -DskipTests -Phadoop-2
> cd itests
> mvn clean install -DskipTests -Phadoop-2
> cd qtest-spark
> mvn test -Dtest=TestCliDriver -Dqfile=XXX.q -Dtest.output.overwrite=true -Phadoop-2
> {noformat}
> which would generate XXX.q.out, which we can check in to source control as a
> "golden file".
> Multiple tests can be run at a given time like so:
> {noformat}
> mvn test -Dtest=TestCliDriver -Dqfile=X1.q,X2.q -Dtest.output.overwrite=true -Phadoop-2
> {noformat}
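
For reference, here is a rough sketch of the comparison one-liner from the comment, split into named steps, with a crude check for the "sorting order or correctness" question. The function names (mr_path, classify, compare_new_files) are made up for illustration and are not part of Hive or of the patch. The order check is only a heuristic: two files with the same lines in a different order are reported as "order-only", which does not prove the results are correct.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, not from the Hive source tree.

# Map a Spark golden-file path to its MR counterpart by removing "/spark",
# mirroring the perl substitution s@/spark@@g in the original command.
mr_path() {
    printf '%s\n' "$1" | sed 's@/spark@@g'
}

# Classify how two result files differ:
#   identical   - byte-for-byte the same
#   order-only  - same lines, different order (heuristic)
#   content     - the lines themselves differ
classify() {
    if diff "$1" "$2" > /dev/null; then
        echo identical
    elif [ "$(sort "$1")" = "$(sort "$2")" ]; then
        echo order-only
    else
        echo content
    fi
}

# For every file git reports as "new file:", compare it with its MR counterpart.
# A path without "/spark" maps to itself and will be reported as identical.
compare_new_files() {
    git status | awk '/new file:/ {print $NF}' | while IFS= read -r f; do
        mr=$(mr_path "$f")
        if [ -f "$mr" ]; then
            echo "$f: $(classify "$f" "$mr")"
        else
            echo "$f: no MR counterpart at $mr"
        fi
    done
}
```

To use it, source the script from the root of the checkout after staging the new .q.out files and run compare_new_files. Files reported as "content" are the ones worth looking at by hand first.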