Could not Name UDF using Reserved Word
--------------------------------------

                 Key: HIVE-2717
                 URL: https://issues.apache.org/jira/browse/HIVE-2717
             Project: Hive
          Issue Type: Bug
          Components: Query Processor
            Reporter: Zhenxiao Luo
            Assignee: Zhenxiao Luo


Parser/SemanticAnalyzer prevent Naming UDF using Reserved Word(eg. sort, format)

Even with the following changes in Hive Grammer:

[~/Code/hive]git diff ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
diff --git a/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 
b/ql/src/java/org/apache/hadoop/h
index 888bf47..ec256de 100644
--- a/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
+++ b/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
@@ -1816,7 +1816,7 @@ functionName
 @init { msgs.push("function name"); }
 @after { msgs.pop(); }
     : // Keyword IF is also a function name
-    Identifier | KW_IF | KW_ARRAY | KW_MAP | KW_STRUCT | KW_UNIONTYPE
+    Identifier | KW_IF | KW_ARRAY | KW_MAP | KW_STRUCT | KW_UNIONTYPE | KW_SORT
     ;
 
 castExpression
@@ -2091,6 +2091,7 @@ sysFuncNames
     | KW_MAP
     | KW_STRUCT
     | KW_UNIONTYPE
+    | KW_SORT
     | EQUAL
     | NOTEQUAL
     | LESSTHANOREQUALTO


Semantic analysis always reports error:

-- Evaluate function against STRING valued keys
EXPLAIN
SELECT sort(array("b", "d", "c", "a")) FROM src LIMIT 1
2012-01-09 11:31:55,134 INFO  parse.ParseDriver (ParseDriver.java:parse(426)) - 
Parsing command:

-- Evaluate function against STRING valued keys
EXPLAIN
SELECT sort(array("b", "d", "c", "a")) FROM src LIMIT 1
2012-01-09 11:31:55,146 INFO  parse.ParseDriver (ParseDriver.java:parse(443)) - 
Parse Completed
2012-01-09 11:31:55,147 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:analyzeInternal(7445)) - Starting Semantic Analysis
2012-01-09 11:31:55,148 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:analyzeInternal(7475)) - Completed phase 1 of Semantic 
Analysis
2012-01-09 11:31:55,148 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:getMetaData(942)) - Get metadata for source tables
2012-01-09 11:31:55,149 INFO  metastore.HiveMetaStore 
(HiveMetaStore.java:logInfo(528)) - 0: get_table : db=default tbl=src
2012-01-09 11:31:55,200 INFO  hive.log 
(MetaStoreUtils.java:getDDLFromFieldSchema(457)) - DDL: struct src { string 
key, string value}
2012-01-09 11:31:55,200 DEBUG lazy.LazySimpleSerDe 
(LazySimpleSerDe.java:initialize(195)) - 
org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe initialized with: 
columnNames=[key, value] columnTypes=[string, string] separator=[[B@3bb20e65] 
nullstring=\N lastColumnTakesRest=false
2012-01-09 11:31:55,200 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:getMetaData(1021)) - Get metadata for subqueries
2012-01-09 11:31:55,201 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:getMetaData(1035)) - Get metadata for destination tables
2012-01-09 11:31:55,201 INFO  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:analyzeInternal(7478)) - Completed getting MetaData in 
Semantic Analysis
2012-01-09 11:31:55,203 INFO  hive.log 
(MetaStoreUtils.java:getDDLFromFieldSchema(457)) - DDL: struct src { string 
key, string value}
2012-01-09 11:31:55,203 DEBUG lazy.LazySimpleSerDe 
(LazySimpleSerDe.java:initialize(195)) - 
org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe initialized with: 
columnNames=[key, value] columnTypes=[string, string] separator=[[B@12e84396] 
nullstring=\N lastColumnTakesRest=false
2012-01-09 11:31:55,222 DEBUG parse.SemanticAnalyzer 
(SemanticAnalyzer.java:genTablePlan(6598)) - Created Table Plan for src 
org.apache.hadoop.hive.ql.exec.TableScanOperator@5e9ea579
2012-01-09 11:31:55,223 DEBUG parse.SemanticAnalyzer 
(SemanticAnalyzer.java:genSelectPlan(2117)) - tree: (TOK_SELECT (TOK_SELEXPR 
(TOK_FUNCTION sort (TOK_FUNCTION array "b" "d" "c" "a"))))
2012-01-09 11:31:55,225 DEBUG parse.SemanticAnalyzer 
(SemanticAnalyzer.java:genSelectPlan(2222)) - genSelectPlan: input = 
src{(key,key: string)(value,value: 
string)(block__offset__inside__file,BLOCK__OFFSET__INSIDE__FILE: 
bigint)(input__file__name,INPUT__FILE__NAME: string)}
2012-01-09 11:31:55,234 ERROR ql.Driver (SessionState.java:printError(380)) - 
FAILED: Error in semantic analysis: Line 5:7 Arguments length mismatch 'sort': 
The function SORT(array(obj1, obj2,...)) needs one argument.
org.apache.hadoop.hive.ql.parse.SemanticException: Line 5:7 Arguments length 
mismatch 'sort': The function SORT(array(obj1, obj2,...)) needs one argument.
    at 
org.apache.hadoop.hive.ql.parse.TypeCheckProcFactory$DefaultExprProcessor.process(TypeCheckProcFactory.java:810)
    at 
org.apache.hadoop.hive.ql.lib.DefaultRuleDispatcher.dispatch(DefaultRuleDispatcher.java:89)
    at 
org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.dispatch(DefaultGraphWalker.java:88)
    at 
org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.walk(DefaultGraphWalker.java:125)
    at 
org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.startWalking(DefaultGraphWalker.java:102)
    at 
org.apache.hadoop.hive.ql.parse.TypeCheckProcFactory.genExprNode(TypeCheckProcFactory.java:161)
    at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genExprNodeDesc(SemanticAnalyzer.java:7708)
    at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genSelectPlan(SemanticAnalyzer.java:2301


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to