[ https://issues.apache.org/jira/browse/HIVE-24672?focusedWorklogId=539214&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-539214 ]
ASF GitHub Bot logged work on HIVE-24672:
-----------------------------------------

Author: ASF GitHub Bot
Created on: 21/Jan/21 17:29
Start Date: 21/Jan/21 17:29
Worklog Time Spent: 10m
Work Description: abstractdog commented on a change in pull request #1896:
URL: https://github.com/apache/hive/pull/1896#discussion_r562065992

##########
File path: ql/src/test/queries/clientnegative/compute_stats_long.q
##########
@@ -3,5 +3,5 @@ create table tab_int(a int);
 -- insert some data
 LOAD DATA LOCAL INPATH "../../data/files/int.txt" INTO TABLE tab_int;
--- compute stats should raise an error since the number of bit vectors > 1024
-select compute_stats(a, 10000) from tab_int;
+-- this should raise an error since the number of bit vectors has a hard limit at 1024
+select compute_stats(a, 'fm', 10000) from tab_int;

Review comment:
thanks

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

Worklog Id: (was: 539214)
Time Spent: 40m (was: 0.5h)

> compute_stats_long.q fails for wrong reasons
> --------------------------------------------
>
> Key: HIVE-24672
> URL: https://issues.apache.org/jira/browse/HIVE-24672
> Project: Hive
> Issue Type: Bug
> Reporter: Mustafa İman
> Assignee: Mustafa İman
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> TestNegativeCliDriver[compute_stats_long] intends to test that fmsketch has a hard
> limit on the number of bit vectors (1024). However, the test fails for the
> following wrong reason.
> {code:java}
> Caused by: java.lang.RuntimeException: Can not recognize 10000
>   at org.apache.hadoop.hive.common.ndv.NumDistinctValueEstimatorFactory.getEmptyNumDistinctValueEstimator(NumDistinctValueEstimatorFactory.java:71)
> {code}
> Instead it should fail with
> {code:java}
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: The maximum allowed value for number of bit vectors is 1024, but was passed 10000 bit vectors
>   at org.apache.hadoop.hive.ql.udf.generic.GenericUDAFComputeBitVectorFMSketch$NumericStatsEvaluator.iterate(GenericUDAFComputeBitVectorFMSketch.java:125) ~[hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> {code}
> Since this function is superseded by compute_bit_vector_fm, it is best if we
> add the same test for compute_bit_vector_fm too.
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)