Input Sampling By Splits
------------------------

                 Key: HIVE-2121
                 URL: https://issues.apache.org/jira/browse/HIVE-2121
             Project: Hive
          Issue Type: New Feature
            Reporter: Siying Dong
            Assignee: Siying Dong

We need better input sampling to serve at least two purposes:
1. Let users test their queries against a smaller data set.
2. Help users understand what the data looks like without scanning the whole table.

A simple function that returns a subset of the input splits will help in those cases. It doesn't have to be strict sampling.
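
To make the request concrete, here is a minimal sketch of what "a function that gives a subset of splits" could look like. It is only an illustration, not a proposed patch: `SplitSampler`, `Split`, and `sampleSplits` are hypothetical names, and `Split` is a stand-in for a real input split (a path plus a byte length). The sketch assumes the sample is specified as a fraction of total input size and simply takes leading splits until that size is reached, which is deliberately not a strict sample.

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSampler {

    /** Hypothetical stand-in for an input split: a path plus its length in bytes. */
    public static final class Split {
        public final String path;
        public final long length;

        public Split(String path, long length) {
            this.path = path;
            this.length = length;
        }
    }

    /**
     * Returns the leading splits whose combined length first reaches
     * roughly `fraction` of the total input size. Whole splits are kept,
     * so the result may overshoot the target; this is not a strict sample.
     */
    public static List<Split> sampleSplits(List<Split> splits, double fraction) {
        if (fraction <= 0.0 || fraction > 1.0) {
            throw new IllegalArgumentException("fraction must be in (0, 1]");
        }
        long total = 0;
        for (Split s : splits) {
            total += s.length;
        }
        long target = (long) Math.ceil(total * fraction);

        List<Split> chosen = new ArrayList<>();
        long taken = 0;
        for (Split s : splits) {
            if (taken >= target) {
                break; // enough data selected; skip the remaining splits
            }
            chosen.add(s);
            taken += s.length;
        }
        return chosen;
    }

    public static void main(String[] args) {
        List<Split> splits = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            splits.add(new Split("part-" + i, 100L));
        }
        List<Split> sample = sampleSplits(splits, 0.25);
        System.out.println(sample.size() + " of " + splits.size() + " splits selected");
    }
}
```

Taking leading splits keeps the selection cheap and deterministic, at the cost of bias toward whatever data happens to come first. Choosing splits at random would reduce that bias if it turns out to matter.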