[jira] [Commented] (HIVE-18051) qfiles: dataset support

Zoltan Haindrich (JIRA) Wed, 17 Jan 2018 04:50:18 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-18051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16328703#comment-16328703
 ]


Zoltan Haindrich commented on HIVE-18051:
-----------------------------------------

as a matter of fact yes, just a bunch of drop statements :)
{code:bash}
cat data/scripts/*cleanup*sql
{code}

I think if we really need it later; it will be easy to add it back...but they 
are only used for table drops... there might be in the future other things 
which will need to be cleaned up - but I think in those case we should do the 
"general approach" - like what we have currently for functions/tables...

I have to also note that I personally "hate" the cleanup script; because its 
executed prior to the tests; and its very annoying to see my breakpoints 
trigger for those statements --  I've to always disable the relevant ones; 
enable a set of "workaround" breakpoints and change everything back when my 
real statement is compiling..

> qfiles: dataset support
> -----------------------
>
>                 Key: HIVE-18051
>                 URL: https://issues.apache.org/jira/browse/HIVE-18051
>             Project: Hive
>          Issue Type: Improvement
>          Components: Testing Infrastructure
>            Reporter: Zoltan Haindrich
>            Assignee: Laszlo Bodor
>            Priority: Major
>         Attachments: HIVE-18051.01.patch, HIVE-18051.02.patch, 
> HIVE-18051.03.patch, HIVE-18051.04.patch, HIVE-18051.05.patch
>
>
> it would be great to have some kind of test dataset support; currently there 
> is the {{q_test_init.sql}} which is quite large; and I'm often override it 
> with an invalid string; because I write independent qtests most of the time - 
> and the load of {{src}} and other tables are just a waste of time for me ; 
> not to mention that the loading of those tables may also trigger breakpoints 
> - which is a bit annoying.
> Most of the tests are "only" using the {{src}} table and possibly 2 others; 
> however the main init script contains a bunch of tables - meanwhile there are 
> quite few other tests which could possibly also benefit from a more general 
> feature; for example the creation of {{bucket_small}} is present in 20 q 
> files.
> the proposal would be to enable the qfiles to be annotated with metadata like 
> datasets:
> {code}
> --! qt:dataset:src,bucket_small
> {code}
> proposal for storing a dataset:
> * the loader script would be at: {{data/datasets/__NAME__/load.hive.sql}}
> * the table data could be stored under that location
> a draft about this; and other qfiles related ideas:
> https://docs.google.com/document/d/1KtcIx8ggL9LxDintFuJo8NQuvNWkmtvv_ekbWrTLNGc/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (HIVE-18051) qfiles: dataset support

Reply via email to