I have opened https://issues.apache.org/jira/browse/HIVE-20287 and would
love to get feedback

On Wed, Aug 1, 2018 at 11:14 AM, Lars Francke <lars.fran...@gmail.com>
wrote:

> Thanks Lefty! I haven't created a JIRA but it's on my list.
>
> Still interested in any input anyone might have?
>
> Some of you guys _must_ have some knowledge on this stuff.
>
> You're correct that the DROP stuff is documented. What I meant is that I
> couldn't find the relevant parts in the code.
>
> On Sat, Jul 21, 2018 at 6:15 AM, Lefty Leverenz <leftylever...@gmail.com>
> wrote:
>
>> Agreed, the Hive wiki is woefully incomplete.  But it does mention DROP
>> for
>> external tables in two sections of the DDL doc:
>>
>>    - External Tables
>>    <https://cwiki.apache.org/confluence/display/Hive/LanguageM
>> anual+DDL#LanguageManualDDL-ExternalTables>:
>>     "When dropping an EXTERNAL table, data in the table is NOT deleted
>> from
>>    the file system."
>>    - Drop Table
>>    <https://cwiki.apache.org/confluence/display/Hive/LanguageM
>> anual+DDL#LanguageManualDDL-DropTable>:
>>     "When dropping an EXTERNAL table, data in the table will NOT be
>> deleted
>>    from the file system."
>>
>> A wiki page comparing managed and external tables would be very helpful,
>> and the list is a good start.  I suggest you open a JIRA issue.
>>
>> Who can help Lars with this task?
>>
>> -- Lefty
>>
>>
>> On Thu, Jul 19, 2018 at 3:04 PM Lars Francke <lars.fran...@gmail.com>
>> wrote:
>>
>> > Hi,
>> >
>> > I've been wondering if anyone can tell me what the differences and
>> > limitations of managed vs. external tables are these days.
>> >
>> > I've seen the docs from Hortonworks[1] that list these features as not
>> > supported by External tables:
>> > * Query cache
>> > * Materialized views, except in a limited way
>> > * Default statistics gathering
>> > * Compute queries using statistics
>> > * Automatic runtime filtering
>> > * File merging after insert
>> >
>> > But there are no details. For some of them I'm not even sure what they
>> are
>> > supposed to mean because I couldn't find any documentation in our Wiki
>> :(
>> >
>> > So I'd love to create a Wiki page that describes the differences between
>> > managed & external tables and list relevant JIRAs but I need your help
>> for
>> > that.
>> >
>> > I scanned the code for all references to TableType and this is the list
>> I
>> > found but I'm almost certainly missing something:
>> >
>> > * ARCHIVE/UNARCHIVE - DDLTask - Only works for managed tables
>> > * TRUNCATE - DDLSemanticAnalyzer - Only works for managed tables
>> > * MERGE/CONCATENATE - HiveRelOpMaterializationValidator - Only works
>> for
>> > managed tables
>> > * Constraints - DDLSemanticAnalyzer -  (NOT NULL, DEFAULT, CHECK, only
>> RELY
>> > ist allowed)
>> > * IMPORT - ImportSemanticAnalyzer - This has some wild restrictions I
>> > didn't follow for external tables
>> > * Query Results Caching - https://issues.apache.org/jira
>> /browse/HIVE-18513
>> > SemanticAnalyzer - Documentation missing for Results Cache
>> >
>> > So there's a bunch of those from the Hortonworks list missing. If anyone
>> > could point me to documentation or code for those that'd be great.
>> >
>> > I also didn't find the most obvious/oldest one: Data is not deleted for
>> > EXTERNAL tables on DROP, I'm sure I just missed it.
>> >
>> > And last but not least: I have not looked at the grammar to see what's
>> > already forbidden at that level.
>> >
>> > As I said: Any hints would be greatly appreciated.
>> >
>> > Thank you!
>> >
>> > Lars
>> >
>> > A side note: There are so many fantastic features in Hive but because
>> lots
>> > of them are not or under-documented most people I know just stick to the
>> > basics. That's a pity :(
>> >
>> > [1] <
>> >
>> > https://docs.hortonworks.com/HDPDocuments/HDP3/HDP-3.0.0/usi
>> ng-hiveql/content/hive_hive_3_tables.html
>> > >
>> >
>>
>
>

Reply via email to