Re: where kylin global dict store?

2017-01-05 Thread Alberto Ramón
(I answer to User Kylin Maillist) Check this references, please: http://kylin.apache.org/blog/2016/08/01/count-distinct-in-kylin/ there is an exemple "builder": "org.apache.kylin.dict.GlobalDictionaryBuilder" and Kylin 1904 BR, Alb 2017-01-0

ORC slipt

2017-03-31 Thread Alberto Ramón
Some doubts about ORC: *1- hive.exec.orc.default.buffer.size* is used for read or write? *2- orc.stripe.size* is compressed or uncompresed? *3- orc.stripe.size* must be multiple of HDFS block size? 4- For read ORC file , the numbers of mappers depends onr HDFS blocks or Stripe number? *5- hiv

Partition Pruning using UDF

2018-05-09 Thread Alberto Ramón
Hello We have a UDP to select the correct partition to read 'FindPartition': Select * from TB where partitionCol =FindPartition(); How I can avoid a full scan of all partitions? (Set MyPartition=FindPartition(); // Is not valid in Hive)

Re: Partition Pruning using UDF

2018-05-15 Thread Alberto Ramón
spector in the unit method > so that Hive knows that it can perform partition pruning with it. > > On Wed, 9 May 2018, 19:23 Alberto Ramón, > wrote: > >> Hello >> >> We have a UDP to select the correct partition to read 'FindPartition': >> Select *

Re: Partition Pruning using UDF

2018-05-20 Thread Alberto Ramón
urn, the Hive optimiser will know > that it can safely perform partition pruning on it. > > On Tue, 15 May 2018, 23:32 Alberto Ramón, > wrote: > >> Yes, I checked, by default all UDF are deterministic (LINK >> <https://hive.apache.org/javadocs/r1.2.2/api/org/apache/hadoo