Hi,
> I wanted to understand why hive has a performance issue with using _
> character in queries.
This is somewhat of a missed optimization issue - the "%" impl uses a fast
BoyerMoore algorithm and avoids converting from utf-8 bytes -> String.
http://mail-archives.apache.org/mod_mbox/hive-use
Hi,
I wanted to understand why hive has a performance issue with using _
character in queries.
For eg
- select object from table where object like '*%SR_311%*' and
date='2017-11-23' t*akes hours*
Same query (modified) completes in < 5 mins
- select object from table where object like