Prasanth Jayachandran created HIVE-17417:
--------------------------------------------
Summary: Lazy Timestamp and Date serialization is very expensive
Key: HIVE-17417
URL: https://issues.apache.org/jira/browse/HIVE-17417
Project: Hive
Issue Type: Bug
Components: Serializers/Deserializers
Affects Versions: 3.0.0, 2.4.0
Reporter: Prasanth Jayachandran
Assignee: Prasanth Jayachandran
Priority: Critical
In a specific case where a schema contains an array<struct> column with
timestamp and date fields (array size > 10,000), any access to this column is
very expensive in terms of CPU, because most of the time is spent serializing
the timestamp and date values. Refer to the attached profiles: more than 70% of
the time is spent in serialization and toString conversions.
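
To make the cost pattern concrete, here is a minimal standalone sketch. It is
not Hive's serializer code: the class name, the Row type, and the helper
methods are hypothetical, and java.time values stand in for Hive's own
timestamp and date types. It only illustrates the shape of the work described
above, where every element of a large array gets a fresh string conversion and
a fresh byte[] for both its timestamp and its date.

```java
import java.nio.charset.StandardCharsets;
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names come from Hive.
final class TimestampSerializationSketch {

    // Stand-in for one struct element holding a timestamp and a date field.
    static final class Row {
        final LocalDateTime ts;
        final LocalDate date;

        Row(LocalDateTime ts, LocalDate date) {
            this.ts = ts;
            this.date = date;
        }
    }

    // Fixed-width text form: always 19 characters for four-digit years.
    private static final DateTimeFormatter TS_FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");

    // Builds n rows, one second apart, starting at 2017-08-30 00:00:00.
    static List<Row> buildRows(int n) {
        List<Row> rows = new ArrayList<>(n);
        LocalDateTime base = LocalDateTime.of(2017, 8, 30, 0, 0, 0);
        for (int i = 0; i < n; i++) {
            LocalDateTime ts = base.plusSeconds(i);
            rows.add(new Row(ts, ts.toLocalDate()));
        }
        return rows;
    }

    // Converts every field to text and then to UTF-8 bytes, allocating a new
    // String and a new byte[] per field. Returns the total bytes produced.
    static long serializeAll(List<Row> rows) {
        long totalBytes = 0;
        for (Row r : rows) {
            byte[] tsBytes = TS_FORMAT.format(r.ts).getBytes(StandardCharsets.UTF_8);
            byte[] dateBytes = r.date.toString().getBytes(StandardCharsets.UTF_8);
            totalBytes += tsBytes.length + dateBytes.length;
        }
        return totalBytes;
    }

    public static void main(String[] args) {
        List<Row> rows = buildRows(20_000);
        long start = System.nanoTime();
        long bytes = serializeAll(rows);
        long elapsedMicros = (System.nanoTime() - start) / 1_000;
        System.out.println(bytes + " bytes in " + elapsedMicros + " us");
    }
}
```

The timing printed by main is only indicative and will vary by machine and JVM
warm-up. The attached profiles remain the authoritative measurement for the
actual Hive code path.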