I saw bunch of failures related to disk space issue

https://issues.apache.org/jira/browse/HIVE-11416?focusedCommentId=14650616&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14650616
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/4792/console

Some of then throws IOException failing to create directories.
https://issues.apache.org/jira/browse/HIVE-11304?focusedCommentId=14651250&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14651250
https://issues.apache.org/jira/browse/HIVE-11405?focusedCommentId=14654006&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14654006

Some are 404 not found issue
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/4793/console

The patches that I submitted recently had multiple runs and many attempts 
failed with one of the above reasons. I initially thought HIVE-11304 patches 
are generating huge log files because of log4j2 migration but then later found 
out that even trunk is generating huge log files (Sergio also found the same). 
Commits before HIVE-10166 seems to be generating smaller files in my test runs 
as well but I don’t have the exact number. I didn’t spend much time on 
debugging the issue but running TestJDBCWithMiniHS2 in trunk seems to emitting 
way too many thrift warnings than before.

> On Aug 5, 2015, at 11:09 AM, Xuefu Zhang <xzh...@cloudera.com> wrote:
> 
> Prasanth,
> 
> Could you send me a URL that shows the warning msg that you were seeing?
> I'm not sure if it's related to the thrift version that were used to
> regenerate the code.
> 
> Thanks,
> Xuefu
> 
> On Wed, Aug 5, 2015 at 10:54 AM, Prasanth Jayachandran <
> pjayachand...@hortonworks.com> wrote:
> 
>> Thanks Sergio for creating a follow up JIRA!
>> 
>>> On Aug 5, 2015, at 10:49 AM, Sergio Pena <sergio.p...@cloudera.com>
>> wrote:
>>> 
>>> Thanks Prasanth.
>>> 
>>> I run the same test you did, and I found the following sizes:
>>> 
>>> BEFORE HIVE-10166
>>> *13M* Aug  5 11:57 ./hive-unit/target/tmp/log/hive.log
>>> 
>>> WITH HIVE-10166
>>> *2.4G* Aug  5 12:07 ./hive-unit/target/tmp/log/hive.log
>>> 
>>> CURRENT HEAD
>>> *3.2G* Aug  5 12:36 ./hive-unit/target/tmp/log/hive.log
>>> 
>>> HIVE-10166 is adding more size to the file. But, there are other commits
>>> that are adding more values to it that we should investigate.
>>> 
>>> I created a JIRA to track this issue.
>>> https://issues.apache.org/jira/browse/HIVE-11466
>>> 
>>> 
>>> On Mon, Aug 3, 2015 at 3:29 PM, Prasanth Jayachandran <
>>> pjayachand...@hortonworks.com> wrote:
>>> 
>>>> Hi Sergio
>>>> 
>>>> This seems to be related to recent merge in HIVE-10166. I checked out a
>>>> commit prior to HIVE-10166 and ran TestJdbcWithMiniHS2 and the log seems
>>>> reasonable.
>>>> After HIVE-10166, the log file is getting too many WARN msgs (this log
>> msg
>>>> repeats and fills up the disk)
>>>> 
>>>> org.apache.thrift.transport.TTransportException: No underlying server
>>>> socket.
>>>>  at
>>>> 
>> org.apache.thrift.transport.TServerSocket.acceptImpl(TServerSocket.java:126)
>>>>  at
>>>> 
>> org.apache.thrift.transport.TServerSocket.acceptImpl(TServerSocket.java:35)
>>>>  at
>>>> 
>> org.apache.thrift.transport.TServerTransport.accept(TServerTransport.java:60)
>>>>  at
>>>> 
>> org.apache.thrift.server.TThreadPoolServer.serve(TThreadPoolServer.java:161)
>>>>  at
>>>> 
>> org.apache.hive.service.cli.thrift.ThriftBinaryCLIService.run(ThriftBinaryCLIService.java:100)
>>>> 
>>>> Can someone take a look?
>>>> 
>>>> Thanks
>>>> Prasanth
>>>> 
>>>>> On Aug 2, 2015, at 9:29 PM, Prasanth Jayachandran <
>>>> pjayachand...@hortonworks.com> wrote:
>>>>> 
>>>>> Hi Sergio
>>>>> 
>>>>> Thanks for looking into this. It could be related to my patch
>> HIVE-11304
>>>> (Log4j2 migration). I might have mistakenly specified log4j2 threshold
>> to
>>>> ALL level somewhere resulting in DEBUG level logging. I will look into
>> it.
>>>>> 
>>>>> Thanks
>>>>> Prasanth
>>>>> 
>>>>>> On Aug 2, 2015, at 9:21 PM, Sergio Pena <sergio.p...@cloudera.com>
>>>> wrote:
>>>>>> 
>>>>>> Hi Prasanth,
>>>>>> 
>>>>>> I see there are some logs in the system that are too big, and using
>> many
>>>>>> space. Jenkins will delete those logs eventually.
>>>>>> These are some of the logs bigger than 1G that I found:
>>>>>> 
>>>>>> *13G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4789/succeeded/TestJdbcWithMiniHS2/hive.log*
>>>>>> *9.9G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4790/succeeded/TestJdbcWithMiniHS2/hive.log
>>>>>> <<<  HIVE-11416*
>>>>>> *5.5G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4790/succeeded/TestSchedulerQueue/hive.log*
>>>>>> *4.9G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4789/succeeded/TestSchedulerQueue/hive.log*
>>>>>> *4.6G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4792/succeeded/TestSchedulerQueue/hive.log*
>>>>>> *4.1G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-Upload-10/succeeded/TestSchedulerQueue/hive.log*
>>>>>> 2.0G ./logs/PreCommit-HIVE-TRUNK-Build-4792/succeeded/TestSSL/hive.log
>>>>>> 1.9G ./logs/PreCommit-HIVE-TRUNK-Build-4790/failed/TestSSL/hive.log
>>>>>> 1.8G ./logs/PreCommit-HIVE-TRUNK-Build-4789/succeeded/TestSSL/hive.log
>>>>>> 1.8G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-Upload-10/succeeded/TestJdbcWithMiniHS2/hive.log
>>>>>> 1.7G
>>>>>> 
>>>> 
>> ./logs/HIVE-TRUNK-HADOOP-2-1/succeeded/TestSparkCliDriver-date_udf.q-join23.q-auto_join4.q-and-12-more/spark.log
>>>>>> 1.7G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4789/succeeded/TestSparkCliDriver-timestamp_lazy.q-bucketsortoptimize_insert_4.q-date_udf.q-and-12-more/spark.log
>>>>>> 1.7G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4790/succeeded/TestSparkCliDriver-timestamp_lazy.q-bucketsortoptimize_insert_4.q-date_udf.q-and-12-more/spark.log
>>>>>> 1.7G
>>>>>> 
>>>> 
>> ./logs/PreCommit-HIVE-TRUNK-Build-4792/succeeded/TestSparkCliDriver-timestamp_lazy.q-bucketsortoptimize_insert_4.q-date_udf.q-and-12-more/spark.log
>>>>>> 
>>>>>> *TestJdbcWithMiniHS2* is one causing this issue. Is debug enabled on
>>>> this
>>>>>> log?
>>>>>> 
>>>>>> - Sergio
>>>>>> 
>>>>>> 
>>>>>> 
>>>>>> On Sun, Aug 2, 2015 at 7:01 PM, Prasanth Jayachandran <
>>>>>> pjayachand...@hortonworks.com> wrote:
>>>>>> 
>>>>>>> Looks like there is something wrong with the precommit tests.
>>>>>>> The tests runs through but throws IOException or runs out of disk.
>>>>>>> https://issues.apache.org/jira/browse/HIVE-11416
>>>>>>> https://issues.apache.org/jira/browse/HIVE-11304
>>>>>>> 
>>>>>>> Can someone take a look whats going on?
>>>>>>> 
>>>>>>> Thanks
>>>>>>> Prasanth
>>>>>>> 
>>>>> 
>>>>> 
>>>> 
>>>> 
>> 
>> 

Reply via email to