[ https://issues.apache.org/jira/browse/HIVE-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718433#comment-16718433 ]
Johannes Alberti commented on HIVE-10144:
-----------------------------------------

The same issue still exists: the big file (itests/thirdparty/spark-1.2.0-bin-hadoop2-without-hive.tgz) is still in the history of the current github.com/apache/hive repo. The file was introduced with commit https://github.com/apache/hive/commit/5feb58db2c99627cb41a747a097a0ec4b019d60c and is still reachable in the history via branch 'spark' and tag 'master_2015_11_30'. Mirroring to any GitHub Enterprise instance with a binary file size limit of 100 MB or less fails because of these references.

{noformat}
$ git log --all --full-history -- '*spark-1.2.0-bin-hadoop2-without-hive*'
[...]
$ git branch --all --contains 5feb58db2c99627cb41a747a097a0ec4b019d60c
  remotes/origin/spark
$ git tag --contains 5feb58db2c99627cb41a747a097a0ec4b019d60c
master_2015_11_30
{noformat}

I was able to create a partial mirror by excluding the spark branch and the old tag:

{noformat}
$ git clone https://github.com/apache/hive.git
$ cd hive
$ git remote set-url --push origin https://github.foo.bar/foobar/hive
$ git branch -r -d origin/spark
$ git tag -d master_2015_11_30
$ git reflog expire --expire=now --all && git gc --prune=now --aggressive
$ git push --prune origin '+refs/remotes/origin/*:refs/heads/*' '+refs/tags/*:refs/tags/*'
{noformat}

> [LLAP] merge brought in file blocking github sync
> -------------------------------------------------
>
>                 Key: HIVE-10144
>                 URL: https://issues.apache.org/jira/browse/HIVE-10144
>             Project: Hive
>          Issue Type: Bug
>          Components: Build Infrastructure
>            Reporter: Szehon Ho
>            Assignee: Gunther Hagleitner
>            Priority: Major
>
> r1669718 brought in a file that is not in source control on llap branch:
> [http://svn.apache.org/repos/asf/hive/branches/llap/itests/thirdparty/|http://svn.apache.org/repos/asf/hive/branches/llap/itests/thirdparty/]
> It is a file downloaded during test build and should not be in source control. It is actually blocking the github sync as it's too large. See INFRA-9360

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
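
Before attempting a mirror like the one in the comment above, it can help to check which blobs in the history would exceed the target's size limit. The following is only a rough sketch and not part of the original comment: the function name `list_large_blobs` is hypothetical, and the 100 MB default threshold is an assumption based on the limit discussed above.

```shell
#!/bin/sh
# list_large_blobs is a hypothetical helper name, not a git command.
# It prints "<size-in-bytes> <path>" for every blob reachable from any ref
# whose size exceeds the given threshold, largest first.
# The default threshold (104857600 bytes = 100 MB) is an assumption.
list_large_blobs() {
    limit="${1:-104857600}"
    # rev-list emits "<sha> <path>" for each reachable object; cat-file
    # resolves type and size and echoes the path back via %(rest).
    git rev-list --objects --all |
        git cat-file --batch-check='%(objecttype) %(objectsize) %(rest)' |
        awk -v limit="$limit" '
            $1 == "blob" && ($2 + 0) > (limit + 0) {
                size = $2
                sub(/^[^ ]+ [^ ]+ /, "")   # strip type and size, keep the path
                print size, $0
            }' |
        sort -rn
}
```

Run inside a clone, `list_large_blobs` with no argument uses the assumed 100 MB threshold. Any path it reports can then be passed to `git log --all --full-history -- <path>` to find the commits and refs that still reach it.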