+1 (binding)

Very exciting to see this effort come to fruition!

Arun

On Feb 26, 2013, at 2:55 PM, Suresh Srinivas wrote:

> I posted a heads-up about merging branch-trunk-win to trunk on Feb 8th. I
> am happy to announce that we are ready for the merge.
> 
> Here is a brief recap on the highlights of the work done:
> - Command-line scripts for the Hadoop surface area
> - Mapping the HDFS permissions model to Windows
> - Abstracted and reconciled mismatches around differences in Path semantics
> in Java and Windows
> - Native Task Controller for Windows
> - Implementation of a Block Placement Policy to support cloud environments,
> more specifically Azure.
> - Implementation of Hadoop native libraries for Windows (compression
> codecs, native I/O)
> - Fixes for several reliability issues, including race conditions,
> intermittent test failures, and resource leaks.
> - Several new unit test cases written for the above changes
> 
> Please find the details of the work in CHANGES.branch-trunk-win.txt -
> Common changes <http://bit.ly/Xe7Ynv>, HDFS changes <http://bit.ly/13QOSo9>,
> and YARN and MapReduce changes <http://bit.ly/128zzMt>. This is the work
> ported from branch-1-win to a branch based on trunk.
> 
> For details of the testing done, please see the thread -
> http://bit.ly/WpavJ4. Merge patch for this is available on HADOOP-8562<
> https://issues.apache.org/jira/browse/HADOOP-8562>.
> 
> This was a large undertaking that involved developing code and testing the
> entire Hadoop stack, including scale tests. It was made possible only by
> contributions from many folks in the community. The following people
> contributed to this work: Ivan Mitic, Chuan Liu, Ramya Sunil, Bikas Saha,
> Kanna Karanam, John Gordon, Brandon Li, Chris Nauroth, David Lao, Sumadhur
> Reddy Bolli, Arpit Agarwal, Ahmed El Baz, Mike Liddell, Jing Zhao, Thejas
> Nair, Steve Maine, Ganeshan Iyer, Raja Aluri, Giridharan Kesavan, Ramya
> Bharathi Nimmagadda, Daryn Sharp, Arun Murthy, Tsz-Wo Nicholas Sze, Suresh
> Srinivas and Sanjay Radia. Many others contributed as well, providing
> feedback and comments on numerous JIRAs.
> 
> The vote will run for seven days and will end on March 5, 6:00PM PST.
> 
> Regards,
> Suresh
> 
> 
> 
> 
> On Thu, Feb 7, 2013 at 6:41 PM, Mahadevan Venkatraman
> <mah...@microsoft.com> wrote:
> 
>> It is super exciting to look at the prospect of these changes being merged
>> to trunk. Having Windows as one of the supported Hadoop platforms is a
>> fantastic opportunity both for the Hadoop project and Microsoft customers.
>> 
>> This work began around a year ago when a few of us started with a basic
>> port of Hadoop to Windows. Since then, the Hadoop team at Microsoft has
>> made significant progress in the following areas:
>> (PS: Some of these items are already included in Suresh's email, but
>> including again for completeness)
>> 
>> - Command-line scripts for the Hadoop surface area
>> - Mapping the HDFS permissions model to Windows
>> - Abstracted and reconciled mismatches around differences in Path
>> semantics in Java and Windows
>> - Native Task Controller for Windows
>> - Implementation of a Block Placement Policy to support cloud
>> environments, more specifically Azure.
>> - Implementation of Hadoop native libraries for Windows (compression
>> codecs, native I/O)
>> - Fixes for several reliability issues, including race conditions,
>> intermittent test failures, and resource leaks.
>> - Several new unit test cases written for the above changes
>> 
>> In the process, we have engaged closely with the Apache open source
>> community and have received great support and assistance from it in the
>> form of fixes, code review comments and commits.
>> 
>> In addition, the Hadoop team at Microsoft has also made good progress in
>> other projects including Hive, Pig, Sqoop, Oozie, HCat and HBase. Many of
>> these changes have already been committed to the respective trunks with
>> help from various committers and contributors. It is great to see the
>> commitment of the community to support multiple platforms, and we look
>> forward to the day when a developer/customer is able to successfully deploy
>> a complete solution stack based on Apache Hadoop releases.
>> 
>> Next Steps:
>> 
>> All of the above changes are part of the Windows Azure HDInsight and
>> HDInsight Server products from Microsoft. We have successfully on-boarded
>> several internal customers and have been running production workloads on
>> Windows Azure HDInsight. Our vision is to create a big data platform based
>> on Hadoop, and we are committed to helping make Hadoop a world-class
>> solution that anyone can use to solve their biggest data challenges.
>> 
>> As an immediate next step, we would like to have a discussion around how
>> we can ensure that the quality of the mainline Hadoop branches on Windows
>> is maintained. To this end, we would like to get to the state where we have
>> pre-checkin validation gates and nightly test runs enabled on Windows. If
>> you have any suggestions around this, please do send an email.  We are
>> committed to helping sustain the long-term quality of Hadoop on both Linux
>> and Windows.
>> 
>> We sincerely thank the community for their contribution and support so
>> far, and we hope to continue this close engagement in the future.
>> 
>> -Microsoft HDInsight Team
>> 
>> 
>> -----Original Message-----
>> From: Suresh Srinivas [mailto:sur...@hortonworks.com]
>> Sent: Thursday, February 7, 2013 5:42 PM
>> To: common-...@hadoop.apache.org; yarn-...@hadoop.apache.org;
>> hdfs-dev@hadoop.apache.org; mapreduce-...@hadoop.apache.org
>> Subject: Heads up - merge branch-trunk-win to trunk
>> 
>> The support for Hadoop on Windows was proposed in HADOOP-8079<
>> https://issues.apache.org/jira/browse/HADOOP-8079> almost a year ago. The
>> goal was to make Hadoop natively integrated, full-featured, and performance
>> and scalability tuned on Windows Server or Windows Azure.
>> We are happy to announce that a lot of progress has been made in this
>> regard.
>> 
>> Initial work started in a feature branch, branch-1-win, based on branch-1.
>> The details related to the work done in the branch can be seen in
>> CHANGES.txt
>> <http://svn.apache.org/viewvc/hadoop/common/branches/branch-1-win/CHANGES.branch-1-win.txt?view=markup>.
>> This work has been ported to a branch, branch-trunk-win, based on trunk.
>> Merge patch for this is available on
>> HADOOP-8562 <https://issues.apache.org/jira/browse/HADOOP-8562>.
>> 
>> Highlights of the work done so far:
>> 1. Necessary changes in Hadoop to run natively on Windows. These changes
>> handle differences in platforms related to path names, process/task
>> management etc.
>> 2. Addition of winutils tools for managing file permissions and ownership,
>> user group mapping, hardlinks, symbolic links, chmod, disk utilization, and
>> process/task management.
>> 3. Addition of cmd scripts equivalent to the existing shell scripts:
>> hadoop-daemon.sh and the start and stop scripts.
>> 4. Addition of a block placement policy implementation to support cloud
>> environments, more specifically Azure.
>> 
>> We are very close to wrapping up the work in branch-trunk-win and getting
>> ready for a merge. Currently the merge patch is passing close to 100% of
>> unit tests on Linux. Soon I will call for a vote to merge this branch into
>> trunk.
>> 
>> Next steps:
>> 1. Call for vote to merge branch-trunk-win to trunk, when the work
>> completes and precommit build is clean.
>> 2. Start a discussion on adding Jenkins precommit builds on Windows and
>> how to integrate that with the existing commit process.
>> 
>> Let me know if you have any questions.
>> 
>> Regards,
>> Suresh
>> 
>> 
> 
> 
> -- 
> http://hortonworks.com/download/

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/

