© Hortonworks Inc. 2013 - Confidential
Apache Hadoop & YARN
• Apache Hadoop
– De facto Big Data open source platform
– Running in production for about 5 years at hundreds of companies
such as Yahoo, eBay and Facebook
• Hadoop 2
– Significant improvements in the HDFS distributed storage layer:
High Availability, NFS, Snapshots
– YARN – next-generation compute framework for Hadoop, designed
from the ground up based on experience gained from Hadoop 1
– YARN running in production at Yahoo for about a year
– YARN paper awarded Best Paper at SoCC 2013