Table of Contents
About this Guide ........................................................................................................5
Configuring HDFS High Availability..........................................................................7
Introduction to HDFS High Availability......................................................................................................7
Overview....................................................................................................................................................................7
Background................................................................................................................................................................7
Architecture...............................................................................................................................................................7
Hardware Configuration for HDFS HA.......................................................................................................9
Hardware Configuration for Quorum-based Storage...........................................................................................9
Hardware Configuration for Shared Storage Using NFS......................................................................................9
Software Configuration for HDFS HA......................................................................................................10
Software Configuration for Quorum-based Storage..........................................................................................10
Software Configuration for Shared Storage Using NFS......................................................................................18
HDFS High Availability Initial Deployment..............................................................................................25
Install and Start the JournalNodes (Quorum-based Storage only)...................................................................26
Initialize the Shared Edits directory......................................................................................................................26
Start the NameNodes............................................................................................................................................26
Restart Services......................................................................................................................................................27
Deploy Automatic Failover.....................................................................................................................................27
Verifying Automatic Failover.................................................................................................................................28
Upgrading an HDFS HA Configuration to the Latest Release...............................................................28
Upgrading from CDH4 Beta 2 or Later..................................................................................................................28
Upgrading from CDH4 Beta 1................................................................................................................................29
Configuring Other CDH Components to Use HDFS HA..........................................................................29
Configuring HBase to Use HDFS HA.....................................................................................................................29
Configuring Oozie to Use HDFS HA.......................................................................................................................31
Upgrading the Hive Metastore to Use HDFS HA.................................................................................................31
HDFS High Availability Administration...................................................................................................32
HA Administration using the haadmin command..............................................................................................32
Using the dfsadmin command when HA is enabled..........................................................................................32
Switching from Shared Storage using NFS to Quorum-based Storage...........................................................32
Disabling HDFS High Availability..........................................................................................................................33
Redeploying HDFS High Availability with Quorum-based Storage...................................................................33
Redeploying HDFS High Availability with Shared Storage using NFS..............................................................33
Configuring High Availability for the JobTracker (MRv1).....................................35