Before You Install CDH 5 on a Cluster
Note: This page contains references to CDH 5 components or features that have been removed from CDH 6. These references are only applicable if you are managing a CDH 5 cluster with Cloudera Manager 6. For more information, see the Deprecated Items page in the Cloudera Enterprise 6 Release Notes.
Important:
- Use the service command to start, stop, and restart CDH components, instead of running scripts in /etc/init.d directly. The service command creates a predictable environment by setting the current working directory to / and removing most environment variables (passing only LANG and TERM). With /etc/init.d, existing environment variables remain in force and can produce unpredictable results. When you install CDH from packages, service is installed as part of the Linux Standard Base (LSB).
- On SLES 11 platforms, do not install or try to use the IBM Java version bundled with the SLES distribution; Hadoop will not run correctly with that version. Install the Oracle JDK following directions under Java Development Kit Installation.
- If you are migrating from MapReduce v1 (MRv1) to MapReduce v2 (MRv2, YARN), see Migrating from MapReduce 1 (MRv1) to MapReduce 2 (MRv2) for important information and instructions.
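The clean environment that the first point above attributes to the service command can be approximated as follows. This is a minimal sketch, not how service itself is implemented: the helper names clean_env and run_like_service are hypothetical, and the sketch assumes, per the description above, that only LANG and TERM are passed through and that the working directory is /.

```python
import os
import subprocess

# Variables the text says `service` passes through; everything else is dropped.
PASSED_THROUGH = ("LANG", "TERM")


def clean_env(env):
    """Return a copy of env containing only the passed-through variables."""
    return {name: env[name] for name in PASSED_THROUGH if name in env}


def run_like_service(command, env=None):
    """Run command from / with a stripped environment (hypothetical helper).

    command must use an absolute path, because PATH is not passed through
    in this sketch.
    """
    source = os.environ if env is None else env
    return subprocess.run(
        command,
        cwd="/",
        env=clean_env(source),
        capture_output=True,
        text=True,
        check=False,
    )


if __name__ == "__main__":
    # A stray variable set in the caller's shell does not leak through.
    polluted = {"LANG": "C", "TERM": "xterm", "HADOOP_HOME": "/tmp/stale"}
    print(clean_env(polluted))
```

Running a script straight from /etc/init.d corresponds to skipping clean_env entirely, so whatever the calling shell has exported reaches the component being started.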
Before you install CDH 5 on a cluster, complete the following steps to prepare your system:
- Verify you are using a supported operating system for CDH 5. See CDH and Cloudera Manager Supported Operating Systems.
- If you haven't already done so, install the Oracle Java Development Kit. For instructions and recommendations, see Java Development Kit Installation.
Scheduler Defaults
Note the following differences between MRv1 (MapReduce) and MRv2 (YARN).
- MRv1 (MapReduce v1):
  - Cloudera Manager and CDH 5 set the default scheduler to FIFO.
- MRv2 (YARN):
  - Cloudera Manager and CDH 5 set the default scheduler to Fair Scheduler.
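For reference, the two defaults above correspond to scheduler classes in upstream Apache Hadoop. The sketch below records that mapping. The property and class names are those of upstream Hadoop and are assumed here to apply to CDH 5 unchanged; the DEFAULT_SCHEDULERS table and the default_scheduler helper are hypothetical names used only for illustration.

```python
# Default scheduler per MapReduce framework, as listed above.
# Property and class names are taken from upstream Apache Hadoop
# (assumption: CDH 5 uses the same names).
DEFAULT_SCHEDULERS = {
    "MRv1": {
        "scheduler": "FIFO",
        "property": "mapred.jobtracker.taskScheduler",
        "class": "org.apache.hadoop.mapred.JobQueueTaskScheduler",
    },
    "MRv2": {
        "scheduler": "Fair Scheduler",
        "property": "yarn.resourcemanager.scheduler.class",
        "class": (
            "org.apache.hadoop.yarn.server.resourcemanager"
            ".scheduler.fair.FairScheduler"
        ),
    },
}


def default_scheduler(framework):
    """Return the default scheduler name for "MRv1" or "MRv2"."""
    try:
        return DEFAULT_SCHEDULERS[framework]["scheduler"]
    except KeyError:
        raise ValueError(f"unknown framework: {framework!r}") from None


if __name__ == "__main__":
    for name, info in DEFAULT_SCHEDULERS.items():
        print(f"{name}: {info['scheduler']} ({info['property']})")
```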
High Availability
- For more information and instructions on setting up a new HA configuration, see High Availability.
Important:
If you configure HA for the NameNode, do not install hadoop-hdfs-secondarynamenode. After completing the HDFS HA software configuration, follow the installation instructions in Deploying HDFS High Availability.
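The package rule above can be checked mechanically before deployment. The following is a minimal sketch under stated assumptions: the helper ha_package_conflicts is hypothetical, and the list of installed package names is assumed to have been collected separately (for example, from the host's package manager), either as bare names or as names followed by a version suffix.

```python
# Package that must not be installed when NameNode HA is configured.
SECONDARY_NN_PACKAGE = "hadoop-hdfs-secondarynamenode"


def ha_package_conflicts(installed_packages, namenode_ha_enabled):
    """Return installed packages that conflict with NameNode HA.

    Matches the bare package name as well as versioned names of the
    form "<name>-<version>". Hypothetical helper for illustration.
    """
    if not namenode_ha_enabled:
        return []
    return [
        pkg
        for pkg in installed_packages
        if pkg == SECONDARY_NN_PACKAGE
        or pkg.startswith(SECONDARY_NN_PACKAGE + "-")
    ]


if __name__ == "__main__":
    installed = ["hadoop-hdfs-namenode", "hadoop-hdfs-secondarynamenode"]
    conflicts = ha_package_conflicts(installed, namenode_ha_enabled=True)
    if conflicts:
        print("Remove before configuring HA:", ", ".join(conflicts))
```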
©2016 Cloudera, Inc. All rights reserved.