Cloudera Administration
Duration: 5 days
1. Getting Started with Apache Hadoop
History of Apache Hadoop and its trends
Components of Apache Hadoop
Understanding the Apache Hadoop daemons
Introducing Cloudera
Introducing CDH
Responsibilities of a Hadoop administrator
2. HDFS and MapReduce
Essentials of HDFS
The read/write operational flow in HDFS
Understanding the NameNode UI
Understanding the Secondary NameNode UI
Exploring HDFS commands
Getting acquainted with MapReduce
3. Cloudera's Distribution Including Apache Hadoop
Getting started with CDH
Understanding the CDH components
Installing CDH
Installing the CDH components
4. Exploring HDFS Federation and Its High Availability
Implementing HDFS Federation
Implementing HDFS High Availability
JobTracker high availability
5. Using Cloudera Manager
Introducing Cloudera Manager
Understanding the Cloudera Manager architecture
Installing Cloudera Manager
Navigating the Cloudera Manager web console
Configuring High Availability using Cloudera Manager
6. Implementing Security Using Kerberos
Understanding authentication and authorization
Introducing Kerberos
Understanding the Kerberos architecture
Installing Kerberos
Configuring Kerberos for Apache Hadoop
Authorization in Apache Hadoop
7. Managing an Apache Hadoop Cluster
Configuring Hadoop services using Cloudera Manager
Role management in Cloudera Manager
Managing hosts using Cloudera Manager
Managing multiple clusters with Cloudera Manager
Rebalancing a Hadoop cluster from Cloudera Manager
8. Cluster Monitoring Using Events and Alerts
Monitoring Hadoop services from Cloudera Manager
Understanding events and alerts
9. Configuring Backups
Understanding backups
Understanding HDFS backups
Using the distributed copy (DistCp)
Configuring backups using Cloudera Manager