Big Data Training For ACI

Uploaded by alexmylife
This document outlines training courses that introduce big data concepts, Apache Hadoop, and the MapR Converged Data Platform. The Introduction to Big Data lessons define key terms, summarize the history of big data computing, and explain the big data pipeline. The Apache Hadoop Essentials lessons cover Hadoop file systems, MapReduce, and the Hadoop ecosystem. The MapR administration lessons (ADM 200 through ADM 203) teach how to install, configure, maintain, and troubleshoot the MapR platform.

Copyright: © All Rights Reserved

Introduction to Big Data

Lesson 1: Introduction to Big Data

• Define Big Data
• Summarize the History of Big Data Computing
• Define Key Terms in Big Data Computing

Lesson 2: The Big Data Pipeline

• Organize the Steps in the Big Data Pipeline
• Explain the Role of Administrators
• Explain the Role of Developers
• Explain the Role of Data Analysts

Lesson 3: Solving Big Data Problems

• Data Warehouse Optimization
• Recommendation Engine
• Large-Scale Log Analysis

Apache Hadoop Essentials

Lesson 4: Core Elements of Apache Hadoop

• Compare and Contrast Local and Distributed File Systems
• Explain Data Management in the Hadoop File System
• Summarize the MapReduce Algorithm

Lesson 5: The Apache Hadoop Ecosystem

• Define the Apache Hadoop Ecosystem
• Administration: Apache ZooKeeper, YARN
• Ingestion: Apache Flume, Apache Oozie, Apache Sqoop
• Processing: Apache Spark, Apache HBase, Apache Pig
• Analysis: Apache Hive, Apache Drill, Apache Mahout

MapR Converged Data Platform Essentials

Lesson 6: Introduction to the MapR Converged Data Platform

• Define the MapR Converged Data Platform
• Explain Key Features and Benefits of the MapR Converged Data Platform
• Understand Use Cases for the MapR Converged Data Platform

ADM 200 - Install a MapR Cluster

Lesson 1: Prepare for Installation

• Plan the Service Layout
• Lab: Plan a Service Layout
• Prepare and Verify Cluster Hardware
• Lab: Audit the Cluster
• Test Nodes
• Lab: Run Pre-Install Tests

Lesson 2: Install a MapR Cluster

• Install the MapR Converged Data Platform
• Lab: Install the MapR Converged Data Platform
• Add a MapR License
• Lab: Install a MapR License

Lesson 3: Verify and Test the Cluster

• Verify Cluster Status
• Run Post-Install Benchmark Tests
• Lab: Run Synthetic Benchmarks
• Explore the Cluster Structure
• Lab: Explore the Cluster

ADM 201 - Configure a MapR Cluster

Lesson 4: Users, Groups, and System Settings

• Manage Users and Groups
• Lab: Configure Users and Groups
• Configure System Settings

Lesson 5: Configure Topology

• Define Topology
• Configure Node Topology
• Lab: Configure Node Topology

Lesson 6: Configure Volumes

• Describe Volumes and Volume Properties
• Configure Volumes
• Lab: Create Volumes and Set Quotas

Lesson 7: Job Logs and Scheduling

• Configure Logging Options
• Lab: Configure YARN Log Aggregation
• Configure the Fair Scheduler

ADM 202 - Data Access and Protection

Lesson 8: Access the Cluster

• Access Data
• Lab: Modify Cluster Files
• Set Up Client Access
• Lab: Configure Client NFS Access
• Configure Virtual IP Addresses
• Lab: Configure VIPs
• Control Access to the Cluster
• Lab: Control Access to the Cluster

Lesson 9: Snapshots

• Describe Snapshots
• Configure and Use Snapshots
• Lab: Snapshots

Lesson 10: Mirrors

• Describe How Mirrors Work
• Configure and Use Local Mirrors
• Lab: Configure Local Mirrors
• Use Cascading and Remote Mirrors

ADM 203 - Cluster Maintenance

Lesson 11: Monitor and Manage the Cluster

• Monitor the Cluster
• Configure and Respond to Alarms
• Lab: Configure Alerts
• Balance Cluster Resources
• Manage Logs and Snapshots
• Lab: Manage Logs and Snapshots
• Add and Remove Services
• Lab: Add and Remove Services

Lesson 12: Disk and Node Maintenance

• Add and Remove Disks
• Replace a Failed Disk
• Lab: Replace a Failed Disk
• Perform Node Maintenance
• Add Nodes
• Demonstration: Add a Node to the Cluster

Lesson 13: Troubleshooting

• Troubleshoot Different Problem Types
• Lab: Troubleshooting
• Use Support Utilities
• Lab: Collect Logs for Support
