Course Information

Course title: Hadoop Administrator (CCAH) Certification

Public classes, customized classes

Start date: 2024-06-15

Course Description


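Course Overview

Learn the concepts and practices of Hadoop system administration, from installation and configuration, through load balancing and tuning, to diagnosing and resolving deployment problems.

Intended for administrators who need to build or maintain Hadoop clusters. Participants should have basic Linux knowledge; no prior Hadoop knowledge is required.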

CCA Administrator Exam (CCA131)

Exam format: 120 minutes; passing score 70%; candidates complete 8 to 12 scenario-based tasks on a preconfigured Cloudera Enterprise cluster.

 


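As a core big data technology, Hadoop gives enterprises a highly scalable, highly redundant, fault-tolerant, and cost-effective "data-driven" solution. In response to the current widespread shortage of engineers skilled in large-scale data, 青蓝咨询's CCAH course is aimed at people who already have Linux system administration and networking skills and experience. No Hadoop background or experience is required.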


Target Audience

System administrators, or anyone who needs to manage an Apache Hadoop cluster (in both production and development environments).


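Course Content

· How the Hadoop Distributed File System (HDFS) and MapReduce work
· Hardware planning for a Hadoop cluster
· Network planning for a Hadoop cluster
· Hadoop cluster configuration and tuning
· How to configure NameNode HA
· How to configure NameNode Federation
· How to configure the FairScheduler so that multiple users can share a Hadoop cluster
· How to install and implement Kerberos-based security for a Hadoop cluster
· How to maintain and monitor a Hadoop cluster
· How to use Flume to load dynamically generated files, and Sqoop to import and export data from relational databases
· System administration tasks for Hadoop ecosystem tools such as Hive, Pig, and HBase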


Modules and Content

The Case for Apache Hadoop
· Why Hadoop?
· A Brief History of Hadoop
· Core Hadoop Components
· Fundamental Concepts

HDFS
· HDFS Features
· Writing and Reading Files
· NameNode Considerations
· Overview of HDFS Security
· Using the NameNode Web UI
· Using the Hadoop File Shell

Getting Data into HDFS
· Ingesting Data from External Sources with Flume
· Ingesting Data from Relational Databases with Sqoop
· REST Interfaces
· Best Practices for Importing Data

MapReduce
· What Is MapReduce?
· Features of MapReduce
· Basic Concepts
· Architectural Overview
· MapReduce Version 2
· Failure Recovery
· Using the JobTracker Web UI

Planning Your Hadoop Cluster
· General Planning Considerations
· Choosing the Right Hardware
· Network Considerations
· Configuring Nodes
· Planning for Cluster Management

Hadoop Installation and Initial Configuration
· Deployment Types
· Installing Hadoop
· Specifying the Hadoop Configuration
· Performing Initial HDFS Configuration
· Performing Initial MapReduce Configuration
· Log File Locations

Installing and Configuring Hive, Impala, and Pig
· Hive
· Impala
· Pig

Hadoop Clients
· What Is a Hadoop Client?
· Installing and Configuring Hadoop Clients
· Installing and Configuring Hue
· Hue Authentication and Configuration

Cloudera Manager
· The Motivation for Cloudera Manager
· Cloudera Manager Features
· Standard and Enterprise Versions
· Cloudera Manager Topology
· Installing Cloudera Manager
· Installing Hadoop Using Cloudera Manager
· Performing Basic Administration Tasks

Advanced Cluster Configuration
· Advanced Configuration Parameters
· Configuring Hadoop Ports
· Explicitly Including and Excluding Hosts
· Configuring HDFS for Rack Awareness
· Configuring HDFS High Availability

Hadoop Security
· Why Hadoop Security Is Important
· Hadoop's Security System Concepts
· What Kerberos Is and How It Works
· Securing a Hadoop Cluster with Kerberos

Managing and Scheduling Jobs
· Managing Running Jobs
· Scheduling Hadoop Jobs
· Configuring the FairScheduler

Cluster Maintenance
· Checking HDFS Status
· Copying Data Between Clusters
· Adding and Removing Cluster Nodes
· Rebalancing the Cluster
· NameNode Metadata Backup
· Cluster Upgrading

Cluster Monitoring and Troubleshooting
· General System Monitoring
· Managing Hadoop's Log Files
· Monitoring Hadoop Clusters
· Common Troubleshooting Issues


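Note: The actual start date may be adjusted as circumstances require. Please follow the 青蓝咨询 official account (公众号) for announcements, or ask a course advisor.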




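[Contact 青蓝咨询]

Address: Room 309, 3/F, Block B, TCL Building, No. 06 Gaoxin South 1st Road, Nanshan District, Shenzhen (bus stop: Dachong; metro: Line 1, Gaoxinyuan station, Exit C)

Postal code: 518057

Phone: 0755-86950769

Email: peixun@shzhchina.com

Website: http://www.shzhchina.com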

 
