Course Information
Course name: Hadoop Administrator (CCAH) Certification
Public classes and customized classes
Start date: 2024-06-15
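Course Description
【Course Overview】
Learn the concepts and practices of Hadoop system administration, from installation and configuration, through load balancing and tuning, to diagnosing and resolving deployment problems.
Intended for administrators who need to build or maintain Hadoop clusters. Participants need basic Linux knowledge; no prior Hadoop knowledge is required.
CCA Administrator Exam (CCA131)
Exam format: 120 minutes; 70% passing score; solve tasks across 8 to 12 scenarios on a preconfigured Cloudera Enterprise cluster.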
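【Course Overview】
As a core big data technology, Hadoop gives enterprises a highly scalable, highly redundant, fault-tolerant, and cost-effective "data-driven" solution. Given the current widespread shortage of engineers skilled in large-scale data technologies, 青蓝咨询's CCAH course is aimed at people who already have Linux system administration and networking skills and experience. No Hadoop background or experience is required.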
【Target Audience】
System administrators, or anyone who needs to manage Apache Hadoop clusters (in both production and development environments).
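【Course Content】
· How the Hadoop Distributed File System (HDFS) and MapReduce work
· Hardware planning for a Hadoop cluster
· Network planning for a Hadoop cluster
· Configuring and tuning a Hadoop cluster
· How to configure NameNode HA
· How to configure NameNode Federation
· How to configure the FairScheduler so that multiple users can share a Hadoop cluster
· How to install and implement Kerberos-based security for a Hadoop cluster
· How to maintain and monitor a Hadoop cluster
· How to use Flume to load dynamically generated files, and Sqoop to connect to relational databases for data import and export
· System administration tasks for Hadoop ecosystem tools such as Hive, Pig, and HBase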
Module | Content |
The Case for Apache Hadoop | Why Hadoop?; A Brief History of Hadoop; Core Hadoop Components; Fundamental Concepts |
HDFS | HDFS Features; Writing and Reading Files; NameNode Considerations; Overview of HDFS Security; Using the NameNode Web UI; Using the Hadoop File Shell |
Getting Data into HDFS | Ingesting Data from External Sources with Flume; Ingesting Data from Relational Databases with Sqoop; REST Interfaces; Best Practices for Importing Data |
MapReduce | What Is MapReduce?; Features of MapReduce; Basic Concepts; Architectural Overview; MapReduce Version 2; Failure Recovery; Using the JobTracker Web UI |
Planning Your Hadoop Cluster | General Planning Considerations; Choosing the Right Hardware; Network Considerations; Configuring Nodes; Planning for Cluster Management |
Hadoop Installation and Initial Configuration | Deployment Types; Installing Hadoop; Specifying the Hadoop Configuration; Performing Initial HDFS Configuration; Performing Initial MapReduce Configuration; Log File Locations |
Installing and Configuring Hive, Impala, and Pig | Hive; Impala; Pig |
Hadoop Clients | What Is a Hadoop Client?; Installing and Configuring Hadoop Clients; Installing and Configuring Hue; Hue Authentication and Configuration |
Cloudera Manager | The Motivation for Cloudera Manager; Cloudera Manager Features; Standard and Enterprise Versions; Cloudera Manager Topology; Installing Cloudera Manager; Installing Hadoop Using Cloudera Manager; Performing Basic Administration Tasks; Advanced Cluster Configuration; Advanced Configuration Parameters; Configuring Hadoop Ports; Explicitly Including and Excluding Hosts; Configuring HDFS for Rack Awareness; Configuring HDFS High Availability |
Hadoop Security | Why Hadoop Security Is Important; Hadoop's Security System Concepts; What Kerberos Is and How It Works; Securing a Hadoop Cluster with Kerberos |
Managing and Scheduling Jobs | Managing Running Jobs; Scheduling Hadoop Jobs; Configuring the FairScheduler |
Cluster Maintenance | Checking HDFS Status; Copying Data Between Clusters; Adding and Removing Cluster Nodes; Rebalancing the Cluster; NameNode Metadata Backup; Cluster Upgrading |
Cluster Monitoring and Troubleshooting | General System Monitoring; Managing Hadoop's Log Files; Monitoring Hadoop Clusters; Common Troubleshooting Issues |
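Note: The actual start date may be adjusted as circumstances require. Please follow 青蓝咨询's official public account for updates, or ask a course advisor.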
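【Contact 青蓝咨询】
Address: Room 309, 3/F, Block B, TCL Building, No. 06 Gaoxin South 1st Road, Nanshan District, Shenzhen (Bus stop: Dachong; Metro: Line 1, Hi-Tech Park Station, Exit C)
Postal code: 518057
Phone: 0755-86950769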