Course catalog: Hadoop for Administrators Training
Course outline:

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with HDFS command lines
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as a complement to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume and Sqoop
MapReduce operations and administration
Parallel computing before MapReduce: comparing HPC and Hadoop administration
MapReduce cluster loads
Nodes and daemons (JobTracker, TaskTracker)
MapReduce UI walkthrough
MapReduce configuration
Job config
Optimizing MapReduce
Fool-proofing MR: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks: installation and use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5).
Ambari for cluster administration, monitoring, and routine tasks: installation and use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0).
