Course catalog: Hadoop For Administrators Training
Course outline:

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with the HDFS command line
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as a complement to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume and Sqoop
MapReduce operations and administration
Parallel computing before MapReduce: compare HPC vs Hadoop administration
MapReduce cluster loads
Nodes and daemons (JobTracker, TaskTracker)
MapReduce UI walkthrough
MapReduce configuration
Job config
Optimizing MapReduce
Fool-proofing MR: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks: installation and use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5)
Ambari for cluster administration, monitoring, and routine tasks: installation and use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0)
