Overview
Sapphire Global's Big Data training builds working expertise in Big Data and Hadoop. The online course introduces the key concepts in Big Data, from Hadoop architecture and cluster setup through administration, high availability, and security. Enroll now and become a Big Data consultant.
Course Curriculum
Key features
This course will prepare you to:
- Explain the architecture of Big Data systems
- Use the standard Big Data submodules designed for data analytics
Big Data and Hadoop Administration
- Introduction to Big Data
- Hadoop Architecture
- MapReduce Framework
- A typical Hadoop Cluster
- Data Loading into HDFS
Hadoop Architecture and Cluster setup
- Hadoop server roles and their usage
- Rack Awareness
- Anatomy of Write and Read
- Replication Pipeline
- Data Processing
- Hadoop Installation and Initial Configuration
- Deploying Hadoop in pseudo-distributed mode
- Deploying a multi-node Hadoop cluster
- Installing Hadoop Clients
Hadoop Cluster: Planning and Managing
- Planning the Hadoop Cluster
- Cluster Size
- Hardware and Software considerations
- Managing and Scheduling Jobs
- Types of schedulers in Hadoop
- Configuring the schedulers and running MapReduce jobs
- Cluster Monitoring and Troubleshooting
Backup, Recovery and Maintenance
- Configuring rack awareness
- Setting up Hadoop Backup
- Whitelisting and blacklisting DataNodes in a cluster
- Setting up quotas
- Upgrading a Hadoop cluster
- Copying data across clusters using distcp
- Diagnostics and Recovery
- Cluster Maintenance
Hadoop 2.0 and High Availability
- Configuring Secondary NameNode
- Hadoop 2.0
- YARN framework
- MRv2
- Hadoop 2.0 Cluster setup
- Deploying Hadoop 2.0 in pseudo-distributed mode
- Deploying a multi-node Hadoop 2.0 cluster
Advanced Topics: QJM, HDFS Federation and Security
- Configuring HDFS Federation
- Basics of Hadoop Platform Security
- Securing the Platform
- Configuring Kerberos
Oozie, HCatalog/Hive and HBase Administration
- Oozie, HCatalog/Hive Administration
- HBase Architecture
- HBase setup
- HBase and Hive Integration
- HBase performance optimization
Project: Hadoop Implementation
- Understanding the Problem
- Plan, design, and create a Hadoop cluster for a real-world use case
- Set up and configure commonly used Hadoop ecosystem components such as Pig and Hive
- Configure Ganglia on the Hadoop cluster and troubleshoot common cluster problems
Practice Test and Interview Questions