Highlights of Our Online Training

  • In-depth course material with real-time scenarios.
  • Classes taught by highly qualified trainers.
  • Classes and demo sessions scheduled at timings that suit students.
  • Case studies and real-time scenarios covered in training.
  • 24x7 technical support.
  • Every topic covered with real-time solutions.
  • Normal-track, weekend, and fast-track classes available.

Course Curriculum

Introduction

  • Motivation for Hadoop
  • Big Data Characteristics, Challenges with Traditional Systems
  • Hadoop’s History
  • Core Hadoop Concepts
  • Hadoop Clusters, Installation and Configuration

Linux and Hadoop Basic Commands

  • Linux Basic Commands
  • HDFS Basic Commands
  • Hands-On for All Commands

Hadoop Basic Concepts

  • What Is Hadoop?
  • Features of the Hadoop Distributed File System (HDFS)
    • Architecture
    • Features, Goals and Advantages of HDFS
    • Name Nodes
    • Data Nodes
    • Secondary Name Node
  • The Concepts Behind MapReduce
    • How MapReduce Works
    • Data Types
    • Input & Output Formats
  • How a Hadoop Cluster Operates
    • Cluster sizing
    • Capacity planning
    • Replication
    • Blocks
    • Heartbeat Mechanism
    • Data Organization
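
To give a feel for the blocks, replication, and capacity planning topics above, here is a minimal sketch of the arithmetic involved. It assumes a 128 MB block size and a replication factor of 3 (the usual Hadoop 2.x defaults); the function name and parameters are hypothetical and chosen only for illustration.

```python
import math

def hdfs_storage_estimate(file_size_mb, block_size_mb=128, replication=3):
    """Return (blocks, block replicas, raw storage in MB) for one file.

    Assumes HDFS splits the file into fixed-size blocks and stores
    `replication` copies of each block on different Data Nodes.
    """
    blocks = math.ceil(file_size_mb / block_size_mb)
    replicas = blocks * replication
    # The last block only occupies its actual size, so raw usage is
    # the file size times the replication factor.
    raw_mb = file_size_mb * replication
    return blocks, replicas, raw_mb

blocks, replicas, raw_mb = hdfs_storage_estimate(1000)
print(blocks, replicas, raw_mb)  # 8 24 3000
```

A 1000 MB file therefore needs roughly three times its size in raw cluster storage, which is the starting point for cluster sizing.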

VM Installation

  • Providing Hadoop VM and configuring it
  • Learning Eclipse and creating a MapReduce JAR

Writing a MapReduce Program

  • The Driver Code
  • The Mapper
  • The Reducer
  • The Streaming API
  • Develop a MapReduce program for WhatsApp Message Analytics project
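
The driver, mapper, and reducer roles listed above can be sketched with a small word count that runs locally in plain Python. This is only an illustration of the pattern, not the course's project code: the function names and sample messages are made up, and the in-memory sort stands in for Hadoop's shuffle-and-sort phase. With the Streaming API, the mapper and reducer would instead be separate scripts that read lines from standard input and write key/value lines to standard output.

```python
from itertools import groupby
from operator import itemgetter

def mapper(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.strip().lower().split():
            yield word, 1

def reducer(pairs):
    """Sum the counts per word; input must already be sorted by key."""
    for word, group in groupby(pairs, key=itemgetter(0)):
        yield word, sum(count for _, count in group)

def run_job(lines):
    """Driver: wire mapper -> sort (stand-in for the shuffle) -> reducer."""
    mapped = sorted(mapper(lines), key=itemgetter(0))
    return dict(reducer(mapped))

messages = ["hello hadoop", "hello hive"]  # hypothetical sample input
print(run_job(messages))  # {'hadoop': 1, 'hello': 2, 'hive': 1}
```

Sorting before reducing matters here because the reducer relies on all pairs for the same key arriving together, which is the guarantee the framework provides between the map and reduce phases.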

The Hadoop Ecosystem Introduction

Hive

  • SQL Basics
  • Hive Basics
  • Internal & External Tables
  • Partitioning
  • Buckets
  • Using Hive system-defined functions
  • 3 Projects

Pig

  • Pig Basics
  • Loading data files
  • Writing queries – SPLIT, FILTER, JOIN, GROUP, SAMPLE, ILLUSTRATE, etc.
  • Pig UDF
  • 3 Projects

HBase

  • NoSQL Basics
  • HBase Basics
  • Region Server

Flume

  • Flume Basics
  • Flume Use Cases

Sqoop

  • Sqoop Import
  • Sqoop Export

ZooKeeper

Oozie

  • Workflow Design
  • Workflow Scheduler

Hadoop 2.X

  • YARN
  • Resource Manager
  • Node Manager
  • Application Master
  • Hadoop 1.x vs. Hadoop 2.x

Introduction to Apache Spark

  • Spark Details
  • DAG
  • Scala

Hadoop on Amazon Web Services

  • Introduction to AWS cloud infrastructure
  • Amazon SaaS, PaaS and IaaS
  • Creating an EC2 instance for processing
  • Creating S3 buckets
  • Deploying data on to the cloud
  • Choosing the instance size
  • Configuring an EMR instance
  • Creating a virtual cluster on Amazon

Register for a demo session @ Hadoop Online Training
