JAVA PLUS HADOOP TRAINING

Big data & Java has opened the door to new job opportunities

Java plus Hadoop course provides Java programmers a deep-dive into Hadoop application development. Students will learn how to design and develop efficient and effective MapReduce applications for Hadoop using the Data Platform, including how to implement combiners, partitioners, secondary sorts, custom input and output formats, joining large datasets, unit testing, and developing UDFs for Pig and Hive. Big data & Java has opened the door to new job opportunities.Hadoop is a framework written in Java for running applications on large clusters of commodity hardware and incorporates features similar to those of the Google File System and of MapReduce. HDFS is a highly fault-tolerant distributed file system and like Hadoop designed to be deployed on low-cost hardware. It provides high throughput access to application data and is suitable for applications that have large data sets. Professionals with Java skills will find it easier to extend Pig Latin by using UDF (User Defined Functions) as they can write it with JavaScript. Most importantly, they would find their Java skills advantageous when debugging Hadoop applications.

J2SE (JAVA 2 STANDARD EDITION) INTRODUCTION To JAVA

  • Understanding Requirement: why JAVA
  • Why java important to the internet
  • JAVA on LINUX PLATFORM

INTRODUCTION TO JAVA VIRTUAL MACHINE

  • Java Virtual machine Architecture
  • Class loading process by class loaders
  • Role of Just In time compiler (JIT)
  • Execution Engine

AN OVERVIEW OF JAVA AND BUZZWORDS

  • Data Types , Variables ad Arrays
  • Operators
  • Control statements
  • Object oriented paradigms
  • Abstractions
  • The Three OOP Principles:
  • (Encapsulation, Inheritance, Polymorphism)

JAVA CLASSES AND OOP IMPLEMENTATION

  • Class fundamentals
  • Commands line arguments
  • Learning Static Initializer
  • Declaration of objects
  • Instance variable Hiding
  • Overloading and Overriding of Methods
  • Understanding of Access Controls (private, public and protected)
  • Learning Nested and Inner classes
  • Dynamic method Dispatching
  • Using Abstract classes
  • Using final to prevent Overriding & Inheritance
  • Garbage collection

PACKAGES AND INTERFACES

  • Defining a package
  • Understanding CLASSPATH
  • Access Protection
  • Importing packages
  • Defining and Implementing interfaces
  • Anonymous classes
  • Abstract classes Vs Interfaces
  • Adapter classes

EXCEPTION HANDLING

  • Fundamental of Exception handling
  • Types of Exceptions
  • Learning exceptions handlers
  • Try and Catch
  • Multiple catch Clauses
  • Nested Try statements
  • Throw , throws and finally
  • Creating custom exceptions
  • Assertion

STRING HANDLING

  • Learning String Operation
  • Learning character Extraction
  • Learning string Comparison
  • Understanding string Buffer Classes
  • String builder class
  • Creating Immutable Class

NEW IN JDK 5/6/7

  • Premain method , Object size
  • Generics
  • Annotations
  • Vargs
  • Static Import
  • For each
  • String in which
  • Multiple exception handling
  • Dimond Operator
  • Finding out constant and method declaration belong to an interface
  • Creating an instance of a class whose name is not known until runtime
  • Getting and setting value of an objects field if field name is unknown until runtime
  • Invoking a method on an object if the method is unknown until runtime
  • Creating a new array whose size and components type are not know until runtime
  • Invoking private member of a class

WINDOWS PROGRAMMING Swing & AWT

  • Introduction to JFC
  • Controls

Event Delegation Method

  • Event Classes
  • Event listeners

APPLENET

  • Applenet Basics
  • Architecture and Skeleton
  • Simple Apple Display Methods
  • The HTML Applet Tag
  • Inter Applet communication
  • Trusted Applet (Applet with Database)

Multi threaded Programming

  • The java thread model
  • Creating a thread: Extended Thread and Implementing Runable
  • Creating multiple Thread and Context Switching
  • Synchronization: methods and statements
  • Inter thread Communication
  • Thread local
  • Dead lock
  • Threadpool executer class
  • Shutdown hookup
  • Re-entrent locking
  • The java thread model
  • Creating a thread: Extended Thread and Implementing Runable
  • Creating multiple Thread and Context Switching
  • Synchronization: methods and statements
  • Inter thread Communication
  • Thread local
  • Dead lock
  • Threadpool executer class
  • Shutdown hookup
  • Re-entrent locking

Introduction to I/O streams

  • File handling
  • Binary Streams
  • Character stream
  • Serialization
  • Scanner
  • Stream tokenizer
  • String tokenizer
  • GzipOutStream class
  • Data Encryption & decryption

NETWORKING

  • URL, InetAddress
  • Socket And Server Socket
  • Datagram socket
  • Socket factories

Reflection API

  • Determining the class of an object
  • Getting information about classes modifier, fields, methods, constructor, and super classes

COLLECTION FRAMEWORK

  • The Collection Interfaces (list , set, Sorted set)
  • The collection classes (The array list, Linked list, Hash set, Tree set)
  • Accessing a Collection via an Iterator
  • Working with maps
  • Working with Comparators
  • The Collection Algorithms
  • The Legacy Classes and Interfaces (Enumeration, Vector, Stack, Dictionary, Hash table)
  • Date and Time Handling
  • COLLECTION CLASS
  • Array class

BIG DATA HADOOP

  • Big Data – the actual reason for Hadoop
  • Understanding Big data
  • Collecting and cleaning data
  • Traditional approach for processing and its challenges
  • Big data vs Hadoop

An introduction to Hadoop

  • Hadoop overview
  • Hadoop components
  • Hadoop distributions
  • Getting started
  • What is HDFS<
  • What is Map Reduce
  • Hadoop stack
  • Hands On – Hadoop setup and basic operations

HDFS

  • HDFS explained
  • High availability
  • Federation
  • Architecture
  • File system Shell
  • Hands On

Map Reduce

  • Map Reduce flow
  • Hello World
  • Map Reduce API concepts
  • Mapper
  • Reducer
  • Other components – combiner,
  • partitioner, shuffle/sort
  • Hadoop 1.x vs 2.x
  • Hadoop streaming API
  • Hands on with Eclipse

YARN

  • Architecture
  • Scheduler
  • Resource Manager (RM)
  • RM HA
  • YARN commands
  • Hands On with YARN applications

Integrating Hadoop into the Workflow

  • RDBMS interaction using Sqoop
  • Workflow management using Oozie
  • Back office jobs with Zookeeper
  • Hands On with actual data sets

Data Mining

  • Unstructured data using PIG
  • Structured data mining using hive
  • Hands On with actual data sets

HBASE

  • Problem with SQL Database
  • Introduction to NOSQL
  • Hands On Exercises
  • Introduction to HBASE
  • Column Families
  • Delving deeper into HBASE
  • HBASE Architecture
  • HBASE Hands-On Exercises

Delving Deeper Into the Hadoop API

  • More about ToolRunner
  • Testing with MRUnit
  • Reducing Intermediate Data With
  • Combiners
  • The configure and close methods for
  • Map/Reduce Setup and Teardown
  • Writing Partitioners for Better Load
  • Balancing
  • Hands-On Exercise
  • Directly Accessing HDFS
  • Using the Distributed Cache

Practical Development Tips and Techniques

  • Debugging MapReduce Code
  • Using LocalJobRunner Mode for Easier
  • Debugging
  • Retrieving Job Information with Counters
  • Logging
  • Splittable File Formats
  • Determining the Optimal Number of
  • Reducers
  • Map-Only MapReduce Jobs
  • Hands-On Exercise

Joining Data Sets in MapReduce

  • Map-Side Joins
  • The Secondary Sort
  • Reduce-Side Joins

SQL

Introducing SQL Basic Commands

Introduction of sql

  • Writing basic select statements.
  • Restricting and sorting of data
  • Introducing sql function
  • Single row function and group function
  • Conditional Expression
  • Using substitution on variable
  • Introducing sql commands
  • Using DDL Statements
  • Managing Tables
  • Data manipulation operation
  • Understanding transaction
  • Using transaction control statements
  • Overview of locks
  • Using flashback and purge command
  • Granting and revoking system and object privileges
  • Designing tables by using key constraints
  • Deferred constraints
  • Retrieving data from more than one tables using join operation.
  • Aggregating Data Using Group Functions
  • Introducing With View
  • Introducing With Indexes
  • Introducing With Synonyms
  • Introducing With Sequence And Use With Database
  • Introducing Sub queries
  • Single row sub query ,Multiple row sub query
  • Correlated sub query
  • Top - n analysis
  • Using the Set Operators
  • Inserting and Updating Data
  • Deleting Data
  • Creating Other Schema Objects
  • Managing Objects with Data Dictionary Views
  • Date and Time Function
  • Rollup and Cube Operator Using Group by Function
  • Multiple Insertion and Types
  • Hierarchical Tree
  • Sql loader

Advanced SQL

  • Overview of analytical function
  • Introducing inline view
  • Introduction of With clause
  • Materialize view
  • Overview of restore.
  • overview of semi join and anti join
  • Embedding sub query with Dml operation

lINUX

  • Installation of Linux
  • Access the command line
  • Manage files from the command line
  • Create, view, and edit text files
  • Manage local Linux users and groups
COMMENCING NEW BATCHES
ENQUIRY FORM
FOLLOW US ON
SUBSCRIBE TO OUR NEWSLETTER

WE ACCEPT ONLINE PAYMENTS
PAY ONLINE