HDP Developer: Java

This advanced course gives Java programmers a deep dive into Hadoop application development.

About this course

Overview:
This advanced course gives Java programmers a deep dive into Hadoop application development. Students learn to design and develop efficient MapReduce applications for Hadoop on the Hortonworks Data Platform, including how to implement combiners, partitioners, secondary sorts, and custom input and output formats; join large datasets; write unit tests; and develop UDFs for Pig and Hive. Labs run on a 7-node HDP 2.1 cluster hosted in a virtual machine that students can keep after the training.

Target Audience:
Experienced Java software engineers who need to develop Java MapReduce applications for Hadoop.

Prerequisites:
Students must have experience developing Java applications and using a Java IDE. Labs are completed using the Eclipse IDE and Gradle. No prior Hadoop knowledge is required.

Format:
Live Instructor Lecture and Hands-On Labs

 

How to Register:

  1. Click the "Purchase" button at the top of the page to start your purchase.
  2. After completing your purchase and registration, log in to your account and select the event you wish to attend from the classes scheduled below.

Curriculum

  • Course Logistics
  • HDP Developer: Java Live Training Schedule
  • Lesson 1: Understanding Hadoop and HDFS
  • Demonstration: Understanding Block Storage
  • Lab Guide: Configuring a Hadoop Development Environment
  • Lab Guide: Putting Files in HDFS with Java
  • Lesson 2: Writing MapReduce Applications
  • Demonstration: Understanding MapReduce
  • Lab Guide: Word Count
  • Lab Guide: Distributed Grep
  • Lab Guide: Inverted Index
  • Lesson 3: Map Aggregation
  • Lab Guide: Using a Combiner
  • Lab Guide: Computing an Average
  • Lesson 4: Partitioning and Sorting
  • Lab Guide: Writing a Custom Partitioner
  • Lab Guide: Using TotalOrderPartitioner
  • Lab Guide: Custom Sorting
  • Lesson 5: Input and Output Formats
  • Demonstration: Combining Input Files
  • Demonstration: Processing Multiple Inputs
  • Lab Guide: Writing a Custom InputFormat
  • Lab Guide: Customizing Output
  • Lab Guide: Simple Moving Average
  • Lesson 6: Optimizing MapReduce Jobs
  • Lab Guide: Using Data Compression
  • Lab Guide: Defining a RawComparator
  • Lesson 7: Advanced MapReduce Features
  • Lab Guide: Performing a Map-Side Join
  • Lab Guide: Using a Bloom Filter
  • Lesson 8: Unit Testing
  • Lab Guide: Unit Testing a MapReduce Job
  • Lab Guide: An HBase MapReduce Job
  • Lesson 9: HBase Programming
  • Lab Guide: Importing Data to HBase
  • Lesson 10: Pig Programming
  • Demonstration: Understanding Pig
  • Lab Guide: Writing a Pig UDF
  • Lab Guide: Writing a Pig Accumulator
  • Lesson 11: Hive Programming
  • Lab Guide: Writing a Hive UDF
  • Lesson 12: Defining Workflow
  • Lab Guide: Defining an Oozie Workflow
  • Lab Guide: TF-IDF and the JobControl Class
  • Wrapping Up
  • Course & Instructor Survey
