1) The Hadoop Distributed File System (HDFS)
· HDFS Design & Concepts
· Blocks, Name nodes and Data nodes
· Hadoop DFS The Command-Line Interface
· Basic File System Operations
· Reading Data from a Hadoop URL
· Reading Data Using the File System API
2) Map Reduce
· Map and Reduce Basics.
· How Map Reduce Works
· Anatomy of a Map Reduce Job Run
· Job Submission, Job Initialization, Task Assignment, Task Execution
· Progress and Status Updates
· Job Completion, Failures
· Shuffling and Sorting.
· Combiner
· Hadoop Streaming
8) Map/Reduce Programming - Java
· Hands on "Word Count" in Map/Reduce in Eclipse
· Sorting files using Hadoop Configuration API discussion
· Emulating "grep" for searching inside a file in Hadoop
· Chain Mapping API discussion
· Job Dependency API discussion and Hands on
· Input Format API discussion and hands on
· Input Split API discussion and hands on
· Custom Data type creation in Hadoop
4) Discussion on some business use cases