[toggle title=”SESSION 1: INTRODUCTION TO HADOOP”]
WHAT IS BIG DATA?
TYPES OF DATA
LIMITATION OF RDBMS
FEATURES OF HADOOP
HDFS
MAP-REDUCE
BLOCK- REPLICATION
HADOOP ARCHITECTURE[/toggle]
[toggle title=”SESSION 2: INTRODUCTION TO HIVE”]
LINUX COMMANDS
HIVE INTRODUCTION
SQL BASICS FOR HIVE
HIVE COMMANDS USING HIVE COMMAND LINE
LOADING FILES IN TO HIVE[/toggle]
[toggle title=”SESSION 3: FILE FORMAT”]
INTERNAL AND EXTERNAL TABLES IN HIVE
DIFFERENCE FILE FORMATS (AVRO, ORC, SEQUENTIAL FILES)
CREATING AVRO FILE TABLES
RC FILE
CREATING ORC FILE TABLES
SEQUENTIAL FILE TABLES
PARQUIT FILEFORMAT[/toggle]
[toggle title=”SESSION 4: SQOOP”]
INTRODUCTION TO SQOOP
SQOOP IMPORT
SQOOP IMPORT (TARGET-DIR)
SQOOP IMPORT SUBSET OF TABLE
SQOOP IMPORT INCREMENTAL IMPORT
SQOOP IMPORT ALL
SQOOP JOB
SQOOP EVAL
SQOOP EXPORT[/toggle]
[toggle title=”SESSION 5: PERFORMANCES TUNING IN HIVE”]
HIVE PARTITIONING
ADDING STATIC PARTITION
VERIFYING PARTITION DIRECTORIES
HIVE BUCKETING
HIVE INDEXING[/toggle]
[toggle title=”SESSION 6: MAP JOINS, REDUCER JOINS”]
MEANING OF JOINS.
INNER JOIN
LEFT OUTER JOIN
RIGHT OUTER JOIN
FULL OUTER JOIN
MAP SIDE JOIN – STRUCTURE
MAP SIDE JOIN – SYNTAX[/toggle]