Hive

[toggle title=”SESSION 1: INTRODUCTION TO HADOOP”]

WHAT IS BIG DATA?

TYPES OF DATA

LIMITATION OF RDBMS

FEATURES OF HADOOP

HDFS

MAP-REDUCE

BLOCK- REPLICATION

HADOOP ARCHITECTURE[/toggle]

[toggle title=”SESSION 2: INTRODUCTION TO HIVE”]

LINUX COMMANDS

HIVE INTRODUCTION

SQL BASICS FOR HIVE

HIVE COMMANDS USING HIVE COMMAND LINE

LOADING FILES IN TO HIVE[/toggle]

[toggle title=”SESSION 3: FILE FORMAT”]

INTERNAL AND EXTERNAL TABLES IN HIVE

DIFFERENCE FILE FORMATS (AVRO, ORC, SEQUENTIAL FILES)

CREATING AVRO FILE TABLES

RC FILE

CREATING ORC FILE TABLES

SEQUENTIAL FILE TABLES

PARQUIT FILEFORMAT[/toggle]

[toggle title=”SESSION 4: SQOOP”]

INTRODUCTION TO SQOOP

SQOOP IMPORT

SQOOP IMPORT (TARGET-DIR)

SQOOP IMPORT SUBSET OF TABLE

SQOOP IMPORT INCREMENTAL IMPORT

SQOOP IMPORT ALL

SQOOP JOB

SQOOP EVAL

SQOOP EXPORT[/toggle]

[toggle title=”SESSION 5: PERFORMANCES TUNING IN HIVE”]

HIVE PARTITIONING

ADDING STATIC PARTITION

VERIFYING PARTITION DIRECTORIES

HIVE BUCKETING

HIVE INDEXING[/toggle]

[toggle title=”SESSION 6: MAP JOINS, REDUCER JOINS”]

MEANING OF JOINS.

INNER JOIN

LEFT OUTER JOIN

RIGHT OUTER JOIN

FULL OUTER JOIN

MAP SIDE JOIN – STRUCTURE

MAP SIDE JOIN – SYNTAX[/toggle]