Unit 2 BDA
Unit 2 BDA
• Key Features:
• Applications:
2. System Principle
• Core Concept: Hadoop distributes data across multiple nodes and processes it
in parallel, ensuring high efficiency.
• Key Components:
3. Hadoop Architecture
• Layers:
• Overview: HDFS is designed for storing large datasets across multiple nodes.
• Features:
1. Data Blocks: Files are split into blocks (default size: 128 MB).
5. Hadoop MapReduce
• How it Works:
• Advantages:
• Components:
• Advantages:
• Installation:
• Modes:
8. Hadoop Commands
• HDFS Commands:
• YARN Commands:
• Using HDFS:
• Integration Tools: Sqoop for transferring data between Hadoop and relational
databases.
10. Hadoop Programming