Unit 2 Notes BDA
Unit 2 Notes BDA
**Limitations:**
HDFS is optimized for batch processing and is not suitable for
low-latency or real-time access patterns.
In summary, HDFS is a distributed file system designed to
store and manage large datasets across a cluster of machines.
Its architecture ensures fault tolerance, data replication, and
efficient data storage and retrieval, making it a core
component of the Hadoop ecosystem.