License | Apache 2.0 is used for licensing which is open-source. | Apache 2.0 is used for licensing which is open-source. |
---|
Minimum supported Java version | JAVA 7 is the minimum compatible version. | JAVA 8 is the minimum compatible version. |
---|
Fault Tolerance | Replication is the only way to handle fault tolerance which is not space optimized. | Erasure coding is used for handling fault tolerance. |
---|
Data Balancing | HDFS balancer is used for Data Balancing. | Intra-data node balancer is used which is called via HDFS disk-balancer command-line interface. |
---|
Storage Scheme | 3x Replication Scheme is used. | uses eraser encoding in HDFS. |
---|
Storage Overhead | 200% of HDFS is consumed in Hadoop 2.x | 50% used in Hadoop 3.x means we have more space to work. |
---|
YARN Timeline Service | Uses timeline service with scalability issue. | Improve the time line service along with improving scalability and reliability of this service. |
---|
Scalability | Limited Scalability, can have upto 10000 nodes in a cluster. | Scalability is improved, can have more then 10000 nodes in a cluster. |
---|
Default Port Range (32768-61000) | Linux ephemeral port range is used as default, which is failed to bind at startup time. | Ports used are out of this ephemeral port range. |
---|
Compatible File System. | HDFS(default), FTP, Amazon S3 and Windows Azure Storage Blobs (WASB) file system. | All file systems including Microsoft Azure Data Lake filesystem. |
---|
Name Node recovery | Manual intervention is needed for the namenode recovery. | No need of Manual intervention for name node recovery. |
---|
Native Support for Docker / Containers | Not available | Supports Docker-based container execution in YARN, allowing better isolation and dependency management |
---|
GPU Support in YARN | Does not support | Supports GPU-aware scheduling in YARN which important for AI/ML workloads |
---|
Backward Compatibility | Not all tools supported out-of-box | Backward compatible and supports newer versions of Hive, HBase, etc. |
---|
Shell Script Rewrite | Legacy shell scripts | Rewritten shell scripts in better cross-platform compatibility |
---|