Project Proposal - HDFS
Project Proposal - HDFS
HadoopisanindispensabletoolforBigDatacomputing.Likeanyotherdistributed
system, the success of its operation is augmented by its distributed file system
architectureknownasHDFS. HDFS ishighlyfaulttolerantandisdesignedtobe
deployed on lowcost hardware. HDFS provides high throughput access to
applicationdataandissuitableforapplicationsthathavelargedatasets.
2Requirements
ThisprojectwillinvolveWalkthroughoftheopensourceHadoopsourcecodeto
understand and illustrate following HDFS operations. Each file system operation
listedbelowwilllistfunctionsandlibrariescalledonboththeclientandtheserver
whenafileoperationoccurs.
1) Open.
2) Read.
3) Seek
4) Write
5) SecurityofFilesforoperation14.
References
1. HDFSsourcecode:http://hadoop.apache.org/hdfs/version_control.html
2HDFSJavaAPI:http://hadoop.apache.org/core/docs/current/api/