Pro Apache Hadoop 2nd Edition

This book covers building and administering Hadoop clusters to analyze large volumes of data with MapReduce. It teaches how to break large problems into smaller, parallelizable chunks and how Hadoop distributes software across nodes. Its 17 chapters cover Hadoop concepts, installation, administration, MapReduce development, input and output, testing, monitoring, data warehousing, Pig, HCatalog, log analysis, real-time systems with HBase, data science, cloud deployment, and building YARN applications. Readers will learn how to build resilient Hadoop clusters, optimize tasks, implement proven patterns, and scale out using HDFS Federations to analyze data in a short time.

Uploaded by

Dreamtech Press
Copyright
© All Rights Reserved

PRO APACHE HADOOP

2nd Edition

ABOUT THE BOOK

This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Learn how to let Hadoop take care of distributing and parallelizing your software. You just focus on the code; Hadoop takes care of the rest.

TABLE OF CONTENTS

₹699

1. Motivation for Big Data
2. Hadoop Concepts
3. Getting Started with the Hadoop Framework
4. Hadoop Administration
5. Basics of MapReduce Development
6. Advanced MapReduce Development
7. Hadoop Input Output
8. Testing Hadoop Programs
9. Monitoring Hadoop
10. Data Warehousing using Hadoop
11. Data Processing using Pig
12. HCatalog and Hadoop in the Enterprise
13. Log Analysis using Hadoop
14. Building Real-Time Systems using HBase
15. Data Science With Hadoop
16. Hadoop in the Cloud
17. Building a YARN Application

ISBN: 9788132232438 | Pages: 444 | Authors: Wadkar, Siddalingaiah, Venner

WHAT YOU'LL LEARN

Build a resilient and scalable Hadoop compute cluster.

Analyze large volumes of data in amazingly short time.

Optimize Hadoop tasks like a seasoned professional.

Implement bulletproof patterns that are proven successful.

Scale out using the new HDFS Federations feature set.

Chunk large problems into highly parallel MapReduce modules.


Published by:

DREAMTECH PRESS
19-A, Ansari Road, Daryaganj
New Delhi-110 002, INDIA
Tel: +91-11-2324 3463-73, Fax: +91-11-2324 3078
Email: [email protected]
Website: www.dreamtechpress.com
Blog: dreamtechpress.wordpress.com
Social media: /dtechpress, /dreamtechpress

Exclusively distributed by:

WILEY INDIA PVT. LTD.
4435-36/7, Ansari Road, Daryaganj
New Delhi-110 002, INDIA
Tel: +91-11-4363 0000, Fax: +91-11-2327 5895
Email: [email protected]
Website: www.wileyindia.com
