Expert Hadoop Administration Managing, Tuning, and Securing Spark, YARN, and HDFS

by
Edition: 1st
Format: Paperback
Pub. Date: 2016-12-06
Publisher(s): Addison-Wesley Professional
  • Complimentary 7-Day eTextbook Access - Read more
    When you rent or buy this book, you will receive complimentary 7-day online access to the eTextbook version from your PC, Mac, tablet, or smartphone. Feature not included on Marketplace Items.
List Price: $49.99

Buy New

Usually Ships in 2-3 Business Days
$49.94

Rent Book

Select for Price
There was a problem. Please try again later.

Rent Digital

Rent Digital Options
Online:1825 Days access
Downloadable:Lifetime Access
$57.59
*To support the delivery of the digital material to you, a digital delivery fee of $3.99 will be charged on each digital item.
$57.59*

Used Book

We're Sorry
Sold Out

How Marketplace Works:

  • This item is offered by an independent seller and not shipped from our warehouse
  • Item details like edition and cover design may differ from our description; see seller's comments before ordering.
  • Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
  • Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
  • Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.

Summary

The Comprehensive, Up-to-Date Apache Hadoop Administration Handbook and Reference

“Sam Alapati has worked with production Hadoop clusters for six years. His unique depth of experience has enabled him to write the go-to resource for all administrators looking to spec, size, expand, and secure production Hadoop clusters of any size.”

–Paul Dix, Series Editor

In Expert Hadoop® Administration, leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimizing production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples.


Alapati demystifies complex Hadoop environments, helping you understand exactly what happens behind the scenes when you administer your cluster. You’ll gain unprecedented insight as you walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes. The high-value administration skills you learn here will be indispensable no matter what Hadoop distribution you use or what Hadoop applications you run.


  • Understand Hadoop’s architecture from an administrator’s standpoint
  • Create simple and fully distributed clusters
  • Run MapReduce and Spark applications in a Hadoop cluster
  • Manage and protect Hadoop data and high availability
  • Work with HDFS commands, file permissions, and storage management
  • Move data, and use YARN to allocate resources and schedule jobs
  • Manage job workflows with Oozie and Hue
  • Secure, monitor, log, and optimize Hadoop
  • Benchmark and troubleshoot Hadoop

Author Biography

Sam R. Alapati has been working with various aspects of the Hadoop environment for the past six years. He is currently the principal Hadoop administrator at Sabre Corporation in Westlake, Texas, and works on a daily basis with multiple large Hadoop 2 clusters. In addition to being the point person for all Hadoop administration at Sabre, Sam manages multiple critical data-science- and data-analysis-related Hadoop job flows and is also an expert Oracle Database Administrator. His vast knowledge of relational databases and SQL contributes to his work with Hadoop related projects. Sam’s recognition in the database and middleware area includes having published 18 well-received books over the past 14 years, mostly on Oracle Database Administration and Oracle Weblogic Server. His experience dealing with numerous configuration, architectural, and performance-related Hadoop issues over the years led him to the realization that many working Hadoop administrators and developers would appreciate having a handy reference such as this book to turn to when creating, managing, securing and optimizing their Hadoop infrastructure.

Table of Contents

Part 1: Introduction to Hadoop 2 - Architecture, Hadoop Clusters and Application Frameworks
1. Introduction to Hadoop 2 and its Architecture
2. Creating and Configuring a Simple Hadoop 2 Cluster
3. Planning for and Creating a Fully-Distributed Cluster
4. Running Applications in a Cluster the MapReduce Framework (and Pig, Hive)
5. Running Applications in a Cluster the Spark Framework on Hadoop

 

Part II: Managing and Protecting Hadoop Data and High Availability
6. The Role of the NameNode and how HDFS Works
7. HDFS Commands, File Permissions and HDFS Storage Management
8. Data Protection, Compression and Hadoop Data Formats
9. NameNode Operations and High Availability

 

Part III: Moving Data, Allocating Resources and Scheduling Jobs
10. Moving Data into and out of Hadoop
11. YARN and Resource Allocation in a Hadoop Cluster
12 Working with Oozie and Hue to Manage Job Workflows

 

Part IV: Security, Monitoring, Optimization and Troubleshooting
13. Securing Hadoop
14. Monitoring and Logging
15. Managing YARN Jobs and Performing Common Management Tasks
16. Benchmarking and Optimization
17. Troubleshooting Hadoop 2

 

Appendix. Installing the Virtual Environment for Hadoop 2

An electronic version of this book is available through VitalSource.

This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.

By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.

Digital License

You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.

More details can be found here.

A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.

Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.

Please view the compatibility matrix prior to purchase.