Moving Hadoop to the Cloud

ebook Harnessing Cloud Features and Flexibility for Hadoop Clusters

By Bill Havanki

cover image of Moving Hadoop to the Cloud

Sign up to save your library

With an OverDrive account, you can save your favorite libraries for at-a-glance information about availability. Find out more about OverDrive accounts.

   Not today

Find this title in Libby, the library reading app by OverDrive.

Download Libby on the App Store Download Libby on Google Play

Search for a digital library with this title

Title found at these libraries:

Loading...

Until recently, Hadoop deployments existed on hardware owned and run by organizations. Now, of course, you can acquire the computing resources and network connectivity to run Hadoop clusters in the cloud. But there's a lot more to deploying Hadoop to the public cloud than simply renting machines.

This hands-on guide shows developers and systems administrators familiar with Hadoop how to install, use, and manage cloud-born clusters efficiently. You'll learn how to architect clusters that work with cloud-provider features—not just to avoid pitfalls, but also to take full advantage of these services. You'll also compare the Amazon, Google, and Microsoft clouds, and learn how to set up clusters in each of them.

  • Learn how Hadoop clusters run in the cloud, the problems they can help you solve, and their potential drawbacks
  • Examine the common concepts of cloud providers, including compute capabilities, networking and security, and storage
  • Build a functional Hadoop cluster on cloud infrastructure, and learn what the major providers require
  • Explore use cases for high availability, relational data with Hive, and complex analytics with Spark
  • Get patterns and practices for running cloud clusters, from designing for price and security to dealing with maintenance
  • Moving Hadoop to the Cloud