Field Guide to Hadoop

Type: Analyst Research

Product: Data Integration, Big Data Analytics

This book is recommended for IT managers, developers, data analysts, system architects, and similar technical workers, who are faced with having to replace current systems and skills with the new set required by NoSQL and Hadoop, or those who want to deepen their understanding of complementary technologies and databases.

Content covers:

  • An overview of Hadoop’s core technologies including Hadoop Distributed File Systems (HDFS), MapReduce, YARN, and Spark, plus get sample code and links to relevant tutorials
  • Overviews, sample code and links to tutorials for common databases used with Hadoop such as MongoDB, Cassandra, HBase, Hive, Shark, Blur, Accumulo, Memcached, Solr, and Giraph
  • The advantages of different databases, in terms of speed, scalability, security, configurability, text indexing, SQL support, and more
  • Which databases are better for different purposes, such as transactional, relational analytics, sparse data, multi-tenant support, and more