- Machine generated contents note: 1.Core Technologies -- Hadoop Distributed File System (HDFS) -- MapReduce -- YARN -- Spark -- 2.Database and Data Management -- Cassandra -- HBase -- Accumulo -- Memcached -- Blur -- Solr -- MongoDB -- Hive -- Spark SQL (formerly Shark) -- Giraph -- 3.Serialization -- Avro -- JSON -- Protocol Buffers (protobuf) -- Parquet -- 4.Management and Monitoring -- Ambari -- HCatalog -- Nagios -- Puppet -- Chef -- ZooKeeper -- Oozie -- Ganglia -- 5.Analytic Helpers -- MapReduce Interfaces -- Analytic Libraries -- Pig -- Hadoop Streaming -- Mahout -- MLLib -- Hadoop Image Processing Interface (HIPI) -- SpatialHadoop -- 6.Data Transfer -- Sqoop -- Flume -- DistCp -- Storm -- 7.Security, Access Control, and Auditing -- Sentry -- Kerberos -- Knox -- 8.Cloud Computing and Virtualization -- Serengeti -- Docker.
- "If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You'll quickly understand how Hadoop's projects, subprojects, and related technologies work together"--Back cover.
- 9781491947937 (pbk.)
- Source of Acquisition:
- Purchased with funds from the James and Joyce Gettys Libraries Endowment in the Math Library and in the School of Information Sciences and Technology; 2015.
Purchased with funds from the James and Joyce Gettys Libraries Endowment in the Math Library and in the School of Information Sciences and Technology; 2015
- Endowment Note:
- James and Joyce Gettys Libraries Endowment in the Math Library and in the School of Information Sciences and Technology
View MARC record | catkey: 17038632