Buch, Englisch, 421 Seiten, Paperback, Format (B × H): 178 mm x 254 mm, Gewicht: 8279 g
A Definitive Guide to Hadoop-Related Frameworks and Tools
Buch, Englisch, 421 Seiten, Paperback, Format (B × H): 178 mm x 254 mm, Gewicht: 8279 g
ISBN: 978-1-4842-2198-3
Verlag: Apress
While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.
What You Will Learn:
- Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
- Run a MapReduce job
- Store data with Apache Hive, and Apache HBase
- Index data in HDFS with Apache Solr
- Develop a Kafka messaging system
- Stream Logs to HDFS with Apache Flume
- Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop
- Create a Hive table over Apache Solr
- Develop a Mahout User Recommender System
Who This Book Is For:
Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.
Zielgruppe
Professional/practitioner
Autoren/Hrsg.
Fachgebiete
Weitere Infos & Material
Part I. Fundamentals.- Introduction.- 1. HDFS and MapReduce.- Part II Storing & Querying.- 2. Apache Hive.- 3. Apache HBase.- Part III Bulk Transferring & Streaming.- 4. Apache Sqoop.- 5. Apache Flume.- Part IV Serializing.- 6. Apache Avro.- 7. Apache Parquet.- Part V Messaging & Indexing.- 8. Apache Kafka.- 9. Apache Solr.- 10.Apache Mahout.