Srinivasa / Muppalla | Guide to High Performance Distributed Computing | E-Book | www.sack.de
E-book, English, 304 pages

Series: Computer Communications and Networks

Srinivasa / Muppalla Guide to High Performance Distributed Computing

Case Studies with Hadoop, Scalding and Spark
2015
ISBN: 978-3-319-13497-0
Publisher: Springer International Publishing
Format: PDF
Copy protection: 1 - PDF Watermark




This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies such as Hadoop, Scalding and Spark.

Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks.

Topics and features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; provides detailed case studies on approaches to clustering, data classification and regression analysis; explains the process of creating a working recommender system using Scalding and Spark; supplies a complete list of supplementary source code and datasets at an associated website.

Meeting the need for both introductory material for undergraduate computer science students and detailed discussion for software engineering professionals, this book helps a broad audience understand the esoteric aspects of practical high performance computing through solved problems, research case studies and working source code.

Target audience


Graduate

Further information & material


Part I: Programming Fundamentals of High Performance Distributed Computing

Introduction

Getting Started with Hadoop

Getting Started with Spark

Programming Internals of Scalding and Spark

Part II: Case Studies using Hadoop, Scalding and Spark

Case Study I: Data Clustering using Scalding and Spark

Case Study II: Data Classification using Scalding and Spark

Case Study III: Regression Analysis using Scalding and Spark

Case Study IV: Recommender System using Scalding and Spark


