Learn how to configure your Hadoop cluster to run optimal MapReduce jobs
If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Having prior knowledge of creating MapReduce applications is not necessary, but will help you better understand the concepts and snippets of MapReduce class template code.
MapReduce is the distributed processing model that the Hadoop MapReduce engine uses to spread work across a cluster by processing smaller data sets in parallel. It is useful in a wide range of applications, including distributed pattern-based searching, distributed sorting, web link-graph reversal, term-vector per host, web access log stats, inverted index construction, document clustering, machine learning, and statistical machine translation.
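As an illustration of the model, the sketch below builds an inverted index, one of the applications listed above, using separate map, shuffle, and reduce steps. It is a minimal in-memory simulation in plain Python, not Hadoop code: the function names (`map_phase`, `shuffle`, `reduce_phase`, `build_inverted_index`) and the sample documents are assumptions made for this example, and a real cluster would run the map and reduce steps as parallel tasks on different nodes.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Map step: emit one (word, doc_id) pair per distinct word in a document."""
    for word in set(text.lower().split()):
        yield word, doc_id


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(word, doc_ids):
    """Reduce step: collapse the values for one key into a sorted posting list."""
    return word, sorted(set(doc_ids))


def build_inverted_index(splits):
    """Run map over every split, then shuffle and reduce the combined output."""
    mapped = []
    for split in splits:  # each split stands in for the input of one map task
        for doc_id, text in split:
            mapped.extend(map_phase(doc_id, text))
    return dict(reduce_phase(w, ids) for w, ids in shuffle(mapped).items())


# Hypothetical sample data: two input splits holding three small documents.
splits = [
    [("d1", "hadoop runs mapreduce jobs")],
    [("d2", "mapreduce jobs process data"), ("d3", "hadoop stores data")],
]
index = build_inverted_index(splits)
```

Here `index` maps each word to the list of documents that contain it. Because every map call only sees its own split, the map work can be divided among nodes without coordination, which is the property the model relies on.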
This book introduces you to advanced MapReduce concepts and teaches you everything from identifying the factors that affect MapReduce job performance to tuning the MapReduce configuration. Based on real-world experience, this book will help you to fully utilize your cluster's node resources to run MapReduce jobs optimally.
This book details the Hadoop MapReduce job performance optimization process. Through a number of clear and practical steps, it will help you to fully utilize your cluster's node resources.
Starting with how MapReduce works and the factors that affect MapReduce performance, you will be given an overview of Hadoop metrics and several performance monitoring tools. Further on, you will explore performance counters that help you identify resource bottlenecks, check cluster health, and size your Hadoop cluster. You will also learn about optimizing map and reduce tasks by using Combiners and compression.
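To give a rough sense of why Combiners matter, the sketch below simulates a word count with and without a map-side combine step and compares how many records would cross the shuffle. It is an in-memory Python illustration under stated assumptions, not the Hadoop API: `map_words`, `combine`, `reduce_counts`, and the sample input are hypothetical names and data chosen for this example.

```python
from collections import Counter


def map_words(line):
    """Map step: emit (word, 1) for every word in a line."""
    return [(word, 1) for word in line.lower().split()]


def combine(pairs):
    """Combiner: pre-sum counts on the map side, before the shuffle."""
    totals = Counter()
    for word, count in pairs:
        totals[word] += count
    return list(totals.items())


def reduce_counts(all_pairs):
    """Reduce step: sum the counts for each word across all map outputs."""
    totals = Counter()
    for word, count in all_pairs:
        totals[word] += count
    return dict(totals)


# Hypothetical input: each inner list is the data seen by one map task.
map_inputs = [
    ["to be or not to be"],
    ["to do or not to do"],
]

# Output of each map task, first raw and then after the combiner has run.
raw_outputs = [[p for line in lines for p in map_words(line)] for lines in map_inputs]
combined_outputs = [combine(pairs) for pairs in raw_outputs]

# Number of records that would be sent across the network in each case.
shuffled_without = sum(len(out) for out in raw_outputs)
shuffled_with = sum(len(out) for out in combined_outputs)

# The final counts are the same either way; only the shuffle volume differs.
result_without = reduce_counts(p for out in raw_outputs for p in out)
result_with = reduce_counts(p for out in combined_outputs for p in out)
```

The combiner is safe here because addition is associative and commutative, so summing partial counts early does not change the final result. Operations without that property generally cannot be pre-aggregated this way.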
The book ends with best practices and recommendations on how to use your Hadoop cluster optimally.
Khaled Tannir has been working with computers since 1980. He began programming on the legendary Sinclair ZX81 and later on all the Commodore home computers (VIC-20, Commodore 64, Commodore 128D, and Amiga 500). He has a Bachelor's degree in Electronics and a Master's degree in System Information Architectures, for which he graduated with a professional thesis, and he completed his education with a Research Master's degree.
He is a Microsoft Certified Solution Developer (MCSD) and has more than twenty years of technical experience leading the development and implementation of software solutions and giving technical presentations. He works as an independent IT consultant and has worked as an infrastructure engineer, senior developer, and enterprise/solution architect for many companies in France and Canada.
With significant experience in Microsoft .NET/Servers and Oracle Java technologies, he has extensive skills in online/offline application design, system conversions, and multi-language applications for both the Internet and the desktop. He is always researching and learning about new technologies and looking for new adventures between France, North America, and the Middle East. He owns an IT and electronics laboratory with many servers, monitors, open electronics boards such as Arduino, Netduino, Raspberry Pi, and .NET Gadgeteer, and some smartphone devices based on the Windows Phone, Android, and iOS operating systems.
In 2012, he contributed to EGC 2012 (International Complex Data Mining forum at Bordeaux University, France) and presented, in a workshop session, his work "How to optimize data distribution in a cloud computing environment". This work aims to define an approach to optimize the use of data mining algorithms such as k-means and Apriori in a cloud computing environment. He is the author of the RavenDB 2.x Beginner's Guide book (Packt Publishing) and is a technical reviewer for the Pentaho+MongoDB transformation & reporting book (Packt Publishing). He aims to get a PhD in Cloud Computing and Big Data and wants to keep learning about these technologies.
He enjoys taking landscape and night photos, travelling, playing video games, creating fun electronic gadgets with Arduino and .NET Gadgeteer, and, of course, spending time with his wife and family.
You can reach him at: contact@khaledtannir.net