9798290030715 (2 results)


ISBN: 9798290030715


Big Data With Pyspark: Processing Large Datasets: A Hands-On Guide To Distributed Data Engineering, Machine Learning And Big Data Pipelines With Apache Spark And Python

PythQuill Publishing

Publisher: Independently published, 2025

ISBN 13: 9798290030715

Language: English

Seller: Best Price, Torrance, CA, USA

Seller rating: 5 out of 5 stars


New - Softcover
Condition: New

EUR 13,74

EUR 25,49 for shipping from the USA to Germany

Quantity: 2 available

Condition: New. SUPER FAST SHIPPING.

Big Data With Pyspark (Paperback)

Pythquill Publishing

Publisher: Independently Published, 2025

ISBN 13: 9798290030715

Language: English

Seller: CitiRetail, Stevenage, United Kingdom

Seller rating: 5 out of 5 stars


New - Softcover
Condition: New

EUR 22,62

EUR 28,91 for shipping from the United Kingdom to Germany

Quantity: 1 available

Paperback. Condition: New.

What You'll Learn

Understand the Foundations of Big Data and Distributed Computing: Gain a solid grasp of Big Data concepts, including the 5 Vs, the challenges of traditional systems, and the fundamental principles of distributed computing like parallelism, fault tolerance, and scalability.

Master the PySpark Ecosystem: Learn the architecture of Apache Spark, its core components (Spark SQL, Structured Streaming, MLlib, GraphFrames), and how the PySpark API integrates with Python.

Set Up Your PySpark Environment: Get hands-on experience setting up a complete development environment on your local machine and learn how to run applications on cloud platforms like Databricks, AWS EMR, and Google Cloud Dataproc.

Process Data with RDDs and DataFrames: Master Spark's core data structures, from the low-level RDDs to the optimized DataFrames. Learn to apply a wide range of transformations and actions for data manipulation.

Perform Advanced Data Wrangling and Feature Engineering: Acquire skills in data cleaning, handling missing values and duplicates, and performing complex transformations using Spark SQL, Window Functions, and User-Defined Functions (UDFs), including high-performance Pandas UDFs.

Connect to Diverse Data Sources: Read and write data in various formats (CSV, JSON, Parquet) and connect to external systems like relational databases (JDBC), NoSQL stores (Cassandra, MongoDB), and cloud storage (S3, ADLS).

Build Real-Time Data Pipelines: Implement fault-tolerant data ingestion with Structured Streaming, including handling event time, watermarking, and performing stateful transformations for real-time analytics.

Apply Machine Learning at Scale with MLlib: Learn to build and evaluate distributed machine learning pipelines for classification, regression, and clustering tasks using Spark's MLlib library.

Analyze Graph-Structured Data: Use GraphFrames to model and analyze complex relationships, run graph algorithms like PageRank, and find patterns in network data.

Optimize PySpark Applications for Performance: Study performance tuning, including understanding DAGs and shuffles, managing partitioning, optimizing joins, and configuring memory settings to make your code run faster and more efficiently.

Monitor, Debug, and Deploy Applications: Use the Spark UI to monitor your jobs, troubleshoot common errors, and learn to package and deploy your PySpark applications to cluster managers like YARN and Kubernetes.

Solve Real-World Big Data Problems: Apply your knowledge through practical case studies, including building a recommendation engine, a real-time fraud detection system, and an ETL pipeline, to solidify your skills and build a portfolio.

Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability.
