Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala - Softcover

Eric Tome; Rupam Bhattacharjee; David Radford

9781804612583: Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala

Softcover

ISBN 10: 1804612588 ISBN 13: 9781804612583

Verlag: Packt Publishing, 2024

Alle Exemplare dieser ISBN-Ausgabe

2 Gebraucht

Von EUR 38,94

17 Neu

Von EUR 45,20

Take your data engineering skills to the next level by learning how to utilize Scala and functional programming to create continuous and scheduled pipelines that ingest, transform, and aggregate data

Key Features

Transform data into a clean and trusted source of information for your organization using Scala
Build streaming and batch-processing pipelines with step-by-step explanations
Implement and orchestrate your pipelines by following CI/CD best practices and test-driven development (TDD)
Purchase of the print or Kindle book includes a free PDF eBook

Book Description

Most data engineers know that performance issues in a distributed computing environment can easily lead to issues impacting the overall efficiency and effectiveness of data engineering tasks. While Python remains a popular choice for data engineering due to its ease of use, Scala shines in scenarios where the performance of distributed data processing is paramount.

This book will teach you how to leverage the Scala programming language on the Spark framework and use the latest cloud technologies to build continuous and triggered data pipelines. You’ll do this by setting up a data engineering environment for local development and scalable distributed cloud deployments using data engineering best practices, test-driven development, and CI/CD. You’ll also get to grips with DataFrame API, Dataset API, and Spark SQL API and its use. Data profiling and quality in Scala will also be covered, alongside techniques for orchestrating and performance tuning your end-to-end pipelines to deliver data to your end users.

By the end of this book, you will be able to build streaming and batch data pipelines using Scala while following software engineering best practices.

What you will learn

Set up your development environment to build pipelines in Scala
Get to grips with polymorphic functions, type parameterization, and Scala implicits
Use Spark DataFrames, Datasets, and Spark SQL with Scala
Read and write data to object stores
Profile and clean your data using Deequ
Performance tune your data pipelines using Scala

Who this book is for

This book is for data engineers who have experience in working with data and want to understand how to transform raw data into a clean, trusted, and valuable source of information for their organization using Scala and the latest cloud technologies.

Scala Essentials for Data Engineers
Environment Setup
An Introduction to Apache Spark and Its APIs – DataFrame, Dataset, and Spark SQL
Working with Databases
Object Stores and Data Lakes
Understanding Data Transformation
Data Profiling and Data Quality
Test-Driven Development, Code Health, and Maintainability
CI/CD with GitHub
Data Pipeline Orchestration
Performance Tuning
Building Batch Pipelines Using Spark and Scala
Building Streaming Pipelines Using Spark and Scala

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

�ber die Autorinnen und Autoren

Eric Tome has over 25 years of experience working with data. He has contributed to and led teams that ingested, cleansed, standardized, and prepared data used by business intelligence, data science, and operations teams. He has a background in mathematics and currently works as a senior solutions architect at Databricks, helping customers solve their data and AI challenges.

Rupam Bhattacharjee works as a lead data engineer at IBM. He has architected and developed data pipelines, processing massive structured and unstructured data using Spark and Scala for on-premises Hadoop and K8s clusters on the public cloud. He has a degree in electrical engineering.

David Radford has worked in big data for over 10 years, with a focus on cloud technologies. He led consulting teams for several years, completing a migration from legacy systems to modern data stacks. He holds a master's degree in computer science and works as a senior solutions architect at Databricks.

��ber diesen Titel� kann sich auf eine andere Ausgabe dieses Titels beziehen.

VerlagPackt Publishing
Erscheinungsdatum2024
ISBN 10 1804612588
ISBN 13 9781804612583
EinbandTapa blanda
SpracheEnglisch
Anzahl der Seiten300
Kontakt zum HerstellerNicht verf�gbar

Gebraucht kaufen

Zustand: Wie neu

Unread book in perfect condition...

Diesen Artikel anzeigen

EUR 38,94

W�hrung umrechnen

EUR 17,42 f�r den Versand von USA nach Deutschland

Versandziele, Kosten & Dauer

In den Warenkorb

Neu kaufen

Diesen Artikel anzeigen

EUR 45,20

W�hrung umrechnen

Gratis f�r den Versand innerhalb von/der Deutschland

Versandziele, Kosten & Dauer

In den Warenkorb

Suchergebnisse f�r Data Engineering with Scala and Spark: Build streaming...

Foto des Verk�ufers

Data Engineering with Scala and Spark : Build streaming and batch pipelines that process massive amounts of data using Scala

Eric Tome

Verlag: Packt Publishing Jan 2024, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Taschenbuch

Anbieter: AHA-BUCH GmbH, Einbeck, Deutschland

Verk�uferbewertung 5 von 5 Sternen

Taschenbuch. Zustand: Neu. Neuware - Take your data engineering skills to the next level by learning how to utilize Scala and functional programming to create continuous and scheduled pipelines that ingest, transform, and aggregate dataKey FeaturesTransform data into a clean and trusted source of information for your organization using ScalaBuild streaming and batch-processing pipelines with step-by-step explanationsImplement and orchestrate your pipelines by following CI/CD best practices and test-driven development (TDD)Purchase of the print or Kindle book includes a free PDF Elektronisches BuchBook DescriptionMost data engineers know that performance issues in a distributed computing environment can easily lead to issues impacting the overall efficiency and effectiveness of data engineering tasks. While Python remains a popular choice for data engineering due to its ease of use, Scala shines in scenarios where the performance of distributed data processing is paramount.This book will teach you how to leverage the Scala programming language on the Spark framework and use the latest cloud technologies to build continuous and triggered data pipelines. You'll do this by setting up a data engineering environment for local development and scalable distributed cloud deployments using data engineering best practices, test-driven development, and CI/CD. You'll also get to grips with DataFrame API, Dataset API, and Spark SQL API and its use. Data profiling and quality in Scala will also be covered, alongside techniques for orchestrating and performance tuning your end-to-end pipelines to deliver data to your end users.By the end of this book, you will be able to build streaming and batch data pipelines using Scala while following software engineering best practices.What you will learnSet up your development environment to build pipelines in ScalaGet to grips with polymorphic functions, type parameterization, and Scala implicitsUse Spark DataFrames, Datasets, and Spark SQL with ScalaRead and write data to object storesProfile and clean your data using DeequPerformance tune your data pipelines using ScalaWho this book is forThis book is for data engineers who have experience in working with data and want to understand how to transform raw data into a clean, trusted, and valuable source of information for their organization using Scala and the latest cloud technologies. Table of ContentsScala Essentials for Data EngineersEnvironment SetupAn Introduction to Apache Spark and Its APIs - DataFrame, Dataset, and Spark SQLWorking with DatabasesObject Stores and Data LakesUnderstanding Data TransformationData Profiling and Data QualityTest-Driven Development, Code Health, and MaintainabilityCI/CD with GitHubData Pipeline OrchestrationPerformance TuningBuilding Batch Pipelines Using Spark and ScalaBuilding Streaming Pipelines Using Spark and Scala. Bestandsnummer des Verk�ufers 9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 45,20

W�hrung umrechnen

Versand: Gratis

Innerhalb Deutschlands

Versandziele, Kosten & Dauer

Anzahl: 1 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala

Eric Tome; Rupam Bhattacharjee; David Radford

Verlag: Packt Publishing, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Softcover

Anbieter: California Books, Miami, FL, USA

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. Bestandsnummer des Verk�ufers I-9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 36,79

W�hrung umrechnen

Versand: EUR 8,71

Von USA nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala

Eric Tome; Rupam Bhattacharjee; David Radford

Verlag: Packt Publishing, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Softcover

Anbieter: Ria Christie Collections, Uxbridge, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. In. Bestandsnummer des Verk�ufers ria9781804612583_new

Verk�ufer kontaktieren

Neu kaufen

EUR 39,89

W�hrung umrechnen

Versand: EUR 5,82

Von Vereinigtes K�nigreich nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark

Eric Tome

Verlag: Packt Publishing Limited, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu PAP

Print-on-Demand

Anbieter: PBShop.store US, Wood Dale, IL, USA

Verk�uferbewertung 5 von 5 Sternen

PAP. Zustand: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bestandsnummer des Verk�ufers L0-9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 45,09

W�hrung umrechnen

Versand: EUR 0,68

Von USA nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

DATA ENGINEERING WITH SCALA AND SPARK:

TOME, ERIC

Verlag: Packt Publishing, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Softcover

Anbieter: Speedyhen, London, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Zustand: NEW. Bestandsnummer des Verk�ufers NW9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 40,09

W�hrung umrechnen

Versand: EUR 5,83

Von Vereinigtes K�nigreich nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: 1 verf�gbar

In den Warenkorb

Foto des Verk�ufers

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala (Paperback or Softback)

Tome, Eric

Verlag: Packt Publishing 1/31/2024, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Paperback or Softback

Anbieter: BargainBookStores, Grand Rapids, MI, USA

Verk�uferbewertung 5 von 5 Sternen

Paperback or Softback. Zustand: New. Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala 1.14. Book. Bestandsnummer des Verk�ufers BBS-9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 36,87

W�hrung umrechnen

Versand: EUR 10,89

Von USA nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: 5 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark

Eric Tome

Verlag: Packt Publishing Limited, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu PAP

Print-on-Demand

Anbieter: PBShop.store UK, Fairford, GLOS, Vereinigtes K�nigreich

Verk�uferbewertung 4 von 5 Sternen

PAP. Zustand: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bestandsnummer des Verk�ufers L0-9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 43,58

W�hrung umrechnen

Versand: EUR 4,61

Von Vereinigtes K�nigreich nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala

Eric Tome

Verlag: Packt Publishing, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Softcover

Anbieter: Kennys Bookshop and Art Galleries Ltd., Galway, GY, Irland

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. 2024. paperback. . . . . . Bestandsnummer des Verk�ufers V9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 46,57

W�hrung umrechnen

Versand: EUR 2,00

Von Irland nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: 1 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark: A practical guide helping you build streaming and batch pipelines that process massive amounts of data using Scala

Eric Tome

Verlag: Packt Publishing Limited, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Paperback / softback

Print-on-Demand

Anbieter: THE SAINT BOOKSTORE, Southport, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Paperback / softback. Zustand: New. This item is printed on demand. New copy - Usually dispatched within 5-9 working days 526. Bestandsnummer des Verk�ufers C9781804612583

Verk�ufer kontaktieren

Neu kaufen

EUR 44,02

W�hrung umrechnen

Versand: EUR 6,87

Von Vereinigtes K�nigreich nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala

Tome, Eric

Verlag: Packt Publishing, 2024

ISBN 10: 1804612588 ISBN 13: 9781804612583

Neu Softcover

Anbieter: GreatBookPrices, Columbia, MD, USA

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. Bestandsnummer des Verk�ufers 47248760-n

Verk�ufer kontaktieren

Neu kaufen

EUR 34,42

W�hrung umrechnen

Versand: EUR 17,42

Von USA nach Deutschland

Versandziele, Kosten & Dauer

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Es gibt 9 weitere Exemplare dieses Buches

Alle Suchergebnisse ansehen

Verwandte Artikel zu Data Engineering with Scala and Spark: Build streaming...

Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala - Softcover

Eric Tome; Rupam Bhattacharjee; David Radford

Inhaltsangabe

Key Features

Book Description

What you will learn

Who this book is for

Table of Contents

�ber die Autorinnen und Autoren

Gebraucht kaufen

Neu kaufen

Suchergebnisse f�r Data Engineering with Scala and Spark: Build streaming...

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Neu kaufen

Es gibt 9 weitere Exemplare dieses Buches