Parallel Computing for AI and ML Engineers: Build Scalable Deep Learning Systems with GPU Programming, Multi-GPU Training, and Production Workloads - Softcover

Holbrook, M.T

9798195370404: Parallel Computing for AI and ML Engineers: Build Scalable Deep Learning Systems with GPU Programming, Multi-GPU Training, and Production Workloads

Softcover

ISBN 13: 9798195370404

Verlag: Independently published, 2026

Alle Exemplare dieser ISBN-Ausgabe

0 Gebraucht

5 Neu

Von EUR 30,26

Stop Guessing. Start Building ML Systems That Actually Scale.

Most ML engineers learn GPU computing the hard way — through production failures, mysterious hangs, and models that take three times longer to train than they should. This book gives you the understanding and the tools to get it right the first time.

What This Book Covers

-GPU architecture internals: CUDA cores, warps, shared memory, and memory coalescing

-Writing and optimizing custom CUDA kernels in C++

-Data parallel, model parallel, and pipeline parallel training with PyTorch DDP and FSDP

-Multi-node training with NCCL, MPI, and InfiniBand

-Mixed precision training and gradient scaling

-ZeRO optimizer stages 1, 2, and 3 with DeepSpeed

-Custom DataLoader optimization and NVIDIA DALI

-Production model serving with Triton Inference Server

-Kubernetes deployment with GPU autoscaling

-Complete profiling workflows with Nsight and PyTorch Profiler

-Troubleshooting CUDA OOM, NCCL hangs, and NaN losses

-Capacity planning and hardware selection for real workloads

Who This Book Is For

This book is written for ML engineers, AI researchers, and software engineers working on deep learning infrastructure who want to move beyond single-GPU experiments and build systems that perform at scale. You should be comfortable with Python and have basic familiarity with PyTorch or TensorFlow. No prior CUDA experience required.

What Makes This Book Different

Every chapter includes complete, runnable code. Architecture diagrams show how components connect. Benchmark results come from real hardware measurements. The troubleshooting appendices address the exact errors that stop real training jobs. This is not a survey of techniques. It is a working engineer's guide to building production parallel ML systems.

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

Verlag: Independently published
Erscheinungsdatum: 2026
Sprache: Englisch
ISBN 13: 9798195370404
Einband: Taschenbuch
Anzahl der Seiten: 435
Kontakt zum Hersteller: Manufactured by Amazon on behalf of the author
https://www.amazon.de/hz/contact-us

c/o Amazon Media EU S.�.r.l., 38 Avenue John F. Kennedy
Luxembourg
L-1855
Luxemburg

Suchergebnisse f�r Parallel Computing for AI and ML Engineers: Build Scalable...

Beispielbild f�r diese ISBN

Parallel Computing for AI and ML Engineers: Build Scalable Deep Learning Systems with GPU Programming, Multi-GPU Training, and Production Workloads

Holbrook, M.T

Verlag: Independently published, 2026

ISBN 13: 9798195370404

Neu Softcover

Print-on-Demand

Anbieter: California Books, Miami, FL, USA

Verk�uferbewertung 4 von 5 Sternen

Zustand: New. Print on Demand. Bestandsnummer des Verk�ufers I-9798195370404

Verk�ufer kontaktieren

Neu kaufen

EUR 30,26

Versand gratis
Versand innerhalb von USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Parallel Computing for AI and ML Engineers: Build Scalable Deep Learning Systems with GPU Programming, Multi-GPU Training, and Production Workloads

Holbrook, M.T

Verlag: Independently published, 2026

ISBN 13: 9798195370404

Neu Softcover

Anbieter: Bluemindbooks, PACHECO, CA, USA

Verk�uferbewertung 5 von 5 Sternen

Zustand: New. New Book. Bestandsnummer des Verk�ufers NJ-INGR-9798195370404

Verk�ufer kontaktieren

Neu kaufen

EUR 34,11

Versand gratis
Versand innerhalb von USA

Anzahl: 1 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Parallel Computing for AI and ML Engineers

M T Holbrook

Verlag: Independently Published, 2026

ISBN 13: 9798195370404

Neu PAP

Print-on-Demand

Anbieter: PBShop.store US, Wood Dale, IL, USA

Verk�uferbewertung 5 von 5 Sternen

PAP. Zustand: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bestandsnummer des Verk�ufers L0-9798195370404

Verk�ufer kontaktieren

Neu kaufen

EUR 45,19

Versand gratis
Versand innerhalb von USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Parallel Computing for AI and ML Engineers

M T Holbrook

Verlag: Independently Published, 2026

ISBN 13: 9798195370404

Neu PAP

Print-on-Demand

Anbieter: PBShop.store UK, Fairford, GLOS, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

PAP. Zustand: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Bestandsnummer des Verk�ufers L0-9798195370404

Verk�ufer kontaktieren

Neu kaufen

EUR 39,02

EUR 7,83 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: Mehr als 20 verf�gbar

In den Warenkorb

Beispielbild f�r diese ISBN

Parallel Computing for AI and ML Engineers (Paperback)

M.T. Holbrook

Verlag: Independently Published, 2026

ISBN 13: 9798195370404

Neu Paperback

Print-on-Demand

Anbieter: CitiRetail, Stevenage, Vereinigtes K�nigreich

Verk�uferbewertung 5 von 5 Sternen

Paperback. Zustand: new. Paperback. Stop Guessing. Start Building ML Systems That Actually Scale.Most ML engineers learn GPU computing the hard way - through production failures, mysterious hangs, and models that take three times longer to train than they should. This book gives you the understanding and the tools to get it right the first time.What This Book Covers-GPU architecture internals: CUDA cores, warps, shared memory, and memory coalescing-Writing and optimizing custom CUDA kernels in C++-Data parallel, model parallel, and pipeline parallel training with PyTorch DDP and FSDP-Multi-node training with NCCL, MPI, and InfiniBand-Mixed precision training and gradient scaling-ZeRO optimizer stages 1, 2, and 3 with DeepSpeed-Custom DataLoader optimization and NVIDIA DALI-Production model serving with Triton Inference Server-Kubernetes deployment with GPU autoscaling-Complete profiling workflows with Nsight and PyTorch Profiler-Troubleshooting CUDA OOM, NCCL hangs, and NaN losses-Capacity planning and hardware selection for real workloadsWho This Book Is ForThis book is written for ML engineers, AI researchers, and software engineers working on deep learning infrastructure who want to move beyond single-GPU experiments and build systems that perform at scale. You should be comfortable with Python and have basic familiarity with PyTorch or TensorFlow. No prior CUDA experience required.What Makes This Book DifferentEvery chapter includes complete, runnable code. Architecture diagrams show how components connect. Benchmark results come from real hardware measurements. The troubleshooting appendices address the exact errors that stop real training jobs. This is not a survey of techniques. It is a working engineer's guide to building production parallel ML systems. This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Bestandsnummer des Verk�ufers 9798195370404

Verk�ufer kontaktieren

Neu kaufen

EUR 43,53

EUR 42,86 Versand
Versand von Vereinigtes K�nigreich nach USA

Anzahl: 1 verf�gbar

In den Warenkorb