LIBRISTO
LIBROAMANTO
mandatory
Become part of a community of book lovers from all over the world and get access to a whole bunch of benefits. Create an account for free
0
Austrian Post 5.49 DPD courier 3.99 DPD point 2.99

Parallel Computing for AI and ML Engineers

Build Scalable Deep Learning Systems with GPU Programming, Multi-GPU Training, and Production Workloads

Language EnglishEnglish
Book Paperback
Book Parallel Computing for AI and ML Engineers M.T Holbrook
Libristo code: 52269829
Publishers Independently published, May 2026
Stop Guessing. Start Building ML Systems That Actually Scale.Most ML engineers learn GPU computing t... Full description
? points 74 b New New
30.39 VAT included
In stock at our supplier Shipping in 9-15 days
Austria Delivery to Austria

30-day return policy

Stop Guessing. Start Building ML Systems That Actually Scale.

Most ML engineers learn GPU computing the hard way - through production failures, mysterious hangs, and models that take three times longer to train than they should. This book gives you the understanding and the tools to get it right the first time.

What This Book Covers

-GPU architecture internals: CUDA cores, warps, shared memory, and memory coalescing

-Writing and optimizing custom CUDA kernels in C++

-Data parallel, model parallel, and pipeline parallel training with PyTorch DDP and FSDP

-Multi-node training with NCCL, MPI, and InfiniBand

-Mixed precision training and gradient scaling

-ZeRO optimizer stages 1, 2, and 3 with DeepSpeed

-Custom DataLoader optimization and NVIDIA DALI

-Production model serving with Triton Inference Server

-Kubernetes deployment with GPU autoscaling

-Complete profiling workflows with Nsight and PyTorch Profiler

-Troubleshooting CUDA OOM, NCCL hangs, and NaN losses

-Capacity planning and hardware selection for real workloads

Who This Book Is For

This book is written for ML engineers, AI researchers, and software engineers working on deep learning infrastructure who want to move beyond single-GPU experiments and build systems that perform at scale. You should be comfortable with Python and have basic familiarity with PyTorch or TensorFlow. No prior CUDA experience required.

What Makes This Book Different

Every chapter includes complete, runnable code. Architecture diagrams show how components connect. Benchmark results come from real hardware measurements. The troubleshooting appendices address the exact errors that stop real training jobs. This is not a survey of techniques. It is a working engineer's guide to building production parallel ML systems.

Actress & Polyglot
EWA KASP for
Play video
Ewa Kasp
Libristo has the largest selection of foreign-language books. That’s why I buy my books there.

About the book

Full name Parallel Computing for AI and ML Engineers
Author M.T Holbrook
Language English
Binding Book - Paperback
Date of issue 2026
Number of pages 436
EAN 9798195370404
Libristo code 52269829
Weight 1006
Dimensions 216 x 280 x 22
Give this book today
It's easy
1 Add to cart and choose Deliver as present at the checkout 2 We'll send you a voucher 3 The book will arrive at the recipient's address

Login

Log in to your account. Don't have a Libristo account? Create one now!

 
mandatory
mandatory

Don’t have an account? Discover the benefits of having a Libristo account!

With a Libristo account, you'll have everything under control.

Create a Libristo account