LIBRISTO

Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

Language English
E-book Adobe ePub DRM
Publishers O'Reilly Media, April 2026
71.89 VAT included
In stock Immediate digital delivery


Customers also purchased


Beyond Vibe Coding, Addy Osmani / E-book Adobe ePub DRM
63.29
Steal Like an Artist, Austin Kleon / Book Paperback
12.39
Bayesian Analysis with Python, Osvaldo Martin / Book Paperback
58.29
Bayesian Statistics The Fun Way, Will Kurt / Book Paperback
31.99
Krótka historia Europy, Simon Jenkins / Book Hardback
15.69
Dona nobis pacem, Klaus Heizmann / Book Paperback
23.59

Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era. Without proper optimization, however, LLMs can be expensive and slow to serve. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

In this hands-on, engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, performant, and cost-efficient AI token factories. Whether you're building the LLM inference infrastructure or the applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI transforms how we work and build.

Learn the foundations of model serving with core concepts, design paradigms, and industry best practices
Understand the common challenges of hosting LLMs at scale
Balance latency and throughput to meet the demands of AI applications and business requirements
Host LLMs cost-effectively with practical, code-backed techniques


About the book

Full name Hands-On LLM Serving and Optimization
Language English
Binding E-book - Adobe ePub DRM
Date of issue 2026
EAN 9798341621466
Libristo code 52376258
Publishers O'Reilly Media

You might also be interested in


The Dancing Partner, Jerome K Jerome / Book Paperback
16.49
Do You Really Want to Meet Velociraptor?, Annette Bay Pimentel / Book Hardback
41.29
Christianity the Religion of Nature, Andrew P. Peabody / Book Paperback
20.59
Bayesian Analysis with Python, Osvaldo Martin / Book Paperback
51.39
