Hands-On LLM Serving and Optimization: Hosting LLMs at Scale - Softcover

Wang, Chi ; Hu, Peiheng

ISBN 13: 9798341621497

Publisher: O'Reilly Media, 2026

All copies of this ISBN edition

2 Used

From EUR 56.60

16 New

From EUR 52.00

As the demand for real-time AI applications grows, along comes this comprehensive guide to the complexities of deploying and optimizing LLMs at scale. The authors take a real-world approach backed by practical examples and code, and assemble essential strategies for designing infrastructures that are equal to the demands of modern AI applications.

The synopsis may refer to a different edition of this title.

About the Authors

Chi Wang is a director of engineering at Salesforce's Einstein AI group, with over 18 years of experience in artificial intelligence and distributed systems. He leads the development of large-scale AI platforms that enable model training, inference, and optimization for hundreds of internal teams and power AI capabilities used by millions of Salesforce customers. At Salesforce, Chi oversees multiple engineering teams focused on model inference and optimization and on data science platforms. His work spans building multi-tenant AI infrastructure, scaling distributed compute systems, and improving the performance and cost-efficiency of large language model workloads in production. Chi is the lead inventor on 12 patents across areas including model serving and optimization, data access control, and large-scale system design. He is also a passionate technical writer, focused on making complex AI systems practical and accessible for engineers.

Peiheng Hu is an accomplished machine learning engineer with over 10 years of industry experience and expertise in building large-scale AI systems. He currently works at NVIDIA, where he focuses on cutting-edge distributed LLM inference, pushing the boundaries of high-performance inference engines on the latest NVIDIA GPUs. He holds a master of science in computational science and engineering from Harvard University and a bachelor of science in industrial engineering operations research from Georgia Institute of Technology. Previously, Peiheng served as a principal member of technical staff at Salesforce, where he led the development of the company's only unified serving platform, handling thousands of per-tenant models and LLM optimizations for Agentforce that saved millions in AI infrastructure expenses. Prior to that, he was a senior ML engineer at Microsoft Azure, where he architected distributed ML processing solutions for cloud security detection and analytics, handling billions of transactions per hour.

"About this title" may refer to a different edition of this title.

Publisher: O'Reilly Media
Publication date: 2026
Language: English
ISBN 13: 9798341621497
Binding: Paperback
Number of pages: 300
Manufacturer contact: O'Reilly
https://oreilly.com

1005 Gravenstein Highway North
Sebastopol
CA
95472
USA
Responsible person: Not available

Buy used

Condition: Like new

Unread book in perfect condition...

EUR 56.60

EUR 2.32 shipping
Shipping within the USA

Buy new

EUR 52.00

EUR 2.32 shipping
Shipping within the USA

Search results for Hands-On LLM Serving and Optimization: Hosting LLMs...

Hands-On LLM Serving and Optimization

Wang, Chi; Hu, Peiheng

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

New Softcover

Seller: GreatBookPrices, Columbia, MD, USA

Seller rating: 5 out of 5 stars

Condition: New. Seller inventory number 51932241-n

Buy new

EUR 52.00

EUR 2.32 shipping
Shipping within the USA

Quantity: More than 20 available

Hands-On LLM Serving and Optimization

Chi Wang

Publisher: O'Reilly, 2026

ISBN 13: 9798341621497

New PAP

Seller: PBShop.store US, Wood Dale, IL, USA

Seller rating: 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller inventory number WO-9798341621497

Buy new

EUR 54.40

Free shipping
Shipping within the USA

Quantity: 15 available

Hands-On LLM Serving and Optimization

Chi Wang

Publisher: O'Reilly, 2026

ISBN 13: 9798341621497

New PAP

Seller: PBShop.store UK, Fairford, GLOS, United Kingdom

Seller rating: 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller inventory number WO-9798341621497

Buy new

EUR 50.47

EUR 5.86 shipping
Shipping from the United Kingdom to the USA

Quantity: 15 available

Hands-On LLM Serving and Optimization

Wang, Chi; Hu, Peiheng

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

Used Softcover

Seller: GreatBookPrices, Columbia, MD, USA

Seller rating: 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller inventory number 51932241

Buy used

EUR 56.60

EUR 2.32 shipping
Shipping within the USA

Quantity: More than 20 available

Hands-On LLM Serving and Optimization: Hosting LLMs at Scale

Wang, Chi; Hu, Peiheng

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

New Softcover

Seller: California Books, Miami, FL, USA

Seller rating: 4 out of 5 stars

Condition: New. Seller inventory number I-9798341621497

Buy new

EUR 59.70

Free shipping
Shipping within the USA

Quantity: More than 20 available

Hands-On LLM Serving and Optimization: Hosting LLMs at Scale

Chi Wang, Peiheng Hu

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

New Paperback

Seller: Rarewaves USA, OSWEGO, IL, USA

Seller rating: 5 out of 5 stars

Paperback. Condition: New. Seller inventory number LU-9798341621497

Buy new

EUR 61.50

Free shipping
Shipping within the USA

Quantity: More than 20 available

Hands-On LLM Serving and Optimization (eng)

Wang, Chi

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

New Softcover

Seller: Brook Bookstore On Demand, Napoli, NA, Italy

Seller rating: 5 out of 5 stars

Condition: New. Seller inventory number ZXH85SGQ1S

Buy new

EUR 56.24

EUR 6.80 shipping
Shipping from Italy to the USA

Quantity: More than 20 available

Hands-On LLM Serving and Optimization (Paperback)

Chi Wang

Publisher: O'Reilly Media, Sebastopol, 2026

ISBN 13: 9798341621497

New Paperback

Print on demand

Seller: Grand Eagle Retail, Bensenville, IL, USA

Seller rating: 5 out of 5 stars

Paperback. Condition: New.

Large language models (LLMs) are rapidly becoming the backbone of AI-driven applications. Without proper optimization, however, LLMs can be expensive to run, slow to serve, and prone to performance bottlenecks. As the demand for real-time AI applications grows, along comes Hands-On Serving and Optimizing LLM Models, a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

In this hands-on book, authors Chi Wang and Peiheng Hu take a real-world approach backed by practical examples and code, and assemble essential strategies for designing robust infrastructures that are equal to the demands of modern AI applications. Whether you're building high-performance AI systems or looking to enhance your knowledge of LLM optimization, this indispensable book will serve as a pillar of your success.

Learn the key principles for designing a model-serving system tailored to popular business scenarios. Understand the common challenges of hosting LLMs at scale while minimizing costs. Pick up practical techniques for optimizing LLM serving performance. Build a model-serving system that meets specific business requirements. Improve LLM serving throughput and reduce latency. Host LLMs in a cost-effective manner, balancing performance and resource efficiency.

This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller inventory number 9798341621497

Buy new

EUR 64.32

Free shipping
Shipping within the USA

Quantity: 1 available

Hands-On LLM Serving and Optimization

Chi Wang, Peiheng Hu

Publisher: O'Reilly Media, US, 2026

ISBN 13: 9798341621497

New Paperback

Seller: Rarewaves.com USA, London, LONDO, United Kingdom

Seller rating: 5 out of 5 stars

Paperback. Condition: New.

Large language models (LLMs) are rapidly becoming the backbone of AI-driven applications. Without proper optimization, however, LLMs can be expensive to run, slow to serve, and prone to performance bottlenecks. As the demand for real-time AI applications grows, along comes Hands-On Serving and Optimizing LLM Models, a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.

In this hands-on book, authors Chi Wang and Peiheng Hu take a real-world approach backed by practical examples and code, and assemble essential strategies for designing robust infrastructures that are equal to the demands of modern AI applications. Whether you're building high-performance AI systems or looking to enhance your knowledge of LLM optimization, this indispensable book will serve as a pillar of your success.

Learn the key principles for designing a model-serving system tailored to popular business scenarios. Understand the common challenges of hosting LLMs at scale while minimizing costs. Pick up practical techniques for optimizing LLM serving performance. Build a model-serving system that meets specific business requirements. Improve LLM serving throughput and reduce latency. Host LLMs in a cost-effective manner, balancing performance and resource efficiency.

Seller inventory number LU-9798341621497

Buy new

EUR 68.53

Free shipping
Shipping from the United Kingdom to the USA

Quantity: 5 available

Hands-On LLM Serving and Optimization

Wang, Chi; Hu, Peiheng

Publisher: O'Reilly Media, 2026

ISBN 13: 9798341621497

New Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating: 5 out of 5 stars

Condition: New. Seller inventory number 51932241-n

Buy new

EUR 51.59

EUR 17.50 shipping
Shipping from the United Kingdom to the USA

Quantity: More than 20 available

There are 8 more copies of this book
