Code: 52770465

AI Inference Optimization Engineering

Name: AI Inference Optimization Engineering
Brand: Independently published
SKU: 52770465
Price: 247.00 CZK
Availability: InStock
Author: ChatVariety Team
ISBN: 9798199720021

Pre-order
New release

Language: English
Binding: Paperback
Pages: 96
Publisher: Independently published, 2026

In stock at supplier
07.06.2026

Book description

Slash LLM Deployment Costs and Latency

Deploying Large Language Models (LLMs) in production is a massive economic and engineering hurdle. AI Inference Optimization Engineering is your comprehensive, hands-on guide to mastering the full stack of modern LLM optimization techniques. From memory-bandwidth solutions to hardware-specific compilation, this book bridges the gap between research-level models and enterprise-grade execution.

What you will master inside this book:

Hardware-Aware Optimization: Dive deep into KV cache mechanics, autoregressive decoding, and GPU memory hierarchies to eliminate latency bottlenecks.
State-of-the-Art Quantization: Apply GPTQ and AWQ quantization and the GGUF model format to shrink large neural networks with minimal loss of accuracy.
Advanced Acceleration Methods: Implement speculative decoding (draft-model approaches as well as Medusa and EAGLE), PagedAttention, and FlashAttention to boost throughput by 2-3x.
Production-Grade Serving: Build ultra-low-latency deployment infrastructures using vLLM, Triton Inference Server, and continuous batching.
Cross-Platform Deployment: Optimize models for specific target hardware, including NVIDIA H100 (TensorRT-LLM), Apple Silicon (llama.cpp/Metal), and Qualcomm mobile/edge accelerators.
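
As a rough illustration of why the KV cache and quantization matter for memory, the sketch below estimates the size of a KV cache. It is not taken from the book: the function name `kv_cache_bytes` and the model configuration (32 layers, 8 KV heads, head dimension 128, 4096-token context) are hypothetical values chosen for the example. The formula assumes one key tensor and one value tensor are stored per layer for every token.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_elem=2):
    """Estimate KV cache size in bytes.

    The leading factor of 2 accounts for storing both keys and values.
    bytes_per_elem is 2 for FP16/BF16 and 1 for an 8-bit cache.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


# Hypothetical model configuration, for illustration only.
config = dict(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=4096)

fp16_bytes = kv_cache_bytes(**config, bytes_per_elem=2)
int8_bytes = kv_cache_bytes(**config, bytes_per_elem=1)

print(f"FP16 KV cache: {fp16_bytes / 2**30:.2f} GiB")  # 0.50 GiB
print(f"INT8 KV cache: {int8_bytes / 2**30:.2f} GiB")  # 0.25 GiB
```

The estimate scales linearly with batch size and context length, which is one reason cache layout and precision tend to dominate serving memory at long contexts.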

Whether you are an ML infrastructure engineer, an AI platform architect, or a technical leader looking to scale LLMs cost-effectively, this book provides the production-ready code, equations, and architectural patterns you need to build hyper-efficient AI pipelines.

Book details

Full title: AI Inference Optimization Engineering
Subtitle: Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment
Author: ChatVariety Team
Language: English
Binding: Paperback
Pages: 96
EAN: 9798199720021
ID: 52770465
Publisher: Independently published
Weight: 142 g
Dimensions: 229 × 152 × 5 mm
Publication date: 2 June 2026
