Architecting Generative AI Applications : Build, deploy, and scale production-ready generative AI systems with LLMOps best practices

予約
ポイントキャンペーン

Architecting Generative AI Applications : Build, deploy, and scale production-ready generative AI systems with LLMOps best practices

Kuligin, Leonid

Packt Publishing Limited（2026/03発売）

ただいまウェブストアではご注文を受け付けておりません。 ⇒古書を探す

製本 Paperback:紙装版/ペーパーバック版
言語 ENG
商品コード 9781806678655

Full Description

Take generative AI from prototype to production with confidence, master core LLM architectures, rigorous evaluation (offline and A/B testing), LLMOps and deployment pipelines, and the reliability practices that keep systems stable, secure, and scalable in the real world.

Key Features

Turn generative AI prototypes into production-ready applications
Master LLM evaluation, observability, and reliability engineering
Deploy and scale AI systems using LLMOps and modern DevOps tools

Book DescriptionVibe-coding tools & coding assistants make it easy to spin up generative AI prototypes. Getting those prototypes into production is where most teams stall. This book is a practical guide to building production-ready generative AI applications that are reliable, scalable, and secure, and to understanding where traditional software best practices can clash with the realities of operating LLM-based systems.
Written by a Staff AI Engineer at Google, it takes you through the full AI product lifecycle: scoping and building effective prototypes, aligning them with business goals, and scaling enterprise-wide generative AI adoption. You will learn how to evaluate LLMs with offline metrics, human-in-the-loop methods, and statistical testing. Next, you will design core architectures such as RAG, vector databases, agents, and memory systems. Next, operationalize these systems with production-grade code, robust testing, DevOps, MLOps, and LLMOps workflows, including deployment and scaling on modern LLMOps platforms. The book also covers security, Responsible AI, and modern observability and reliability for generative AI systems. By the end you'll learn how to run post-launch A/B tests, maintain systems over time, and measure business impact. The focus is on durable engineering principles, so your products succeed beyond the prototype stage.What you will learn

Design offline and online evaluation strategies (including statistical A/B testing) and collect the right data
Convert AI prototypes into production-ready applications that are stable, scalable, & secure
Reduce maintenance effort with best practices in testing, configuration, and code readability
Implement DevOps, MLOps, and LLMOps—what's common and what differs across these approaches for AI systems
Build platform teams to scale enterprise-wide generative AI adoption
Define reliability targets using SRE principles and statistical A/B testing

Who this book is forThis book is for technical leaders, AI engineers, data scientists, software engineers, and architects building generative AI applications. It is also ideal for engineering managers, product leaders, and technical decision-makers who need to understand how to deploy, scale, and maintain production-grade AI systems.

Table of Contents

Building a prototype
Evaluation
Key architectures
From a prototype to production
Devops, LLMops and other ops
Deployments
Ethics and security
Observability and reliability
Maintenance
A/B testing