Becoming a Rockstar SRE : Electrify your site reliability engineering mindset to build reliable, resilient, and efficient systems


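  • In stock at our partner overseas book distributor. Usually ships within 3 weeks.
    Important notes
    1. Delivery may occasionally be delayed, or the item may turn out to be unavailable.
    2. For orders of multiple copies, we ship everything together once the full quantity is available.
    3. We cannot accept requests for copies in pristine condition.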

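  • [Arrival delays]
    Due to the global situation, foreign and antiquarian foreign books ordered from overseas may arrive later than the standard delivery times shown.
    We apologize for the inconvenience and ask for your understanding in advance.
  • ◆ The cover, obi band, and other details shown in the image may differ from the actual item.
  • ◆ Web store prices for foreign books differ from the prices at our physical stores and other outlets.
    The price of a foreign book is the Japanese yen price at the time the order is confirmed.
    Price changes for the same book after the order is confirmed are not reflected in the order.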
  • Binding: Paperback, 420 p.
  • Language: ENG
  • Product code: 9781803239224
  • DDC classification: 620.00452

Full Description

Excel in site reliability engineering by learning from field-driven lessons on observability and reliability in code, architecture, process, systems management, costs, and people to minimize downtime and enhance developers' output
Purchase of the print or Kindle book includes a free eBook in the PDF format

Key Features

Understand the goals of an SRE in terms of reliability, efficiency, and constant improvement
Master highly resilient architecture in server, serverless, and containerized workloads
Learn the why and when of employing Kubernetes, GitHub, Prometheus, Grafana, Terraform, Python, Argo CD, and GitOps

Book Description

Site reliability engineering is all about continuous improvement, finding the balance between business and product demands while working within technological limitations to drive higher revenue. But quantifying and understanding reliability, handling resources, and meeting developer requirements can sometimes be overwhelming. With a focus on reliability from an infrastructure and coding perspective, Becoming a Rockstar SRE brings forth the site reliability engineer (SRE) persona using real-world examples.
This book will acquaint you with the role of an SRE, followed by the why and how of site reliability engineering. It walks you through the jobs of an SRE, from the automation of CI/CD pipelines and reducing toil to reliability best practices. You'll learn what creates bad code and how to circumvent it with reliable design and patterns. The book also guides you through interacting and negotiating with businesses and vendors on various technical matters and exploring observability, outages, and why and how to craft an excellent runbook. Finally, you'll learn how to elevate your site reliability engineering career, including certifications and interview tips and questions.
By the end of this book, you'll be able to identify and measure reliability, reduce downtime, troubleshoot outages, and enhance productivity to become a true rockstar SRE!

What you will learn

Get insights into the SRE role and its evolution, starting from Google's original vision
Understand the key terms, such as golden signals, SLO, SLI, MTBF, MTTR, and MTTD
Overcome the challenges in adopting site reliability engineering
Employ reliable architecture and deployments with serverless, containerization, and release strategies
Identify monitoring targets and determine observability strategy
Reduce toil and leverage root cause analysis to enhance efficiency and reliability
Realize how business decisions can impact quality and reliability

Who this book is for

This book is for IT professionals, including developers looking to advance into an SRE role, system administrators mastering technologies, and executives experiencing repeated downtime in their organizations. Anyone interested in bringing reliability and automation to their organization to drive down customer impact and revenue loss while increasing development throughput will find this book useful. A basic understanding of API and web architecture and some experience with cloud computing and services will assist with understanding the concepts covered.

Contents

SRE Job Role - Activities and Responsibilities
Fundamental Numbers - Reliability Statistics
Imperfect Habits - Duct Tape Architecture and Spaghetti Code
Essential Observability - Metrics, Events, Logs, and Traces (MELT)
Resolution Path - Master Troubleshooting
Operational Framework - Managing Infrastructure and Systems
Data Consumed - Observability Data Science
Reliable Architecture - Systems Strategy and Design
Valued Automation - Toil Discovery and Elimination
Exposing Pipelines - GitOps and Testing Essentials
Worker Bees - Orchestrations of Serverless, Containers, and Kubernetes
Final Exam - Tests and Capacity Planning
First Thing - Runbooks and Low Noise Outage Notifications
Rapid Response - Outage Management Techniques
Postmortem Candor - Long-Term Resolution
Chaos Injector - Advanced Systems Stability
Interview Advice - Hiring and Being Hired
Appendix A The Site Reliability Engineer Manifesto
Appendix B The 12-Factor App Questionnaire
