Cargando…

The site reliability workbook : practical ways to implement SRE /

An expansion on the understanding of Google SRE, providing 'worked examples' for each essential facet of this area of IT prepared in co-operation with Google cloud customers based on their experiences. Instructs on methodology for running services at scale and starting SRE in greenfield or...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Otros Autores: Beyer, Betsy (Editor ), Murphy, Niall Richard, Rensin, David K., Kawahara, Kent, Thorne, Stephen
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Sebastopol, CA : O'Reilly Media : O'Reilly Media, 2018.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_on1046634047
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cnu|||unuuu
008 180730s2018 cau ob 001 0 eng d
040 |a N$T  |b eng  |e rda  |e pn  |c N$T  |d N$T  |d YDX  |d UMI  |d EBLCP  |d STF  |d TOH  |d OCLCF  |d TEFOD  |d MERER  |d CEF  |d G3B  |d OCLCQ  |d UAB  |d ESU  |d UKAHL  |d MNW  |d VT2  |d C6I  |d OCLCQ  |d OCLCO  |d OCLCQ 
015 |a GBB8C0181  |2 bnb 
016 7 |a 018921957  |2 Uk 
019 |a 1047556026  |a 1047781453  |a 1048260138  |a 1126089233  |a 1129333213  |a 1202551932  |a 1240528906 
020 |a 9781492029472  |q (electronic bk.) 
020 |a 1492029475  |q (electronic bk.) 
020 |a 9781492029458  |q (electronic bk.) 
020 |a 1492029459  |q (electronic bk.) 
020 |z 9781492029502 
020 |z 1492029505 
029 1 |a AU@  |b 000066134268 
029 1 |a GBVCP  |b 1029873216 
035 |a (OCoLC)1046634047  |z (OCoLC)1047556026  |z (OCoLC)1047781453  |z (OCoLC)1048260138  |z (OCoLC)1126089233  |z (OCoLC)1129333213  |z (OCoLC)1202551932  |z (OCoLC)1240528906 
037 |a CL0500000984  |b Safari Books Online 
037 |a AB525548-CC4F-45AB-BFDA-B1B70F0CC87B  |b OverDrive, Inc.  |n http://www.overdrive.com 
050 4 |a T58.64 
072 7 |a BUS  |x 082000  |2 bisacsh 
072 7 |a BUS  |x 041000  |2 bisacsh 
072 7 |a BUS  |x 042000  |2 bisacsh 
072 7 |a BUS  |x 085000  |2 bisacsh 
082 0 4 |a 658.4038  |2 23 
049 |a UAMI 
245 0 4 |a The site reliability workbook :  |b practical ways to implement SRE /  |c edited by Betsy Beyer [and 4 others]. 
264 1 |a Sebastopol, CA :  |b O'Reilly Media :  |b O'Reilly Media,  |c 2018. 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Online resource; title from PDF title page (EBSCO, viewed August 1, 2018). 
504 |a Includes bibliographical references and index. 
505 0 |a How SRE relates to DevOps -- Foundations. Implementing SLOs -- SLO engineering case studies -- Alerting on SLOs -- Eliminating toil -- Simplicity -- Practices. On-call -- Incident response -- Postmortem culture: learning from failure -- Managing load -- Introducing non-abstract large system design -- Data processing pipelines -- Configuration design and best practices -- Configuration specifics -- Canarying releases -- Processes. Identifying and recovering from overload -- SRE engagement model -- SRE: reaching beyond your walls -- SRE team lifecycles -- Organizational change management in SRE -- A. Example SLO document -- B. Example error budget policy -- C. Results of postmortem analysis. 
505 0 |a Intro; Copyright; Table of Contents; Foreword I; Foreword II; Preface; Conventions Used in This Book; Using Code Examples; O'Reilly Safari; How to Contact Us; Acknowledgments; Chapter 1. How SRE Relates to DevOps; Background on DevOps; No More Silos; Accidents Are Normal; Change Should Be Gradual; Tooling and Culture Are Interrelated; Measurement Is Crucial; Background on SRE; Operations Is a Software Problem; Manage by Service Level Objectives (SLOs); Work to Minimize Toil; Automate This Year's Job Away; Move Fast by Reducing the Cost of Failure; Share Ownership with Developers 
505 8 |a Use the Same Tooling, Regardless of Function or Job TitleCompare and Contrast; Organizational Context and Fostering Successful Adoption; Narrow, Rigid Incentives Narrow Your Success; It's Better to Fix It Yourself; Don't Blame Someone Else; Consider Reliability Work as a Specialized Role; When Can Substitute for Whether; Strive for Parity of Esteem: Career and Financial; Conclusion; Part I. Foundations; Chapter 2. Implementing SLOs; Why SREs Need SLOs; Getting Started; Reliability Targets and Error Budgets; What to Measure: Using SLIs; A Worked Example 
505 8 |a Moving from SLI Specification to SLI ImplementationMeasuring the SLIs; Using the SLIs to Calculate Starter SLOs; Choosing an Appropriate Time Window; Getting Stakeholder Agreement; Establishing an Error Budget Policy; Documenting the SLO and Error Budget Policy; Dashboards and Reports; Continuous Improvement of SLO Targets; Improving the Quality of Your SLO; Decision Making Using SLOs and Error Budgets; Advanced Topics; Modeling User Journeys; Grading Interaction Importance; Modeling Dependencies; Experimenting with Relaxing Your SLOs; Conclusion; Chapter 3. SLO Engineering Case Studies 
505 8 |a Evernote's SLO StoryWhy Did Evernote Adopt the SRE Model?; Introduction of SLOs: A Journey in Progress; Breaking Down the SLO Wall Between Customer and Cloud Provider; Current State; The Home Depot's SLO Story; The SLO Culture Project; Our First Set of SLOs; Evangelizing SLOs; Automating VALET Data Collection; The Proliferation of SLOs; Applying VALET to Batch Applications; Using VALET in Testing; Future Aspirations; Summary; Conclusion; Chapter 4. Monitoring; Desirable Features of a Monitoring Strategy; Speed; Calculations; Interfaces; Alerts; Sources of Monitoring Data; Examples 
505 8 |a Managing Your Monitoring SystemTreat Your Configuration as Code; Encourage Consistency; Prefer Loose Coupling; Metrics with Purpose; Intended Changes; Dependencies; Saturation; Status of Served Traffic; Implementing Purposeful Metrics; Testing Alerting Logic; Conclusion; Chapter 5. Alerting on SLOs; Alerting Considerations; Ways to Alert on Significant Events; 1: Target Error Rate ≥ SLO Threshold; 2: Increased Alert Window; 3: Incrementing Alert Duration; 4: Alert on Burn Rate; 5: Multiple Burn Rate Alerts; 6: Multiwindow, Multi-Burn-Rate Alerts; Low-Traffic Services and Error Budget Alerting 
520 8 |a An expansion on the understanding of Google SRE, providing 'worked examples' for each essential facet of this area of IT prepared in co-operation with Google cloud customers based on their experiences. Instructs on methodology for running services at scale and starting SRE in greenfield or brownfield fashion.  |b In 2016, Google's Site Reliability Engineering book ignited an industry discussion on what it means to run production services today-and why reliability considerations are fundamental to service design. Now, Google engineers who worked on that bestseller introduce The Site Reliability Workbook, a hands-on companion that uses concrete examples to show you how to put SRE principles and practices to work in your environment. This new workbook not only combines practical examples from Google's experiences, but also provides case studies from Google's Cloud Platform customers who underwent this journey. Evernote, The Home Depot, The New York Times, and other companies outline hard-won experiences of what worked for them and what didn't. Dive into this workbook and learn how to flesh out your own SRE practice, no matter what size your company is. You'll learn:How to run reliable services in environments you don't completely control-like cloudPractical applications of how to create, monitor, and run your services via Service Level ObjectivesHow to convert existing ops teams to SRE-including how to dig out of operational overloadMethods for starting SRE from either greenfield or brownfield. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Information technology  |x Management. 
650 0 |a Reliability (Engineering) 
650 0 |a Computer engineering. 
650 6 |a Technologie de l'information  |x Gestion. 
650 6 |a Fiabilité. 
650 6 |a Ordinateurs  |x Conception et construction. 
650 7 |a BUSINESS & ECONOMICS  |x Industrial Management.  |2 bisacsh 
650 7 |a BUSINESS & ECONOMICS  |x Management.  |2 bisacsh 
650 7 |a BUSINESS & ECONOMICS  |x Management Science.  |2 bisacsh 
650 7 |a BUSINESS & ECONOMICS  |x Organizational Behavior.  |2 bisacsh 
650 7 |a Computer engineering.  |2 fast  |0 (OCoLC)fst00872078 
650 7 |a Information technology  |x Management.  |2 fast  |0 (OCoLC)fst00973112 
650 7 |a Reliability (Engineering)  |2 fast  |0 (OCoLC)fst01093646 
700 1 |a Beyer, Betsy,  |e editor. 
700 1 |a Murphy, Niall Richard. 
700 1 |a Rensin, David K. 
700 1 |a Kawahara, Kent. 
700 1 |a Thorne, Stephen. 
776 0 8 |i Print version:  |z 1492029505  |z 9781492029502  |w (OCoLC)1029786800 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781492029496/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH35094736 
938 |a Askews and Holts Library Services  |b ASKH  |n AH35067593 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5475462 
938 |a EBSCOhost  |b EBSC  |n 1857160 
938 |a YBP Library Services  |b YANK  |n 15616321 
994 |a 92  |b IZTAP