Cargando…

Designing Machine Learning Systems

Many tutorials show you how to develop ML systems from ideation to deployed models. But with constant changes in tooling, those systems can quickly become outdated. Without an intentional design to hold the components together, these systems will become a technical liability, prone to errors and be...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Huyen, Chip (Autor)
Autor Corporativo: Safari, an O'Reilly Media Company
Formato: Electrónico eBook
Idioma:Inglés
Publicado: O'Reilly Media, Inc., 2022.
Edición:1st edition.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a22000007a 4500
001 OR_on1302703427
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cn|||||||||
008 130821s2022 xx o 000 0 eng d
040 |a TAC  |b eng  |c TAC  |d ORMDA  |d EBLCP  |d OCLCQ  |d OCLCO 
020 |z 9781098107963 
024 8 |a 9781098107956 
035 |a (OCoLC)1302703427 
037 |a 9781098107956  |b O'Reilly Media 
050 4 |a Q325.5 
082 0 4 |a 006.3/1  |2 23/eng/20220525 
049 |a UAMI 
100 1 |a Huyen, Chip,  |e author. 
245 1 0 |a Designing Machine Learning Systems  |h [electronic resource] /  |c Chip Huyen. 
250 |a 1st edition. 
264 1 |b O'Reilly Media, Inc.,  |c 2022. 
300 |a 1 online resource (350 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
365 |b 59.99 
520 |a Many tutorials show you how to develop ML systems from ideation to deployed models. But with constant changes in tooling, those systems can quickly become outdated. Without an intentional design to hold the components together, these systems will become a technical liability, prone to errors and be quick to fall apart. In this book, Chip Huyen provides a framework for designing real-world ML systems that are quick to deploy, reliable, scalable, and iterative. These systems have the capacity to learn from new data, improve on past mistakes, and adapt to changing requirements and environments. Youâ??ll learn everything from project scoping, data management, model development, deployment, and infrastructure to team structure and business analysis. Learn the challenges and requirements of an ML system in production Build training data with different sampling and labeling methods Leverage best techniques to engineer features for your ML models to avoid data leakage Select, develop, debug, and evaluate ML models that are best suit for your tasks Deploy different types of ML systems for different hardware Explore major infrastructural choices and hardware designs Understand the human side of ML, including integrating ML into business, user experience, and team structure. 
542 |f Copyright © 2022 Huyen Thi Khanh Nguyen 
550 |a Made available through: Safari, an O'Reilly Media Company. 
588 0 |a Online resource; Title from title page (viewed June 25, 2022). 
505 0 |a Cover -- Copyright -- Table of Contents -- Preface -- Who This Book Is For -- What This Book Is Not -- Navigating This Book -- GitHub Repository and Community -- Conventions Used in This Book -- Using Code Examples -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Overview of Machine Learning Systems -- When to Use Machine Learning -- Machine Learning Use Cases -- Understanding Machine Learning Systems -- Machine Learning in Research Versus in Production -- Machine Learning Systems Versus Traditional Software -- Summary 
505 8 |a Chapter 2. Introduction to Machine Learning Systems Design -- Business and ML Objectives -- Requirements for ML Systems -- Reliability -- Scalability -- Maintainability -- Adaptability -- Iterative Process -- Framing ML Problems -- Types of ML Tasks -- Objective Functions -- Mind Versus Data -- Summary -- Chapter 3. Data Engineering Fundamentals -- Data Sources -- Data Formats -- JSON -- Row-Major Versus Column-Major Format -- Text Versus Binary Format -- Data Models -- Relational Model -- NoSQL -- Structured Versus Unstructured Data -- Data Storage Engines and Processing 
505 8 |a Transactional and Analytical Processing -- ETL: Extract, Transform, and Load -- Modes of Dataflow -- Data Passing Through Databases -- Data Passing Through Services -- Data Passing Through Real-Time Transport -- Batch Processing Versus Stream Processing -- Summary -- Chapter 4. Training Data -- Sampling -- Nonprobability Sampling -- Simple Random Sampling -- Stratified Sampling -- Weighted Sampling -- Reservoir Sampling -- Importance Sampling -- Labeling -- Hand Labels -- Natural Labels -- Handling the Lack of Labels -- Class Imbalance -- Challenges of Class Imbalance -- Handling Class Imbalance 
505 8 |a Data Augmentation -- Simple Label-Preserving Transformations -- Perturbation -- Data Synthesis -- Summary -- Chapter 5. Feature Engineering -- Learned Features Versus Engineered Features -- Common Feature Engineering Operations -- Handling Missing Values -- Scaling -- Discretization -- Encoding Categorical Features -- Feature Crossing -- Discrete and Continuous Positional Embeddings -- Data Leakage -- Common Causes for Data Leakage -- Detecting Data Leakage -- Engineering Good Features -- Feature Importance -- Feature Generalization -- Summary 
505 8 |a Chapter 6. Model Development and Offline Evaluation -- Model Development and Training -- Evaluating ML Models -- Ensembles -- Experiment Tracking and Versioning -- Distributed Training -- AutoML -- Model Offline Evaluation -- Baselines -- Evaluation Methods -- Summary -- Chapter 7. Model Deployment and Prediction Service -- Machine Learning Deployment Myths -- Myth 1: You Only Deploy One or Two ML Models at a Time -- Myth 2: If We Don't Do Anything, Model Performance Remains the Same -- Myth 3: You Won't Need to Update Your Models as Much 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Machine learning  |x Development. 
650 6 |a Apprentissage automatique  |x Développement. 
710 2 |a Safari, an O'Reilly Media Company. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781098107956/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6989361 
994 |a 92  |b IZTAP