|
|
|
|
LEADER |
00000cam a22000007a 4500 |
001 |
OR_on1302703427 |
003 |
OCoLC |
005 |
20231017213018.0 |
006 |
m o d |
007 |
cr cn||||||||| |
008 |
130821s2022 xx o 000 0 eng d |
040 |
|
|
|a TAC
|b eng
|c TAC
|d ORMDA
|d EBLCP
|d OCLCQ
|d OCLCO
|
020 |
|
|
|z 9781098107963
|
024 |
8 |
|
|a 9781098107956
|
035 |
|
|
|a (OCoLC)1302703427
|
037 |
|
|
|a 9781098107956
|b O'Reilly Media
|
050 |
|
4 |
|a Q325.5
|
082 |
0 |
4 |
|a 006.3/1
|2 23/eng/20220525
|
049 |
|
|
|a UAMI
|
100 |
1 |
|
|a Huyen, Chip,
|e author.
|
245 |
1 |
0 |
|a Designing Machine Learning Systems
|h [electronic resource] /
|c Chip Huyen.
|
250 |
|
|
|a 1st edition.
|
264 |
|
1 |
|b O'Reilly Media, Inc.,
|c 2022.
|
300 |
|
|
|a 1 online resource (350 pages)
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
347 |
|
|
|a text file
|
365 |
|
|
|b 59.99
|
520 |
|
|
|a Many tutorials show you how to develop ML systems from ideation to deployed models. But with constant changes in tooling, those systems can quickly become outdated. Without an intentional design to hold the components together, these systems will become a technical liability, prone to errors and be quick to fall apart. In this book, Chip Huyen provides a framework for designing real-world ML systems that are quick to deploy, reliable, scalable, and iterative. These systems have the capacity to learn from new data, improve on past mistakes, and adapt to changing requirements and environments. Youâ??ll learn everything from project scoping, data management, model development, deployment, and infrastructure to team structure and business analysis. Learn the challenges and requirements of an ML system in production Build training data with different sampling and labeling methods Leverage best techniques to engineer features for your ML models to avoid data leakage Select, develop, debug, and evaluate ML models that are best suit for your tasks Deploy different types of ML systems for different hardware Explore major infrastructural choices and hardware designs Understand the human side of ML, including integrating ML into business, user experience, and team structure.
|
542 |
|
|
|f Copyright © 2022 Huyen Thi Khanh Nguyen
|
550 |
|
|
|a Made available through: Safari, an O'Reilly Media Company.
|
588 |
0 |
|
|a Online resource; Title from title page (viewed June 25, 2022).
|
505 |
0 |
|
|a Cover -- Copyright -- Table of Contents -- Preface -- Who This Book Is For -- What This Book Is Not -- Navigating This Book -- GitHub Repository and Community -- Conventions Used in This Book -- Using Code Examples -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Chapter 1. Overview of Machine Learning Systems -- When to Use Machine Learning -- Machine Learning Use Cases -- Understanding Machine Learning Systems -- Machine Learning in Research Versus in Production -- Machine Learning Systems Versus Traditional Software -- Summary
|
505 |
8 |
|
|a Chapter 2. Introduction to Machine Learning Systems Design -- Business and ML Objectives -- Requirements for ML Systems -- Reliability -- Scalability -- Maintainability -- Adaptability -- Iterative Process -- Framing ML Problems -- Types of ML Tasks -- Objective Functions -- Mind Versus Data -- Summary -- Chapter 3. Data Engineering Fundamentals -- Data Sources -- Data Formats -- JSON -- Row-Major Versus Column-Major Format -- Text Versus Binary Format -- Data Models -- Relational Model -- NoSQL -- Structured Versus Unstructured Data -- Data Storage Engines and Processing
|
505 |
8 |
|
|a Transactional and Analytical Processing -- ETL: Extract, Transform, and Load -- Modes of Dataflow -- Data Passing Through Databases -- Data Passing Through Services -- Data Passing Through Real-Time Transport -- Batch Processing Versus Stream Processing -- Summary -- Chapter 4. Training Data -- Sampling -- Nonprobability Sampling -- Simple Random Sampling -- Stratified Sampling -- Weighted Sampling -- Reservoir Sampling -- Importance Sampling -- Labeling -- Hand Labels -- Natural Labels -- Handling the Lack of Labels -- Class Imbalance -- Challenges of Class Imbalance -- Handling Class Imbalance
|
505 |
8 |
|
|a Data Augmentation -- Simple Label-Preserving Transformations -- Perturbation -- Data Synthesis -- Summary -- Chapter 5. Feature Engineering -- Learned Features Versus Engineered Features -- Common Feature Engineering Operations -- Handling Missing Values -- Scaling -- Discretization -- Encoding Categorical Features -- Feature Crossing -- Discrete and Continuous Positional Embeddings -- Data Leakage -- Common Causes for Data Leakage -- Detecting Data Leakage -- Engineering Good Features -- Feature Importance -- Feature Generalization -- Summary
|
505 |
8 |
|
|a Chapter 6. Model Development and Offline Evaluation -- Model Development and Training -- Evaluating ML Models -- Ensembles -- Experiment Tracking and Versioning -- Distributed Training -- AutoML -- Model Offline Evaluation -- Baselines -- Evaluation Methods -- Summary -- Chapter 7. Model Deployment and Prediction Service -- Machine Learning Deployment Myths -- Myth 1: You Only Deploy One or Two ML Models at a Time -- Myth 2: If We Don't Do Anything, Model Performance Remains the Same -- Myth 3: You Won't Need to Update Your Models as Much
|
590 |
|
|
|a O'Reilly
|b O'Reilly Online Learning: Academic/Public Library Edition
|
650 |
|
0 |
|a Machine learning
|x Development.
|
650 |
|
6 |
|a Apprentissage automatique
|x Développement.
|
710 |
2 |
|
|a Safari, an O'Reilly Media Company.
|
856 |
4 |
0 |
|u https://learning.oreilly.com/library/view/~/9781098107956/?ar
|z Texto completo (Requiere registro previo con correo institucional)
|
938 |
|
|
|a ProQuest Ebook Central
|b EBLB
|n EBL6989361
|
994 |
|
|
|a 92
|b IZTAP
|