Cargando…

Sharing data and models in software engineering /

Data Science for Software Engineering: Sharing Data and Models presents guidance and procedures for reusing data and models between projects to produce results that are useful and relevant. Starting with a background section of practical lessons and warnings for beginner data scientists for software...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Menzies, Tim (Autor)
Otros Autores: Rogers, Mark (Diseñador)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Waltham, Massachusetts : Morgan Kaufmann, [2015]
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000 i 4500
001 SCIDIR_ocn896901265
003 OCoLC
005 20231120111916.0
006 m o d
007 cr |||||||||||
008 141021t20152015maua ob 001 0 eng d
040 |a UKMGB  |b eng  |e rda  |e pn  |c UKMGB  |d OCLCO  |d N$T  |d OPELS  |d YDXCP  |d COO  |d N$T  |d OCLCF  |d E7B  |d UV0  |d NAM  |d EBLCP  |d NLGGC  |d DEBSZ  |d B24X7  |d LIV  |d MERUC  |d U3W  |d D6H  |d INT  |d OTZ  |d OCLCQ  |d CUY  |d LOA  |d ZCU  |d ICG  |d K6U  |d COCUF  |d VT2  |d DKC  |d OCLCQ  |d LQU  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCLCO 
016 7 |a 016947800  |2 Uk 
016 7 |a 016945759  |2 Uk 
019 |a 899158463  |a 900889430  |a 1105189399  |a 1105573366 
020 |a 9780124173071  |q (electronic bk.) 
020 |a 0124173071  |q (electronic bk.) 
020 |z 9780124172951  |q (pbk.) 
020 |z 0124172954 
020 |z 9780124172951 
035 |a (OCoLC)896901265  |z (OCoLC)899158463  |z (OCoLC)900889430  |z (OCoLC)1105189399  |z (OCoLC)1105573366 
050 4 |a QA76.758 
072 7 |a COM  |x 051330  |2 bisacsh 
082 0 4 |a 005.1  |2 23 
245 0 0 |a Sharing data and models in software engineering /  |c Tim Menzies [and four others] ; designer, Mark Rogers. 
264 1 |a Waltham, Massachusetts :  |b Morgan Kaufmann,  |c [2015] 
264 4 |c �2015 
300 |a 1 online resource (xxvii, 386 pages) :  |b illustrations (some color) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
504 |a Includes bibliographical references (pages 357-378) and index. 
588 0 |a Online resource; title from PDF title page (EBSCO; viewed on March 18, 2015). 
520 |a Data Science for Software Engineering: Sharing Data and Models presents guidance and procedures for reusing data and models between projects to produce results that are useful and relevant. Starting with a background section of practical lessons and warnings for beginner data scientists for software engineering, this edited volume proceeds to identify critical questions of contemporary software engineering related to data and models. Learn how to adapt data from other organizations to local problems, mine privatized data, prune spurious information, simplify complex results, how to update models for new platforms, and more. Chapters share largely applicable experimental results discussed with the blend of practitioner focused domain expertise, with commentary that highlights the methods that are most useful, and applicable to the widest range of projects. Each chapter is written by a prominent expert and offers a state-of-the-art solution to an identified problem facing data scientists in software engineering. Throughout, the editors share best practices collected from their experience training software engineering students and practitioners to master data science, and highlight the methods that are most useful, and applicable to the widest range of projects. 
505 0 |a Front Cover; Sharing Data and Models in Software Engineering; Copyright; Why this book?; Foreword; Contents; List of Figures; Chapter 1: Introduction; 1.1 Why Read This Book?; 1.2 What Do We Mean by ``Sharing''?; 1.2.1 Sharing Insights; 1.2.2 Sharing Models; 1.2.3 Sharing Data; 1.2.4 Sharing Analysis Methods; 1.2.5 Types of Sharing; 1.2.6 Challenges with Sharing; 1.2.7 How to Share; 1.3 What? (Our Executive Summary); 1.3.1 An Overview; 1.3.2 More Details; 1.4 How to Read This Book; 1.4.1 Data Analysis Patterns; 1.5 But What About ...? (What Is Not in This Book); 1.5.1 What About ``Big Data''? 
505 8 |a 1.5.2 What About Related Work?1.5.3 Why All the Defect Prediction and Effort Estimation?; 1.6 Who? (About the Authors); 1.7 Who Else? (Acknowledgments); Part I: Data Mining for Managers; Chapter 2: Rules for Managers; 2.1 The Inductive Engineering Manifesto; 2.2 More Rules; Chapter 3: Rule #1: Talk to the Users; 3.1 Users Biases; 3.2 Data Mining Biases; 3.3 Can We Avoid Bias?; 3.4 Managing Biases; 3.5 Summary; Chapter 4: Rule #2: Know the Domain; 4.1 Cautionary Tale #1: ``Discovering'' Random Noise; 4.2 Cautionary Tale #2: Jumping at Shadows; 4.3 Cautionary Tale #3: It Pays to Ask. 
505 8 |a 4.4 SummaryChapter 5: Rule #3: Suspect Your Data; 5.1 Controlling Data Collection; 5.2 Problems with Controlled Data Collection; 5.3 Rinse (and Prune) Before Use; 5.3.1 Row Pruning; 5.3.2 Column Pruning; 5.4 On the Value of Pruning; 5.5 Summary; Chapter 6: Rule #4: Data Science Is Cyclic; 6.1 The Knowledge Discovery Cycle; 6.2 Evolving Cyclic Development; 6.2.1 Scouting; 6.2.2 Surveying; 6.2.3 Building; 6.2.4 Effort; 6.3 Summary; Part II: Data Mining: A Technical Tutorial; Chapter 7: Data Mining and SE; 7.1 Some Definitions; 7.2 Some Application Areas; Chapter 8: Defect Prediction. 
505 8 |a 8.1 Defect Detection Economics8.2 Static Code Defect Prediction; 8.2.1 Easy to Use; 8.2.2 Widely Used; 8.2.3 Useful; Chapter 9: Effort Estimation; 9.1 The Estimation Problem; 9.2 How to Make Estimates; 9.2.1 Expert-Based Estimation; 9.2.2 Model-Based Estimation; 9.2.3 Hybrid Methods; Chapter 10: Data Mining (Under the Hood); 10.1 Data Carving; 10.2 About the Data; 10.3 Cohen Pruning; 10.4 Discretization; 10.4.1 Other Discretization Methods; 10.5 Column Pruning; 10.6 Row Pruning; 10.7 Cluster Pruning; 10.7.1 Advantages of Prototypes; 10.7.2 Advantages of Clustering; 10.8 Contrast Pruning. 
505 8 |a 10.9 Goal Pruning10.10 Extensions for Continuous Classes; 10.10.1 How RTs Work; 10.10.2 Creating Splits for Categorical Input Features; 10.10.3 Splits on Numeric Input Features; 10.10.4 Termination Condition and Predictions; 10.10.5 Potential Advantages of RTs for Software Effort Estimation; 10.10.6 Predictions for Multiple Numeric Goals; Part III: Sharing Data; Chapter 11: Sharing Data: Challenges and Methods; 11.1 Houston, We Have a Problem; 11.2 Good News, Everyone; Chapter 12: Learning Contexts; 12.1 Background; 12.2 Manual Methods for Contextualization; 12.3 Automatic Methods. 
650 0 |a Software engineering. 
650 0 |a Data structures (Computer science) 
650 6 |a G�enie logiciel.  |0 (CaQQLa)201-0121595 
650 6 |a Structures de donn�ees (Informatique)  |0 (CaQQLa)201-0051595 
650 7 |a COMPUTERS  |x Software Development & Engineering  |x Quality Assurance & Testing.  |2 bisacsh 
650 7 |a Data structures (Computer science)  |2 fast  |0 (OCoLC)fst00887978 
650 7 |a Software engineering  |2 fast  |0 (OCoLC)fst01124185 
700 1 |a Menzies, Tim,  |e author. 
700 1 |a Rogers, Mark,  |e designer. 
776 0 8 |i Print version:  |z 9780124172951 
856 4 0 |u https://sciencedirect.uam.elogim.com/science/book/9780124172951  |z Texto completo