Statistical analysis techniques in particle physics : fits, density estimation and supervised learning

Modern analysis of high-energy physics (HEP) data requires advanced statistical tools to separate signal from background. This is the first book that focuses on machine learning techniques. It will be of interest to almost every high-energy physicist and, owing to its coverage, is suitable for students.

Bibliographic Details
Classification: Electronic Book
Main authors: Narsky, Ilya (Author), Porter, Frank Clifford (Author)
Format: Electronic eBook
Language: English
Published: Weinheim : Wiley-VCH, 2013.
Subjects:
Online access: Full text

MARC

LEADER 00000cam a2200000 i 4500
001 EBOOKCENTRAL_ocn861559452
003 OCoLC
005 20240329122006.0
006 m o d
007 cr |n|||||||||
008 131026t20142013gr a ob 001 0 eng d
040 |a EBLCP  |b eng  |e rda  |e pn  |c EBLCP  |d IDEBK  |d CUS  |d N$T  |d DG1  |d UKMGB  |d COO  |d YDXCP  |d OCLCF  |d OCLCQ  |d E7B  |d DEBSZ  |d DEBBG  |d OCLCQ  |d CDX  |d DG1  |d COCUF  |d DG1  |d CCO  |d LIP  |d PIFBY  |d ZCU  |d NRC  |d MERUC  |d OCLCQ  |d U3W  |d OCLCQ  |d UUM  |d STF  |d ICG  |d INT  |d VT2  |d AU@  |d OCLCQ  |d TKN  |d OCLCQ  |d DKC  |d OCLCQ  |d UKAHL  |d OCLCQ  |d VHC  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCLCL 
066 |c (S 
015 |a GBC073900  |2 bnb 
016 7 |a 016504754  |2 Uk 
016 7 |a 019807943  |2 Uk 
019 |a 864390976  |a 891395988  |a 961581593  |a 976499663  |a 1162438550  |a 1290102510  |a 1303500835 
020 |a 9783527677320  |q (electronic bk.) 
020 |a 3527677321  |q (electronic bk.) 
020 |a 9783527677313  |q (electronic bk.) 
020 |a 9783527677290  |q (electronic bk.) 
020 |a 3527677291  |q (electronic bk.) 
020 |a 9783527677306  |q (electronic bk.) 
020 |a 3527677305  |q (electronic bk.) 
020 |a 3527677313  |q (electronic bk.) 
020 |a 3527410864  |q (Paper) 
020 |a 9783527410866  |q (Paper) 
020 |a 9781306028868  |q (MyiLibrary) 
020 |a 1306028868  |q (MyiLibrary) 
020 |z 9783527410866 
029 1 |a AU@  |b 000055925530 
029 1 |a CHBIS  |b 010441951 
029 1 |a CHNEW  |b 000942450 
029 1 |a CHVBK  |b 48022739X 
029 1 |a DEBBG  |b BV041565186 
029 1 |a DEBBG  |b BV041906872 
029 1 |a DEBBG  |b BV044064219 
029 1 |a DEBSZ  |b 431544085 
029 1 |a DEBSZ  |b 449392406 
029 1 |a DEBSZ  |b 485042703 
029 1 |a GBVCP  |b 790210479 
029 1 |a NZ1  |b 15340156 
029 1 |a UKMGB  |b 019807943 
035 |a (OCoLC)861559452  |z (OCoLC)864390976  |z (OCoLC)891395988  |z (OCoLC)961581593  |z (OCoLC)976499663  |z (OCoLC)1162438550  |z (OCoLC)1290102510  |z (OCoLC)1303500835 
037 |a 534137  |b MIL 
050 4 |a QC174.8 
072 7 |a SCI  |x 024000  |2 bisacsh 
072 7 |a SCI  |x 041000  |2 bisacsh 
072 7 |a SCI  |x 055000  |2 bisacsh 
082 0 4 |a 530.4 
049 |a UAMI 
100 1 |a Narsky, Ilya,  |e author. 
245 1 0 |a Statistical analysis techniques in particle physics :  |b fits, density estimation and supervised learning /  |c Ilya Narsky and Frank C. Porter. 
264 1 |a Weinheim :  |b Wiley-VCH,  |c 2013. 
264 4 |c ©2014 
300 |a 1 online resource (xvii, 441 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
500 |a Includes index. 
520 |a Modern analysis of HEP data needs advanced statistical tools to separate signal from background. This is the first book which focuses on machine learning techniques. It will be of interest to almost every high energy physicist, and, due to its coverage, suitable for students. 
505 0 |6 880-01  |a Why We Wrote This Book and How You Should Read It -- Parametric Likelihood Fits -- Goodness of Fit -- Resampling Techniques -- Density Estimation -- Basic Concepts and Definitions of Machine Learning -- Data Preprocessing -- Linear Transformations and Dimensionality Reduction -- Introduction to Classification -- Assessing Classifier Performance -- Linear and Quadratic Discriminant Analysis, Logistic Regression, and Partial Least Squares Regression -- Neural Networks -- Local Learning and Kernel Expansion -- Decision Trees -- Ensemble Learning -- Reducing Multiclass to Binary -- How to Choose the Right Classifier for Your Analysis and Apply It Correctly -- Methods for Variable Ranking and Selection -- Bump Hunting in Multivariate Data -- Software Packages for Machine Learning -- Appendix A: Optimization Algorithms. 
504 |a Includes bibliographical references and index. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
650 0 |a Particles (Nuclear physics)  |x Statistical methods. 
650 0 |a Physics. 
650 0 |a Condensed matter. 
650 2 |a Physics 
650 6 |a Particules (Physique nucléaire)  |x Méthodes statistiques. 
650 6 |a Physique. 
650 6 |a Matière condensée. 
650 7 |a physics.  |2 aat 
650 7 |a SCIENCE  |x Energy.  |2 bisacsh 
650 7 |a SCIENCE  |x Mechanics  |x General.  |2 bisacsh 
650 7 |a SCIENCE  |x Physics  |x General.  |2 bisacsh 
650 7 |a Condensed matter  |2 fast 
650 7 |a Particles (Nuclear physics)  |x Statistical methods  |2 fast 
650 7 |a Physics  |2 fast 
650 7 |a Science.  |2 ukslc 
700 1 |a Porter, Frank Clifford,  |e author. 
758 |i has work:  |a Statistical analysis techniques in particle physics (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCG4yDgTXMVBR3QPpfMFKr3  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |a Narsky, Ilya.  |t Statistical Analysis Techniques in Particle Physics : Fits, Density Estimation and Supervised Learning.  |d Hoboken : Wiley, ©2013  |z 9783527410866  |w (OCoLC)863691504 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=1486350  |z Full text 
880 0 0 |6 505-01/(S  |g Machine generated contents note:  |g 1.  |t Why We Wrote This Book and How You Should Read It --  |g 2.  |t Parametric Likelihood Fits --  |g 2.1.  |t Preliminaries --  |g 2.1.1.  |t Example: CP Violation via Mixing --  |g 2.1.2.  |t Exponential Family --  |g 2.1.3.  |t Confidence Intervals --  |g 2.1.4.  |t Hypothesis Tests --  |g 2.2.  |t Parametric Likelihood Fits --  |g 2.2.1.  |t Nuisance Parameters --  |g 2.2.2.  |t Confidence Intervals from Pivotal Quantities --  |g 2.2.3.  |t Asymptotic Inference --  |g 2.2.4.  |t Profile Likelihood --  |g 2.2.5.  |t Conditional Likelihood --  |g 2.3.  |t Fits for Small Statistics --  |g 2.3.1.  |t Sample Study of Coverage at Small Statistics --  |g 2.3.2.  |t When the pdf Goes Negative --  |g 2.4.  |t Results Near the Boundary of a Physical Region --  |g 2.5.  |t Likelihood Ratio Test for Presence of Signal --  |g 2.6.  |t sPlots --  |g 2.7.  |t Exercises --  |t References --  |g 3.  |t Goodness of Fit --  |g 3.1.  |t Binned Goodness of Fit Tests --  |g 3.2.  |t Statistics Converging to Chi-Square --  |g 3.3.  |t Univariate Unbinned Goodness of Fit Tests --  |g 3.3.1.  |t Kolmogorov--Smirnov --  |g 3.3.2.  |t Anderson--Darling --  |g 3.3.3.  |t Watson --  |g 3.3.4.  |t Neyman Smooth --  |g 3.4.  |t Multivariate Tests --  |g 3.4.1.  |t Energy Tests --  |g 3.4.2.  |t Transformations to a Uniform Distribution --  |g 3.4.3.  |t Local Density Tests --  |g 3.4.4.  |t Kernel-based Tests --  |g 3.4.5.  |t Mixed Sample Tests --  |g 3.4.6.  |t Using a Classifier --  |g 3.5.  |t Exercises --  |t References --  |g 4.  |t Resampling Techniques --  |g 4.1.  |t Permutation Sampling --  |g 4.2.  |t Bootstrap --  |g 4.2.1.  |t Bootstrap Confidence Intervals --  |g 4.2.2.  |t Smoothed Bootstrap --  |g 4.2.3.  |t Parametric Bootstrap --  |g 4.3.  |t Jackknife --  |g 4.4.  |t BCa Confidence Intervals --  |g 4.5.  |t Cross-Validation --  |g 4.6.  |t Resampling Weighted Observations --  |g 4.7.  
|t Exercises --  |t References --  |g 5.  |t Density Estimation --  |g 5.1.  |t Empirical Density Estimate --  |g 5.2.  |t Histograms --  |g 5.3.  |t Kernel Estimation --  |g 5.3.1.  |t Multivariate Kernel Estimation --  |g 5.4.  |t Ideogram --  |g 5.5.  |t Parametric vs. Nonparametric Density Estimation --  |g 5.6.  |t Optimization --  |g 5.6.1.  |t Choosing Histogram Binning --  |g 5.7.  |t Estimating Errors --  |g 5.8.  |t Curse of Dimensionality --  |g 5.9.  |t Adaptive Kernel Estimation --  |g 5.10.  |t Naive Bayes Classification --  |g 5.11.  |t Multivariate Kernel Estimation --  |g 5.12.  |t Estimation Using Orthogonal Series --  |g 5.13.  |t Using Monte Carlo Models --  |g 5.14.  |t Unfolding --  |g 5.14.1.  |t Unfolding: Regularization --  |g 5.15.  |t Exercises --  |t References --  |g 6.  |t Basic Concepts and Definitions of Machine Learning --  |g 6.1.  |t Supervised, Unsupervised, and Semi-Supervised --  |g 6.2.  |t Tall and Wide Data --  |g 6.3.  |t Batch and Online Learning --  |g 6.4.  |t Parallel Learning --  |g 6.5.  |t Classification and Regression --  |t References --  |g 7.  |t Data Preprocessing --  |g 7.1.  |t Categorical Variables --  |g 7.2.  |t Missing Values --  |g 7.2.1.  |t Likelihood Optimization --  |g 7.2.2.  |t Deletion --  |g 7.2.3.  |t Augmentation --  |g 7.2.4.  |t Imputation --  |g 7.2.5.  |t Other Methods --  |g 7.3.  |t Outliers --  |g 7.4.  |t Exercises --  |t References --  |g 8.  |t Linear Transformations and Dimensionality Reduction --  |g 8.1.  |t Centering, Scaling, Reflection and Rotation --  |g 8.2.  |t Rotation and Dimensionality Reduction --  |g 8.3.  |t Principal Component Analysis (PCA) --  |g 8.3.1.  |t Theory --  |g 8.3.2.  |t Numerical Implementation --  |g 8.3.3.  |t Weighted Data --  |g 8.3.4.  |t How Many Principal Components Are Enough--  |g 8.3.5.  |t Example: Apply PCA and Choose the Optimal Number of Components --  |g 8.4.  |t Independent Component Analysis (ICA) --  |g 8.4.1.  |t Theory --  |g 8.4.2.  
|t Numerical implementation --  |g 8.4.3.  |t Properties --  |g 8.5.  |t Exercises --  |t References --  |g 9.  |t Introduction to Classification --  |g 9.1.  |t Loss Functions: Hard Labels and Soft Scores --  |g 9.2.  |t Bias, Variance, and Noise --  |g 9.3.  |t Training, Validating and Testing: The Optimal Splitting Rule --  |g 9.4.  |t Resampling Techniques: Cross-Validation and Bootstrap --  |g 9.4.1.  |t Cross-Validation --  |g 9.4.2.  |t Bootstrap --  |g 9.4.3.  |t Sampling with Stratification --  |g 9.5.  |t Data with Unbalanced Classes --  |g 9.5.1.  |t Adjusting Prior Probabilities --  |g 9.5.2.  |t Undersampling the Majority Class --  |g 9.5.3.  |t Oversampling the Minority Class --  |g 9.5.4.  |t Example: Classification of Forest Cover Type Data --  |g 9.6.  |t Learning with Cost --  |g 9.7.  |t Exercises --  |t References --  |g 10.  |t Assessing Classifier Performance --  |g 10.1.  |t Classification Error and Other Measures of Predictive Power --  |g 10.2.  |t Receiver Operating Characteristic (ROC) and Other Curves --  |g 10.2.1.  |t Empirical ROC curve --  |g 10.2.2.  |t Other Performance Measures --  |g 10.2.3.  |t Optimal Operating Point --  |g 10.2.4.  |t Area Under Curve --  |g 10.2.5.  |t Smooth ROC Curves --  |g 10.2.6.  |t Confidence Bounds for ROC Curves --  |g 10.3.  |t Testing Equivalence of Two Classification Models --  |g 10.4.  |t Comparing Several Classifiers --  |g 10.5.  |t Exercises --  |t References --  |g 11.  |t Linear and Quadratic Discriminant Analysis, Logistic Regression, and Partial Least Squares Regression --  |g 11.1.  |t Discriminant Analysis --  |g 11.1.1.  |t Estimating the Covariance Matrix --  |g 11.1.2.  |t Verifying Discriminant Analysis Assumptions --  |g 11.1.3.  |t Applying LDA When LDA Assumptions Are Invalid --  |g 11.1.4.  |t Numerical Implementation --  |g 11.1.5.  |t Regularized Discriminant Analysis --  |g 11.1.6.  |t LDA for Variable Transformation --  |g 11.2.  |t Logistic Regression --  |g 11.2.1.  
|t Binomial Logistic Regression: Theory and Numerical Implementation --  |g 11.2.2.  |t Properties of the Binomial Model --  |g 11.2.3.  |t Verifying Model Assumptions --  |g 11.2.4.  |t Logistic Regression with Multiple Classes --  |g 11.3.  |t Classification by Linear Regression --  |g 11.4.  |t Partial Least Squares Regression --  |g 11.5.  |t Example: Linear Models for MAGIC Telescope Data --  |g 11.6.  |t Choosing a Linear Classifier for Your Analysis --  |g 11.7.  |t Exercises --  |t References --  |g 12.  |t Neural Networks --  |g 12.1.  |t Perceptrons --  |g 12.2.  |t Feed-Forward Neural Network --  |g 12.3.  |t Backpropagation --  |g 12.4.  |t Bayes Neural Networks --  |g 12.5.  |t Genetic Algorithms --  |g 12.6.  |t Exercises --  |t References --  |g 13.  |t Local Learning and Kernel Expansion --  |g 13.1.  |t From Input Variables to the Feature Space --  |g 13.1.1.  |t Kernel Regression --  |g 13.2.  |t Regularization --  |g 13.2.1.  |t Kernel Ridge Regression --  |g 13.3.  |t Making and Choosing Kernels --  |g 13.4.  |t Radial Basis Functions --  |g 13.4.1.  |t Example: RBF Classification for the MAGIC Telescope Data --  |g 13.5.  |t Support Vector Machines (SVM) --  |g 13.5.1.  |t SVM with Weighted Data --  |g 13.5.2.  |t SVM with Probabilistic Outputs --  |g 13.5.3.  |t Numerical Implementation --  |g 13.5.4.  |t Multiclass Extensions --  |g 13.6.  |t Empirical Local Methods --  |g 13.6.1.  |t Classification by Probability Density Estimation --  |g 13.6.2.  |t Locally Weighted Regression --  |g 13.6.3.  |t Nearest Neighbors and Fuzzy Rules --  |g 13.7.  |t Kernel Methods: The Good, the Bad and the Curse of Dimensionality --  |g 13.8.  |t Exercises --  |t References --  |g 14.  |t Decision Trees --  |g 14.1.  |t Growing Trees --  |g 14.2.  |t Predicting by Decision Trees --  |g 14.3.  |t Stopping Rules --  |g 14.4.  |t Pruning Trees --  |g 14.4.1.  |t Example: Pruning a Classification Tree --  |g 14.5.  |t Trees for Multiple Classes --  |g 14.6.  
|t Splits on Categorical Variables --  |g 14.7.  |t Surrogate Splits --  |g 14.8.  |t Missing Values --  |g 14.9.  |t Variable importance --  |g 14.10.  |t Why Are Decision Trees Good (or Bad)--  |g 14.11.  |t Exercises --  |t References --  |g 15.  |t Ensemble Learning --  |g 15.1.  |t Boosting --  |g 15.1.1.  |t Early Boosting --  |g 15.1.2.  |t AdaBoost for Two Classes --  |g 15.1.3.  |t Minimizing Convex Loss by Stagewise Additive Modeling --  |g 15.1.4.  |t Maximizing the Minimal Margin --  |g 15.1.5.  |t Nonconvex Loss and Robust Boosting --  |g 15.1.6.  |t Boosting for Multiple Classes --  |g 15.2.  |t Diversifying the Weak Learner: Bagging, Random Subspace and Random Forest --  |g 15.2.1.  |t Measures of Diversity --  |g 15.2.2.  |t Bagging and Random Forest --  |g 15.2.3.  |t Random Subspace --  |g 15.2.4.  |t Example: K/π Separation for BaBar PID --  |g 15.3.  |t Choosing an Ensemble for Your Analysis --  |g 15.4.  |t Exercises --  |t References --  |g 16.  |t Reducing Multiclass to Binary --  |g 16.1.  |t Encoding --  |g 16.2.  |t Decoding --  |g 16.3.  |t Summary: Choosing the Right Design --  |t References --  |g 17.  |t How to Choose the Right classifier for Your Analysis and Apply It Correctly --  |g 17.1.  |t Predictive Performance and Interpretability --  |g 17.2.  |t Matching Classifiers and Variables --  |g 17.3.  |t Using Classifier Predictions --  |g 17.4.  |t Optimizing Accuracy --  |g 17.5.  |t CPU and Memory Requirements --  |g 18.  |t Methods for Variable Ranking and Selection --  |g 18.1.  |t Definitions --  |g 18.1.1.  |t Variable Ranking and Selection --  |g 18.1.2.  |t Strong and Weak Relevance --  |g 18.2.  |t Variable Ranking --  |g 18.2.1.  |t Filters: Correlation and Mutual Information --  |g 18.2.2.  |t Wrappers: Sequential Forward Selection (SFS), Sequential Backward Elimination  
880 0 0 |t (SBE), and Feature-based Sensitivity of Posterior Probabilities (FSPP) --  |g 18.2.3.  |t Embedded Methods: Estimation of Variable Importance by Decision Trees, Neural Networks, Nearest Neighbors, and Linear Models --  |g 18.3.  |t Variable Selection --  |g 18.3.1.  |t Optimal-Set Search Strategies --  |g 18.3.2.  |t Multiple Testing: Backward Elimination by change in Margin (BECM) --  |g 18.3.3.  |t Estimation of the Reference Distribution by Permutations: Artificial Contrasts with Ensembles (ACE) Algorithm --  |g 18.4.  |t Exercises --  |t References --  |g 19.  |t Bump Hunting in Multivariate Data --  |g 19.1.  |t Voronoi Tessellation and SLEUTH Algorithm --  |g 19.2.  |t Identifying Box Regions by PRIM and Other Algorithms --  |g 19.3.  |t Bump Hunting Through Supervised Learning --  |t References --  |g 20.  |t Software Packages for Machine Learning --  |g 20.1.  |t Tools Developed in HEP --  |g 20.2.  |t R --  |g 20.3.  |t Matlab --  |g 20.4.  |t Tools for Java and Python --  |g 20.5.  |t What Software Tool Is Right for You--  |t References --  |g Appendix  |t A Optimization Algorithms --  |g A.1.  |t Line Search --  |g A.2.  |t Linear Programming (LP). 
938 |a Askews and Holts Library Services  |b ASKH  |n AH26418014 
938 |a Askews and Holts Library Services  |b ASKH  |n AH25802860 
938 |a Coutts Information Services  |b COUT  |n 26532223 
938 |a EBL - Ebook Library  |b EBLB  |n EBL1486350 
938 |a ebrary  |b EBRY  |n ebr10925517 
938 |a EBSCOhost  |b EBSC  |n 653912 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n cis26532223 
938 |a YBP Library Services  |b YANK  |n 11260456 
938 |a YBP Library Services  |b YANK  |n 12679604 
938 |a YBP Library Services  |b YANK  |n 10706500 
994 |a 92  |b IZTAP