Cargando…

Fundamentals of Data Mining in Genomics and Proteomics

More than ever before, research and development in genomics and proteomics depends on the analysis and interpretation of large amounts of data generated by high-throughput techniques. With the advance of computational systems biology, this situation will become even more manifest as scientists will...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor Corporativo: SpringerLink (Online service)
Otros Autores: Dubitzky, Werner (Editor ), Granzow, Martin (Editor ), Berrar, Daniel P. (Editor )
Formato: Electrónico eBook
Idioma:Inglés
Publicado: New York, NY : Springer US : Imprint: Springer, 2007.
Edición:1st ed. 2007.
Temas:
Acceso en línea:Texto Completo

MARC

LEADER 00000nam a22000005i 4500
001 978-0-387-47509-7
003 DE-He213
005 20220114173138.0
007 cr nn 008mamaa
008 100301s2007 xxu| s |||| 0|eng d
020 |a 9780387475097  |9 978-0-387-47509-7 
024 7 |a 10.1007/978-0-387-47509-7  |2 doi 
050 4 |a QH324.2-324.25 
072 7 |a UY  |2 bicssc 
072 7 |a PS  |2 bicssc 
072 7 |a SCI056000  |2 bisacsh 
072 7 |a PSAX  |2 thema 
082 0 4 |a 570.285  |2 23 
245 1 0 |a Fundamentals of Data Mining in Genomics and Proteomics  |h [electronic resource] /  |c edited by Werner Dubitzky, Martin Granzow, Daniel P. Berrar. 
250 |a 1st ed. 2007. 
264 1 |a New York, NY :  |b Springer US :  |b Imprint: Springer,  |c 2007. 
300 |a XXII, 281 p. 68 illus.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
505 0 |a to Genomic and Proteomic Data Analysis -- Design Principles for Microarray Investigations -- Pre-Processing DNA Microarray Data -- Pre-Processing Mass Spectrometry Data -- Visualization in Genomics and Proteomics -- Clustering - Class Discovery in the Post-Genomic Era -- Feature Selection and Dimensionality Reduction in Genomics and Proteomics -- Resampling Strategies for Model Assessment and Selection -- Classification of Genomic and Proteomic Data Using Support Vector Machines -- Networks in Cell Biology -- Identifying Important Explanatory Variables for Time-Varying Outcomes -- Text Mining in Genomics and Proteomics. 
520 |a More than ever before, research and development in genomics and proteomics depends on the analysis and interpretation of large amounts of data generated by high-throughput techniques. With the advance of computational systems biology, this situation will become even more manifest as scientists will generate truly large-scale data sets by simulating of biological systems and conducting synthetic experiments. To optimally exploit such data, life scientists need to understand the fundamental concepts and properties of the fast-growing arsenal of analytical techniques and methods from statistics and data mining. Typically, the relevant literature and products present these techniques in a form which is either very simplistic or highly mathematical, favoring formal rigor over conceptual clarity and practical relevance. Fundamentals of Data Mining in Genomics and Proteomics addresses these shortcomings by adopting an approach which focuses on fundamental concepts and practical applications. The book presents key analytical techniques used to analyze genomic and proteomic data by detailing their underlying principles, merits and limitations. An important goal of this text is to provide a highly intuitive and conceptual (as opposed to intricate mathematical) account of the discussed methodologies. This treatment will enable readers with interest in analysis of genomic and proteomic data to quickly learn and appreciate the essential properties of relevant data mining methodologies without recourse to advanced mathematics. To complement the conceptual discussions, the book draws upon the lessons learned from applying the presented techniques to concrete analysis problems in genomics and proteomics. The caveats and pitfalls of the discussed methods are highlighted by addressing questions such as: What can go wrong? Under which circumstances can a particular method be applied and when should it not be used? What alternative methods exist? Extensive references to related material and resources are provided to assist readers in identifying and exploring additional information. The structure of this text mirrors the typical stages involved in deploying a data mining solution, spanning from data pre-processing to knowledge discovery to result post-processing. It is hoped that this will equip researchers and practitioners with a useful and practical framework to tackle their own data mining problems in genomics and proteomics. In contrast to some texts on machine learning and biological data analysis, a deliberate effort has been made to incorporate important statistical notions. By doing so the book is following demands for a more statistical data mining approach to analyzing high-throughput data. Finally, by highlighting limitations and open issues Fundamentals of Data Mining in Genomics and Proteomics is intended to instigate critical thinking and avenues for new research in the field. 
650 0 |a Bioinformatics. 
650 0 |a Biochemistry. 
650 0 |a Statistics . 
650 0 |a Biotechnology. 
650 0 |a Proteins . 
650 1 4 |a Bioinformatics. 
650 2 4 |a Biochemistry. 
650 2 4 |a Statistics. 
650 2 4 |a Computational and Systems Biology. 
650 2 4 |a Biotechnology. 
650 2 4 |a Protein Biochemistry. 
700 1 |a Dubitzky, Werner.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Granzow, Martin.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Berrar, Daniel P.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer Nature eBook 
776 0 8 |i Printed edition:  |z 9781441942913 
776 0 8 |i Printed edition:  |z 9780387516608 
776 0 8 |i Printed edition:  |z 9780387475080 
856 4 0 |u https://doi.uam.elogim.com/10.1007/978-0-387-47509-7  |z Texto Completo 
912 |a ZDB-2-SBL 
912 |a ZDB-2-SXB 
950 |a Biomedical and Life Sciences (SpringerNature-11642) 
950 |a Biomedical and Life Sciences (R0) (SpringerNature-43708)