Introduction to Computational Proteomics introduces the field of computational biology through a focused approach that tackles the different steps and problems involved in protein analysis, classification, and meta-organization. The book starts with the analysis of individual entities and works its way up to more complex ones, from protein families to interactions, cellular pathways, and gene networks.
The first part of the book presents methods for identifying the building blocks of the protein space, such as motifs and domains. It also describes algorithms for assessing similarity between proteins based on sequence and structure analysis as well as mathematical models, such as hidden Markov models and support vector machines, that are used to represent protein families and classify new instances.
The second part covers methods that investigate higher-order structure in the protein space through the application of unsupervised learning algorithms, such as clustering and embedding. The book also explores the broader context of proteins. It discusses methods for analyzing gene expression data, predicting protein-protein interactions, elucidating cellular pathways, and reconstructing gene networks.
This book provides a coherent and thorough introduction to proteome analysis. It offers rigorous, formal descriptions, along with detailed algorithmic solutions and models. Each chapter includes problem sets from courses taught by the author at Cornell University and the Technion. Software downloads, data sets, and other material are available at biozon.org.