Contributors |
|
ix | |
|
|
1 | (40) |
|
Introduction to Data Mining in Bioinformatics |
|
|
3 | (6) |
|
|
3 | (1) |
|
|
4 | (4) |
|
|
8 | (1) |
|
Survey of Biodata Analysis from a Data Mining Perspective |
|
|
9 | (32) |
|
|
9 | (3) |
|
Data Cleaning, Data Preprocessing, and Data Integration |
|
|
12 | (4) |
|
Exploration of Data Mining Tools for Biodata Analysis |
|
|
16 | (5) |
|
Discovery of Frequent Sequential and Structured Patterns |
|
|
21 | (3) |
|
|
24 | (1) |
|
|
25 | (3) |
|
Computational Modeling of Biological Networks |
|
|
28 | (3) |
|
Data Visualization and Visual Data Mining |
|
|
31 | (4) |
|
|
35 | (3) |
|
|
38 | (3) |
|
Part II. Sequence and Structure Alignment |
|
|
41 | (42) |
|
AntiClust A1: Multiple Sequence Alignment by Antipole Clustering |
|
|
43 | (16) |
|
|
43 | (2) |
|
|
45 | (2) |
|
Antipole Tree Data Structure for Clustering |
|
|
47 | (1) |
|
AntiClustA1: Multiple Sequence Alignment via Antipoles |
|
|
48 | (3) |
|
Comparing ClustalW and AntiClustA1 |
|
|
51 | (2) |
|
|
53 | (1) |
|
|
54 | (2) |
|
Future Developments and Research Problems |
|
|
56 | (3) |
|
RNA Structure Comparison and Alignment |
|
|
59 | (24) |
|
|
59 | (1) |
|
RNA Structure Comparison and Alignment Models |
|
|
60 | (7) |
|
|
67 | (1) |
|
Algorithms for RNA Secondary Structure Comparison |
|
|
67 | (4) |
|
Algorithms for RNA Structure Alignment |
|
|
71 | (5) |
|
Some Experimental Results |
|
|
76 | (7) |
|
Part III. Biological Data Mining |
|
|
83 | (134) |
|
Piecewise Constant Modeling of Sequential Data Using Reversible Jump Markov Chain Monte Carlo |
|
|
85 | (20) |
|
|
85 | (3) |
|
Bayesian Approach and MCMC Methods |
|
|
88 | (6) |
|
|
94 | (8) |
|
|
102 | (3) |
|
Gene Mapping by Pattern Discovery |
|
|
105 | (22) |
|
|
105 | (1) |
|
|
106 | (4) |
|
Haplotype Patterns as a Basis for Gene Mapping |
|
|
110 | (7) |
|
Instances of the Generalized Algorithm |
|
|
117 | (7) |
|
|
124 | (1) |
|
|
124 | (3) |
|
Predicting Protein Folding Pathways |
|
|
127 | (16) |
|
|
127 | (2) |
|
|
129 | (3) |
|
Predicting Folding Pathways |
|
|
132 | (5) |
|
Pathways for Other Proteins |
|
|
137 | (4) |
|
|
141 | (2) |
|
Data Mining Methods for a Systematics of Protein Subcellular Location |
|
|
143 | (46) |
|
|
144 | (3) |
|
|
147 | (39) |
|
|
186 | (3) |
|
Mining Chemical Compounds |
|
|
189 | (28) |
|
|
189 | (2) |
|
|
191 | (2) |
|
|
193 | (3) |
|
Classification Based on Frequent Subgraphs |
|
|
196 | (8) |
|
|
204 | (9) |
|
Conclusions and Directions for Future Research |
|
|
213 | (4) |
|
Part IV. Biological Data Management |
|
|
217 | (80) |
|
Phyloinformatics: Toward a Phylogenetic Database |
|
|
219 | (24) |
|
|
219 | (3) |
|
What Is a Phylogenetic Database For? |
|
|
222 | (2) |
|
|
224 | (5) |
|
|
229 | (1) |
|
Synthesizing Bigger Trees |
|
|
230 | (4) |
|
|
234 | (1) |
|
|
234 | (5) |
|
|
239 | (1) |
|
Prospects and Research Problems |
|
|
240 | (3) |
|
Declarative and Efficient Querying on Protein Secondary Structures |
|
|
243 | (32) |
|
|
243 | (3) |
|
|
246 | (1) |
|
Query Language and Sample Queries |
|
|
246 | (2) |
|
Query Evaluation Techniques |
|
|
248 | (4) |
|
Query Optimizer and Estimation |
|
|
252 | (15) |
|
Experimental Evaluation and Application of Periscope/PS2 |
|
|
267 | (4) |
|
Conclusions and Future Work |
|
|
271 | (4) |
|
Scalable Index Structures for Biological Data |
|
|
275 | (22) |
|
|
275 | (2) |
|
Index Structure for Sequences |
|
|
277 | (3) |
|
Indexing Protein Structures |
|
|
280 | (3) |
|
Comparative and Integrative Analysis of Pathways |
|
|
283 | (12) |
|
|
295 | (2) |
Glossary |
|
297 | (6) |
References |
|
303 | (24) |
Biographies |
|
327 | (10) |
Index |
|
337 | |