294 Pages
    by Routledge

    296 Pages
    by Routledge

    The fields of molecular biology and genetics are faced with an enormous accumulation of information: DNA sequencing, associated sequences of amino acids in proteins, genetics, macromolecular structures and other sets have created a quantitative backlog of data which needs to be organized and analyzed. Moreover, the rate of data acquisitions is accelerating as improved technologies are used and as organized programs such as the Human Genome Initiative are established. Because of this data's importance, molecular biologists have turned to computational scientists for help in processing this mass of information.The Santa Fe Institute organized a workshop on "The Interface Between Computational Science and DNA Sequencing" in 1998 to address this information crisis. Approximately one hundred molecular biologists, computer scientists, mathematicians, and other scientists in diverse fields met to discuss how computational science can best keep pace with molecular biology. The papers presented at that meeting and included in this volume serve as a comprehensive introduction to the field and as a discussion of research on some of the current problems.

    Preface , Introductory , The Human Genome: An Introduction , Computation and the Human Genome Project: An Historical Perspective , What We Have Learned and Will Learn from Sequence Databases , Databases , The Flow of Nucleotide Sequence Data into Data Banks: Role and Impact of Large-Scale Sequencing Projects , DNA Data Bank of Japan: Present Status and Future Plans , BIONET: Status and Future Plans , The Human Gene Mapping Library: Present Status and Future Directions , Sequence Comparisons , High-Efficiency Sequence Database Searching: Use of the Distributed Array Processor , Sequence Searching on Supercomputers , Protein Sequence Comparison on the Connection Machine CM-2 , Computation of d 2: A Measure of Sequence Dissimilarity , Multiple Hypothesis Testing for Sequence Comparisons , Analysis , Superhelical Transitions and DNA Regulation , Non-Continuous Patterns and Compositional Complexity of Nucleic Acid Sequences , Application of Neural Networks and Other Machine Learning Algorithms to DNA Sequence Analysis , CPGA 1 : A Prototype for Analysis of the Sequence-Structure-Function Relationship in the Human Genome Project , Statistical of Nucleic Acid Sequences , GeneSys: A Knowledge Management System for Molecular Biology , Simulation and Analysis of Physical Mapping , Genetic Sequence Semantic and Syntactic Patterns


    Thomas Marr