Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!utgpu!water!watmath!uunet!ig!daemon From: daemon@ig.UUCP Newsgroups: bionet.molbio.news Subject: CSLG: COMMENTARY: From Ellis Golub (4) Message-ID: <4262@ig.ig.com> Date: Tue, 1-Dec-87 14:43:03 EST Article-I.D.: ig.4262 Posted: Tue Dec 1 14:43:03 1987 Date-Received: Sat, 5-Dec-87 13:18:17 EST Sender: daemon@presto.ig.com Lines: 21 From: Sunil MaulikComputer Applications in the Sequencing of Large Genomes To make such information retrieval possible, computer programs capable of very rapid scanning and filtering of sequence similarities must be developed. To facilitate these developments, unification of the present databases must begin now, before the task of integrating them becomes unmanageable. Second, new algorithms must be developed to scan the data at much faster rates than are available today. It may be necessary to develop secondary data bases which consist of sequence "words" of 10 - 20 nucleotides in length, thus reducing the number of comparisons made in individual searches. Another approach might involve a cumulative data base which would "learn" from each new search, the location of sequence patterns of interest, thus making available to the next searcher, a fast track to the same sequences. Third, new hardware will be brought online including supercomputers, parallel processors and dedicated sequence search engines. These machines, when coupled to efficient integrated software can allow useful data retrieval from the proposed databases. -------