Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP
Path: utzoo!utgpu!water!watmath!uunet!ig!daemon
From: daemon@ig.UUCP
Newsgroups: bionet.molbio.news
Subject: CSLG: COMMENTARY: From Ellis Golub (4)
Message-ID: <4262@ig.ig.com>
Date: Tue, 1-Dec-87 14:43:03 EST
Article-I.D.: ig.4262
Posted: Tue Dec  1 14:43:03 1987
Date-Received: Sat, 5-Dec-87 13:18:17 EST
Sender: daemon@presto.ig.com
Lines: 21

From: Sunil Maulik 

         Computer Applications in the Sequencing of Large Genomes

    To make such information retrieval possible, computer programs capable 
of very rapid scanning and filtering of sequence similarities must be 
developed. To facilitate these developments, unification of the present 
databases must begin now, before the task of integrating them becomes 
unmanageable. Second, new algorithms must be developed to scan the data at 
much faster rates than are available today. It may be necessary to develop 
secondary data bases which consist of sequence "words" of 10 - 20 
nucleotides in length, thus reducing the number of comparisons made in 
individual searches. Another approach might involve a cumulative data base 
which would "learn" from each new search, the location of sequence 
patterns of interest, thus making available to the next searcher, a fast 
track to the same sequences. Third, new hardware will be brought online 
including supercomputers, parallel processors and dedicated sequence 
search engines. These machines, when coupled to efficient integrated 
software can allow useful data retrieval from the proposed databases.

-------