Using large data bases for interactive problem solving

Author(s):  
Eric D. Carlson
Computer ◽  
1981 ◽  
Vol 14 (1) ◽  
pp. 53-53
Keyword(s):  

Author(s):  
Célia Talma Gonçalves ◽  
Rui Camacho ◽  
Eugénio Oliveira

Whenever new sequences of DNA or proteins have been decoded it is almost compulsory to look at similar sequences and papers describing those sequences in order to both collect relevant information concerning the function and activity of the new sequences and/or know what is known already about similar sequences. In current web sites and data bases of sequences there are, usually, a set of curated paper references linked to each sequence. Those links are a good starting point to look for relevant information related to a set of sequences. One way to implement such approach is to do a blast with the new decoded sequences, and collect similar sequences. Then one looks at the papers linked with the similar sequences. Most often the number of retrieved papers is small and one has to search large data bases for relevant papers. This paper proposes a process of generating a classifier based on the initially set of relevant papers. First, the authors collect similar sequences using an alignment algorithm like Blast. Then, the authors use the enlarges set of papers to construct a classifier. Finally a classifier is used to automatically enlarge the set of relevant papers by searching the MEDLINE using the automatically constructed classifier.


Online Review ◽  
1977 ◽  
Vol 1 (3) ◽  
pp. 207-215 ◽  
Author(s):  
James A. Luedke

The activities that are at present furthering the visibility of numeric data bases and systems are discussed. A tentative estimate of the numbers of existing numeric data bases and systems in various categories of accessibility (on‐line, batch, and remotely accessible) and availability (public, restricted, and in‐house) is made. Numeric data bases and systems are becoming offered by information retrieval services through many of the channels that made bibliographic and textual information systems successful. These include remote accessibility and marketing by large data base vendors. The user community for numeric data bases is generally more restricted, with the possible exception of business and financial numeric data bases. User training needs are generally greater as well. Numeric data bases may nevertheless reach large user audiences with the advent of distributed processing and, ultimately, data base maps which can automatically locate and link together data bases in many locations.


1976 ◽  
Vol 19 (1) ◽  
pp. 17-20 ◽  
Author(s):  
G. Schlageter
Keyword(s):  

1981 ◽  
Vol 25 ◽  
pp. 237-244
Author(s):  
T. M. Hare ◽  
J. C. Russ ◽  
M. J. Lanzo

X-ray diffraction to determine the phases present in complex specimens generally proceeds by acquiring a pattern of D-spacings and intensities from the sample, which is then compared to patterns from a series of known or standard phases. The process of searching through large data bases to identify matches is too time consuming for manual methods, and so computerised search/match programs have come into use. The magnitude of the general problem, in which perhaps 40,000 known patterns (corresponding to the present size of the JCPDS powder diffraction file) are involved, places it in the realm of large nainframe computers, which are often accessible on a time-share basis by many laboratories or researchers using XRD. On the other hand, in many practical applications, the search need only be carried out over a limited range of compounds which are expected or may possibly be present based on the known history of the material, or on its independently determined composition. In this case, the data base becomes much smaller, and can be accommodated In micro- and minicomputers. The system we report here utilizes a 64K-byte 8 -bit microcomputer (Apple 11+) with dual floppy disk drives. One 5-1/4 inch disk can hold the pattern information for fifty compounds, and can be searched for possible matches in a few minutes. The same computer can control the scanning diffractometer to acquire the pattern data, and can act as an intelligent terminal to larger time-share machines when it is necessary to access the large data bases.


Sign in / Sign up

Export Citation Format

Share Document