BioZone Exploting Source-Capability Information for Integrated Access to Multiple Bioinformatics Data Sources [electronic resource].
- Published
- Washington, D.C. : United States. Dept. of Energy, 2002.
Oak Ridge, Tenn. : Distributed by the Office of Scientific and Technical Information, U.S. Dept. of Energy. - Physical Description
- 15p : digital, PDF file
- Additional Creators
- Lawrence Berkeley National Laboratory, United States. Department of Energy, and United States. Department of Energy. Office of Scientific and Technical Information
Access Online
- Restrictions on Access
- Free-to-read Unrestricted online access
- Summary
- Modern Bioinformatics data sources are widely used by molecular biologists for homology searching and new drug discovery. User-friendly and yet responsive access is one of the most desirable properties for integrated access to the rapidly growing, heterogeneous, and distributed collection of data sources. The increasing volume and diversity of digital information related to bioinformatics (such as genomes, protein sequences, protein structures, etc.) have led to a growing problem that conventional data management systems do not have, namely finding which information sources out of many candidate choices are the most relevant and most accessible to answer a given user query. We refer to this problem as the query routing problem. In this paper we introduce the notation and issues of query routing, and present a practical solution for designing a scalable query routing system based on multi-level progressive pruning strategies. The key idea is to create and maintain source-capability profiles independently, and to provide algorithms that can dynamically discover relevant information sources for a given query through the smart use of source profiles. Compared to the keyword-based indexing techniques adopted in most of the search engines and software, our approach offers fine-granularity of interest matching, thus it is more powerful and effective for handling queries with complex conditions.
- Report Numbers
- E 1.99:ucrl-jc-146952
ucrl-jc-146952 - Subject(s)
- Other Subject(s)
- Note
- Published through SciTech Connect.
01/28/2002.
"ucrl-jc-146952"
The 10th International Conference on Intelligent Systems for Molecular Biology, Edmontom Alberta Canada, Aug 03 - Aug 07, 2002.
Liu, L; Buttler, D; Pu, C; Paques, H; Critchlow. - Funding Information
- W-7405-ENG-48
View MARC record | catkey: 13811192