|Title: ||Implementing a Metasearch Framework with Content-directed Result Merging|
|Authors: ||Selvadurai, Santthosh Babu|
|Advisors: ||Dr. Munindar P. Singh, Committee Member; Dr. Gregory T. Byrd, Committee Chair; Dr. Yan Solihin, Committee Member|
|Issue Date: ||9-Nov-2007|
|Discipline: ||Computer Engineering|
|Abstract: ||A metasearch engine is a system that provides integrated access to multiple existing search engines. Once a query is executed on a metasearch engine, the system passes the query to its participating component search engines, collects the individual results, and merges them into a single ranked list. Metasearch engines increase the search coverage of the Web, help solve the scalability issues in searching the Internet, and improve retrieval effectiveness and, consequently, the relevance of results.
Result merging is a key constituent of metasearch engines. When results from several search engines are collected, the metasearch system has to merge them into a unified list. The effectiveness of the metasearch mechanism and the relevance of the result set are closely related to the result merging algorithm used.
The purpose of this research is to build a flexible, general-purpose metasearch framework and to explore a content-directed approach to result merging for ranking results. The content direction is supplied to the framework by the user in the form of documents or text artifacts.
A modular metasearch application programming interface (API) based on Java has been implemented. The API framework provides interfaces and utilities for developing the components of a metasearch system, such as segregators, a scheduler, aggregators, and search service providers. A prototype metasearch engine has been built on this framework to study the content-directed result merging algorithm.|
|Appears in Collections:||Theses|
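The flow described in the abstract (send a query to several component engines, collect their results, and merge them into one list ranked with the help of user-supplied content) can be illustrated with a minimal Java sketch. Everything here is an assumption made for illustration: the names `MetasearchSketch`, `SearchProvider`, `Result`, and `metasearch` are hypothetical and are not taken from the thesis API, and cosine similarity over term frequencies is only a stand-in scoring function, not necessarily the merging algorithm the thesis studies. The sketch assumes Java 17 or later.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class MetasearchSketch {

    /** One hit returned by a component search engine (hypothetical shape). */
    record Result(String url, String snippet) {}

    /** Hypothetical stand-in for a "search service provider". */
    interface SearchProvider {
        List<Result> search(String query);
    }

    /** Counts lower-cased alphanumeric terms in a text. */
    static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    /** Cosine similarity between two term-frequency vectors (assumed scoring function). */
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0, normA = 0, normB = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            dot += e.getValue() * b.getOrDefault(e.getKey(), 0);
            normA += e.getValue() * e.getValue();
        }
        for (int v : b.values()) {
            normB += v * v;
        }
        return (normA == 0 || normB == 0) ? 0.0 : dot / Math.sqrt(normA * normB);
    }

    /** Queries every provider, drops duplicate URLs, and ranks by similarity to the user's content. */
    static List<Result> metasearch(String query, List<SearchProvider> providers, String userContent) {
        // Collect results in arrival order; the first copy of a URL wins.
        Map<String, Result> byUrl = new LinkedHashMap<>();
        for (SearchProvider provider : providers) {
            for (Result r : provider.search(query)) {
                byUrl.putIfAbsent(r.url(), r);
            }
        }
        // Score each unique result against the user-supplied text.
        Map<String, Integer> profile = termFrequencies(userContent);
        Map<String, Double> scores = new HashMap<>();
        for (Result r : byUrl.values()) {
            scores.put(r.url(), cosine(profile, termFrequencies(r.snippet())));
        }
        // Highest score first; the sort is stable, so ties keep arrival order.
        List<Result> merged = new ArrayList<>(byUrl.values());
        merged.sort(Comparator.comparingDouble((Result r) -> scores.get(r.url())).reversed());
        return merged;
    }

    public static void main(String[] args) {
        // Two fake engines with canned results, used only to exercise the sketch.
        SearchProvider engineA = q -> List.of(
                new Result("http://example.org/a1", "cheap flights and hotel deals"),
                new Result("http://example.org/shared", "merging ranked lists in metasearch engines"));
        SearchProvider engineB = q -> List.of(
                new Result("http://example.org/shared", "merging ranked lists in metasearch engines"),
                new Result("http://example.org/b2", "search engines for the web"));
        String userContent = "notes on result merging for metasearch engines";
        for (Result r : metasearch("metasearch", List.of(engineA, engineB), userContent)) {
            System.out.println(r.url());
        }
    }
}
```

In this sketch the user's documents only reorder the merged list; a fuller design might also combine each engine's original rank with the content score, but the abstract does not say how the thesis weighs the two.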