As the popularity and complexity of Internet search engines increase, the design, development, and maintenance of large, complex Web-based Information Retrieval (WIR) systems become a challenge. The difficulty of designing a WIR system is compounded by the information overload that arises from many different information sources. From the standpoint of search engine users, a WIR system is more usable when it provides a single search point for multiple databases. To address this issue, we present the design and implementation of a cross-search component for the CS-Engine (Cross-Search Engine). The CS-Engine allows the user to search multiple heterogeneous databases with one command. It also differs from meta-search engines in that it does not need to invoke other search engines or translate a query for them. Our performance benchmark tests show that the CS-Engine is scalable and usable. We also compare the CS-Engine with other search engines such as Google and AltaVista. The CS-Engine was developed with UML and design patterns, including (1) use case diagrams, (2) class diagrams, (3) package diagrams, (4) interaction diagrams, (5) the Factory pattern, and (6) the Strategy pattern. We conclude the paper with technical lessons learned as well as organizational issues encountered during the development phase.
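The abstract names the Factory and Strategy patterns and a single search point over heterogeneous databases, but gives no implementation detail. The following is a minimal sketch of how those pieces could fit together, not the CS-Engine's actual design. All class and function names (`SearchStrategy`, `ListSearch`, `SqliteSearch`, `make_strategy`, `cross_search`), the `docs(title)` table, and the sample titles are hypothetical and chosen for illustration only.

```python
import sqlite3
from abc import ABC, abstractmethod


class SearchStrategy(ABC):
    """Strategy interface: one implementation per kind of data source."""

    @abstractmethod
    def search(self, query):
        """Return a list of matching titles."""


class ListSearch(SearchStrategy):
    """Searches an in-memory list of titles (stand-in for a flat-file source)."""

    def __init__(self, titles):
        self.titles = titles

    def search(self, query):
        q = query.lower()
        return [t for t in self.titles if q in t.lower()]


class SqliteSearch(SearchStrategy):
    """Searches a hypothetical docs(title) table in a SQLite database."""

    def __init__(self, conn):
        self.conn = conn

    def search(self, query):
        rows = self.conn.execute(
            "SELECT title FROM docs WHERE title LIKE ? ORDER BY title",
            (f"%{query}%",),
        )
        return [row[0] for row in rows]


def make_strategy(kind, source):
    """Factory: map a source kind to the strategy that can query it."""
    strategies = {"list": ListSearch, "sqlite": SqliteSearch}
    if kind not in strategies:
        raise ValueError(f"unknown source kind: {kind}")
    return strategies[kind](source)


def cross_search(query, sources):
    """Run one query against every source and tag each hit with its origin."""
    hits = []
    for name, strategy in sources.items():
        hits.extend((name, title) for title in strategy.search(query))
    return hits


# Illustrative data only: two sources with different storage back ends.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE docs (title TEXT)")
conn.executemany(
    "INSERT INTO docs VALUES (?)",
    [("Web retrieval basics",), ("Database tuning",)],
)

sources = {
    "catalog": make_strategy("list", ["Retrieval models", "UML in practice"]),
    "archive": make_strategy("sqlite", conn),
}
results = cross_search("retrieval", sources)
print(results)
```

In this sketch the query goes directly to each data source rather than to another search engine, which mirrors the distinction from meta-search engines drawn in the abstract. Supporting a new kind of source would mean adding one strategy class and one entry in the factory's mapping.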
Number of pages: 14
Journal: Information Systems Frontiers
Publication status: Published - March 2004
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Information Systems
- Computer Networks and Communications