Enabling soft queries for data retrieval

Hwanjo Yu, Seungwon Hwang, Kevin Chen Chuan Chang

Research output: Contribution to journalArticle

21 Citations (Scopus)

Abstract

Data retrieval finding relevant data from large databases - has become a serious problem as myriad databases have been brought online in the Web. For instance, querying the for-sale houses in Chicago from realtor.com returns thousands of matching houses. Similarly, querying "digital camera" in froogle.com returns hundreds of thousand of results. This data retrieval is essentially an online ranking problem, i.e., ranking data results according to the user's preference effectively and efficiently. This paper proposes a new rank query framework, for effectively incorporating "user-friendly" rank-query formulation into "data base (DB)-friendly" rank-query processing, in order to enable "soft" queries on databases. Our framework assumes, as the "back-end," the score-based ranking model for expressive and efficient query processing. On top of the score-based model, as the "front-end," we adopt an SVM-ranking mechanism for providing intuitive and exploratory query formulation. In essence, our framework enables users to formulate queries simply by ordering some sample objects, while learning the "DB-friendly" ranking function F from the partial orders. Such learned functions can then be processed and optimized by existing database systems. We demonstrate the efficiency and effectiveness of our framework using real-life user queries and datasets: our results show that the system effectively learns quantitative ranking functions from qualitative feedback from users with efficient online processing.

Original languageEnglish
Pages (from-to)560-574
Number of pages15
JournalInformation Systems
Volume32
Issue number4
DOIs
Publication statusPublished - 2007 Jun 1

Fingerprint

Query processing
Digital cameras
Sales
Feedback
Query
Data base
Processing
Ranking
Ranking function

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Management of Technology and Innovation
  • Hardware and Architecture
  • Information Systems
  • Software

Cite this

Yu, Hwanjo ; Hwang, Seungwon ; Chang, Kevin Chen Chuan. / Enabling soft queries for data retrieval. In: Information Systems. 2007 ; Vol. 32, No. 4. pp. 560-574.
@article{bd3bfe8473c34bba952bc6e9c9951c12,
title = "Enabling soft queries for data retrieval",
abstract = "Data retrieval finding relevant data from large databases - has become a serious problem as myriad databases have been brought online in the Web. For instance, querying the for-sale houses in Chicago from realtor.com returns thousands of matching houses. Similarly, querying {"}digital camera{"} in froogle.com returns hundreds of thousand of results. This data retrieval is essentially an online ranking problem, i.e., ranking data results according to the user's preference effectively and efficiently. This paper proposes a new rank query framework, for effectively incorporating {"}user-friendly{"} rank-query formulation into {"}data base (DB)-friendly{"} rank-query processing, in order to enable {"}soft{"} queries on databases. Our framework assumes, as the {"}back-end,{"} the score-based ranking model for expressive and efficient query processing. On top of the score-based model, as the {"}front-end,{"} we adopt an SVM-ranking mechanism for providing intuitive and exploratory query formulation. In essence, our framework enables users to formulate queries simply by ordering some sample objects, while learning the {"}DB-friendly{"} ranking function F from the partial orders. Such learned functions can then be processed and optimized by existing database systems. We demonstrate the efficiency and effectiveness of our framework using real-life user queries and datasets: our results show that the system effectively learns quantitative ranking functions from qualitative feedback from users with efficient online processing.",
author = "Hwanjo Yu and Seungwon Hwang and Chang, {Kevin Chen Chuan}",
year = "2007",
month = "6",
day = "1",
doi = "10.1016/j.is.2006.02.001",
language = "English",
volume = "32",
pages = "560--574",
journal = "Information Systems",
issn = "0306-4379",
publisher = "Elsevier Limited",
number = "4",

}

Enabling soft queries for data retrieval. / Yu, Hwanjo; Hwang, Seungwon; Chang, Kevin Chen Chuan.

In: Information Systems, Vol. 32, No. 4, 01.06.2007, p. 560-574.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Enabling soft queries for data retrieval

AU - Yu, Hwanjo

AU - Hwang, Seungwon

AU - Chang, Kevin Chen Chuan

PY - 2007/6/1

Y1 - 2007/6/1

N2 - Data retrieval finding relevant data from large databases - has become a serious problem as myriad databases have been brought online in the Web. For instance, querying the for-sale houses in Chicago from realtor.com returns thousands of matching houses. Similarly, querying "digital camera" in froogle.com returns hundreds of thousand of results. This data retrieval is essentially an online ranking problem, i.e., ranking data results according to the user's preference effectively and efficiently. This paper proposes a new rank query framework, for effectively incorporating "user-friendly" rank-query formulation into "data base (DB)-friendly" rank-query processing, in order to enable "soft" queries on databases. Our framework assumes, as the "back-end," the score-based ranking model for expressive and efficient query processing. On top of the score-based model, as the "front-end," we adopt an SVM-ranking mechanism for providing intuitive and exploratory query formulation. In essence, our framework enables users to formulate queries simply by ordering some sample objects, while learning the "DB-friendly" ranking function F from the partial orders. Such learned functions can then be processed and optimized by existing database systems. We demonstrate the efficiency and effectiveness of our framework using real-life user queries and datasets: our results show that the system effectively learns quantitative ranking functions from qualitative feedback from users with efficient online processing.

AB - Data retrieval finding relevant data from large databases - has become a serious problem as myriad databases have been brought online in the Web. For instance, querying the for-sale houses in Chicago from realtor.com returns thousands of matching houses. Similarly, querying "digital camera" in froogle.com returns hundreds of thousand of results. This data retrieval is essentially an online ranking problem, i.e., ranking data results according to the user's preference effectively and efficiently. This paper proposes a new rank query framework, for effectively incorporating "user-friendly" rank-query formulation into "data base (DB)-friendly" rank-query processing, in order to enable "soft" queries on databases. Our framework assumes, as the "back-end," the score-based ranking model for expressive and efficient query processing. On top of the score-based model, as the "front-end," we adopt an SVM-ranking mechanism for providing intuitive and exploratory query formulation. In essence, our framework enables users to formulate queries simply by ordering some sample objects, while learning the "DB-friendly" ranking function F from the partial orders. Such learned functions can then be processed and optimized by existing database systems. We demonstrate the efficiency and effectiveness of our framework using real-life user queries and datasets: our results show that the system effectively learns quantitative ranking functions from qualitative feedback from users with efficient online processing.

UR - http://www.scopus.com/inward/record.url?scp=33846928532&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33846928532&partnerID=8YFLogxK

U2 - 10.1016/j.is.2006.02.001

DO - 10.1016/j.is.2006.02.001

M3 - Article

VL - 32

SP - 560

EP - 574

JO - Information Systems

JF - Information Systems

SN - 0306-4379

IS - 4

ER -