SDSS Log Viewer: Visual exploratory analysis of large-volume SQL log data

Jian Zhang, Chaomei Chen, Michael S. Vogeley, Danny Pan, Ani Thakar, Jordan Raddic

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

User-generated Structured Query Language (SQL) queries are a rich source of information for database analysts, information scientists, and the end users of databases. In this study a group of scientists in astronomy and computer and information scientists work together to analyze a large volume of SQL log data generated by users of the Sloan Digital Sky Survey (SDSS) data archive in order to better understand users' data seeking behavior. While statistical analysis of such logs is useful at aggregated levels, efficiently exploring specific patterns of queries is often a challenging task due to the typically large volume of the data, multivariate features, and data requirements specified in SQL queries. To enable and facilitate effective and efficient exploration of the SDSS log data, we designed an interactive visualization tool, called the SDSS Log Viewer, which integrates time series visualization, text visualization, and dynamic query techniques. We describe two analysis scenarios of visual exploration of SDSS log data, including understanding unusually high daily query traffic and modeling the types of data seeking behaviors of massive query generators. The two scenarios demonstrate that the SDSS Log Viewer provides a novel and potentially valuable approach to support these targeted tasks.

Original languageEnglish
Title of host publicationProceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012
DOIs
Publication statusPublished - 2012 Feb 16
EventVisualization and Data Analysis 2012 - Burlingame, CA, United States
Duration: 2012 Jan 232012 Jan 25

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume8294
ISSN (Print)0277-786X

Other

OtherVisualization and Data Analysis 2012
CountryUnited States
CityBurlingame, CA
Period12/1/2312/1/25

Fingerprint

query languages
Exploratory Analysis
Datalog
Query languages
Query Language
Query
Visualization
Astronomy
Scenario Analysis
astronomy
statistical analysis
traffic
Survey Data
Multivariate Data
Time series
Statistical methods
generators
Statistical Analysis
Vision
requirements

All Science Journal Classification (ASJC) codes

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Cite this

Zhang, J., Chen, C., Vogeley, M. S., Pan, D., Thakar, A., & Raddic, J. (2012). SDSS Log Viewer: Visual exploratory analysis of large-volume SQL log data. In Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012 [82940D] (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 8294). https://doi.org/10.1117/12.907097
Zhang, Jian ; Chen, Chaomei ; Vogeley, Michael S. ; Pan, Danny ; Thakar, Ani ; Raddic, Jordan. / SDSS Log Viewer : Visual exploratory analysis of large-volume SQL log data. Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012. 2012. (Proceedings of SPIE - The International Society for Optical Engineering).
@inproceedings{07ae980d02a94010915555f970de2468,
title = "SDSS Log Viewer: Visual exploratory analysis of large-volume SQL log data",
abstract = "User-generated Structured Query Language (SQL) queries are a rich source of information for database analysts, information scientists, and the end users of databases. In this study a group of scientists in astronomy and computer and information scientists work together to analyze a large volume of SQL log data generated by users of the Sloan Digital Sky Survey (SDSS) data archive in order to better understand users' data seeking behavior. While statistical analysis of such logs is useful at aggregated levels, efficiently exploring specific patterns of queries is often a challenging task due to the typically large volume of the data, multivariate features, and data requirements specified in SQL queries. To enable and facilitate effective and efficient exploration of the SDSS log data, we designed an interactive visualization tool, called the SDSS Log Viewer, which integrates time series visualization, text visualization, and dynamic query techniques. We describe two analysis scenarios of visual exploration of SDSS log data, including understanding unusually high daily query traffic and modeling the types of data seeking behaviors of massive query generators. The two scenarios demonstrate that the SDSS Log Viewer provides a novel and potentially valuable approach to support these targeted tasks.",
author = "Jian Zhang and Chaomei Chen and Vogeley, {Michael S.} and Danny Pan and Ani Thakar and Jordan Raddic",
year = "2012",
month = "2",
day = "16",
doi = "10.1117/12.907097",
language = "English",
isbn = "9780819489418",
series = "Proceedings of SPIE - The International Society for Optical Engineering",
booktitle = "Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012",

}

Zhang, J, Chen, C, Vogeley, MS, Pan, D, Thakar, A & Raddic, J 2012, SDSS Log Viewer: Visual exploratory analysis of large-volume SQL log data. in Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012., 82940D, Proceedings of SPIE - The International Society for Optical Engineering, vol. 8294, Visualization and Data Analysis 2012, Burlingame, CA, United States, 12/1/23. https://doi.org/10.1117/12.907097

SDSS Log Viewer : Visual exploratory analysis of large-volume SQL log data. / Zhang, Jian; Chen, Chaomei; Vogeley, Michael S.; Pan, Danny; Thakar, Ani; Raddic, Jordan.

Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012. 2012. 82940D (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 8294).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - SDSS Log Viewer

T2 - Visual exploratory analysis of large-volume SQL log data

AU - Zhang, Jian

AU - Chen, Chaomei

AU - Vogeley, Michael S.

AU - Pan, Danny

AU - Thakar, Ani

AU - Raddic, Jordan

PY - 2012/2/16

Y1 - 2012/2/16

N2 - User-generated Structured Query Language (SQL) queries are a rich source of information for database analysts, information scientists, and the end users of databases. In this study a group of scientists in astronomy and computer and information scientists work together to analyze a large volume of SQL log data generated by users of the Sloan Digital Sky Survey (SDSS) data archive in order to better understand users' data seeking behavior. While statistical analysis of such logs is useful at aggregated levels, efficiently exploring specific patterns of queries is often a challenging task due to the typically large volume of the data, multivariate features, and data requirements specified in SQL queries. To enable and facilitate effective and efficient exploration of the SDSS log data, we designed an interactive visualization tool, called the SDSS Log Viewer, which integrates time series visualization, text visualization, and dynamic query techniques. We describe two analysis scenarios of visual exploration of SDSS log data, including understanding unusually high daily query traffic and modeling the types of data seeking behaviors of massive query generators. The two scenarios demonstrate that the SDSS Log Viewer provides a novel and potentially valuable approach to support these targeted tasks.

AB - User-generated Structured Query Language (SQL) queries are a rich source of information for database analysts, information scientists, and the end users of databases. In this study a group of scientists in astronomy and computer and information scientists work together to analyze a large volume of SQL log data generated by users of the Sloan Digital Sky Survey (SDSS) data archive in order to better understand users' data seeking behavior. While statistical analysis of such logs is useful at aggregated levels, efficiently exploring specific patterns of queries is often a challenging task due to the typically large volume of the data, multivariate features, and data requirements specified in SQL queries. To enable and facilitate effective and efficient exploration of the SDSS log data, we designed an interactive visualization tool, called the SDSS Log Viewer, which integrates time series visualization, text visualization, and dynamic query techniques. We describe two analysis scenarios of visual exploration of SDSS log data, including understanding unusually high daily query traffic and modeling the types of data seeking behaviors of massive query generators. The two scenarios demonstrate that the SDSS Log Viewer provides a novel and potentially valuable approach to support these targeted tasks.

UR - http://www.scopus.com/inward/record.url?scp=84863020206&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84863020206&partnerID=8YFLogxK

U2 - 10.1117/12.907097

DO - 10.1117/12.907097

M3 - Conference contribution

AN - SCOPUS:84863020206

SN - 9780819489418

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012

ER -

Zhang J, Chen C, Vogeley MS, Pan D, Thakar A, Raddic J. SDSS Log Viewer: Visual exploratory analysis of large-volume SQL log data. In Proceedings of SPIE-IS and T Electronic Imaging - Visualization and Data Analysis 2012. 2012. 82940D. (Proceedings of SPIE - The International Society for Optical Engineering). https://doi.org/10.1117/12.907097