Integrating code search into the development session

Mu Woong Lee, Seungwon Hwang, Sunghun Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].

Original languageEnglish
Title of host publication2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
Pages1336-1339
Number of pages4
DOIs
Publication statusPublished - 2011 Jun 6
Event2011 IEEE 27th International Conference on Data Engineering, ICDE 2011 - Hannover, Germany
Duration: 2011 Apr 112011 Apr 16

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Other

Other2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
CountryGermany
CityHannover
Period11/4/1111/4/16

Fingerprint

Syntactics
Software engineering
Human computer interaction
Search engines
Demonstrations

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Information Systems

Cite this

Lee, M. W., Hwang, S., & Kim, S. (2011). Integrating code search into the development session. In 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011 (pp. 1336-1339). [5767948] (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2011.5767948
Lee, Mu Woong ; Hwang, Seungwon ; Kim, Sunghun. / Integrating code search into the development session. 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011. 2011. pp. 1336-1339 (Proceedings - International Conference on Data Engineering).
@inproceedings{637a5bb4ae974f679da1b059fbbb24cc,
title = "Integrating code search into the development session",
abstract = "To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].",
author = "Lee, {Mu Woong} and Seungwon Hwang and Sunghun Kim",
year = "2011",
month = "6",
day = "6",
doi = "10.1109/ICDE.2011.5767948",
language = "English",
isbn = "9781424489589",
series = "Proceedings - International Conference on Data Engineering",
pages = "1336--1339",
booktitle = "2011 IEEE 27th International Conference on Data Engineering, ICDE 2011",

}

Lee, MW, Hwang, S & Kim, S 2011, Integrating code search into the development session. in 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011., 5767948, Proceedings - International Conference on Data Engineering, pp. 1336-1339, 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011, Hannover, Germany, 11/4/11. https://doi.org/10.1109/ICDE.2011.5767948

Integrating code search into the development session. / Lee, Mu Woong; Hwang, Seungwon; Kim, Sunghun.

2011 IEEE 27th International Conference on Data Engineering, ICDE 2011. 2011. p. 1336-1339 5767948 (Proceedings - International Conference on Data Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Integrating code search into the development session

AU - Lee, Mu Woong

AU - Hwang, Seungwon

AU - Kim, Sunghun

PY - 2011/6/6

Y1 - 2011/6/6

N2 - To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].

AB - To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].

UR - http://www.scopus.com/inward/record.url?scp=79957878003&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79957878003&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2011.5767948

DO - 10.1109/ICDE.2011.5767948

M3 - Conference contribution

SN - 9781424489589

T3 - Proceedings - International Conference on Data Engineering

SP - 1336

EP - 1339

BT - 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011

ER -

Lee MW, Hwang S, Kim S. Integrating code search into the development session. In 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011. 2011. p. 1336-1339. 5767948. (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2011.5767948