TY - GEN
T1 - Integrating code search into the development session
AU - Lee, Mu Woong
AU - Hwang, Seung Won
AU - Kim, Sunghun
PY - 2011
Y1 - 2011
N2 - To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].
AB - To support rapid and efficient software development, we propose to demonstrate our tool, integrating code search into software development process. For example, a developer, right during writing a module, can find a code piece sharing the same syntactic structure from a large code corpus representing the wisdom of other developers in the same team (or in the universe of open-source code). While there exist commercial code search engines on the code universe, they treat software as text (thus oblivious of syntactic structure), and fail at finding semantically related code. Meanwhile, existing tools, searching for syntactic clones, do not focus on efficiency, focusing on post-mortem usage scenario of detecting clones after the code development is completed. In clear contrast, we focus on optimizing efficiency for syntactic code search and making this search interactive for large-scale corpus, to complement the existing two lines of research. From our demonstration, we will show how such interactive search supports rapid software development, as similarly claimed lately in SE and HCI communities [1], [2]. As an enabling technology, we design efficient index building and traversal techniques, optimized for code corpus and code search workload. Our tool can identify relevant code in the corpus of 1.7 million code pieces in a sub-second response time, without compromising any accuracy obtained by a state-of-the-art tool, as we report our extensive evaluation results in [3].
UR - http://www.scopus.com/inward/record.url?scp=79957878003&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79957878003&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2011.5767948
DO - 10.1109/ICDE.2011.5767948
M3 - Conference contribution
AN - SCOPUS:79957878003
SN - 9781424489589
T3 - Proceedings - International Conference on Data Engineering
SP - 1336
EP - 1339
BT - 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
T2 - 2011 IEEE 27th International Conference on Data Engineering, ICDE 2011
Y2 - 11 April 2011 through 16 April 2011
ER -