A Dependency Treebank of Spoken Second Language English

Kristopher Kyle, Masaki Eguchi, Aaron Miller, Theodore Sither

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we introduce a dependency treebank of spoken second language (L2) English that is annotated with part of speech (Penn POS) tags and syntactic dependencies (Universal Dependencies). We then evaluate the degree to which the use of this treebank as training data affects POS and UD annotation accuracy for L1 web texts, L2 written texts, and L2 spoken texts as compared to models trained on L1 texts only.

Original languageEnglish
Title of host publicationBEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings
EditorsEkaterina Kochmar, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Nitin Madnani, Anais Tack, Victoria Yaneva, Zheng Yuan, Torsten Zesch
PublisherAssociation for Computational Linguistics (ACL)
Pages39-45
Number of pages7
ISBN (Electronic)9781955917834
Publication statusPublished - 2022
Event17th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2022 - Seattle, United States
Duration: 2022 Jul 15 → …

Publication series

NameBEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings

Conference

Conference17th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2022
Country/TerritoryUnited States
CitySeattle
Period22/7/15 → …

Bibliographical note

Funding Information:
This project was supported by a Learning Early Career Research Grant.

Publisher Copyright:
© 2022 Association for Computational Linguistics.

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Software
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'A Dependency Treebank of Spoken Second Language English'. Together they form a unique fingerprint.

Cite this