Confidence Learning from Noisy Labels for Arabic Dialect Identification

Zainab Alhakeem, Hong Goo Kang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we propose a new deep learning network for Arabic Dialect Identification (ADI) that addresses the incorrect label problem using confidence information. The recently released dataset for the MGB-5 ADI challenge includes a small amount of verified labels but a large amount of noisy labels, which makes the ADI task very challenging. We propose a confidence learning network (CLN) that utilizes a multi-task learning strategy to handle confidence information by leveraging the verified and noisy label sets. The proposed CLN employs a confidence refinement module using a Gumbel-softmax sampler that generates representations with more discriminative capabilities. Experimental results demonstrate that the proposed CLN model shows higher classification accuracy than conventional state-of-the-art ADI systems.

Original languageEnglish
Title of host publication2021 36th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665435536
DOIs
Publication statusPublished - 2021 Jun 27
Event36th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2021 - Jeju, Korea, Republic of
Duration: 2021 Jun 272021 Jun 30

Publication series

Name2021 36th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2021
Volume2021-January

Conference

Conference36th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2021
Country/TerritoryKorea, Republic of
CityJeju
Period21/6/2721/6/30

Bibliographical note

Funding Information:
ACKNOWLEDGMENT This work was supported in part by Artificial Intelligence Graduate School Program under Grant 2020-0-01361.

Publisher Copyright:
© 2021 IEEE.

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Confidence Learning from Noisy Labels for Arabic Dialect Identification'. Together they form a unique fingerprint.

Cite this