Diagnostic performances and interobserver agreement according to observer experience: a comparison study using three guidelines for management of thyroid nodules

Jieun Koh, Soo Yeon Kim, Hye Sun Lee, Eun Kyung Kim, Jin Young Kwak, Hee Jung Moon, Jung Hyun Yoon

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

Background: The differences regarding categorization of thyroid nodules among different guidelines may affect the diagnostic performances and agreement among observers. Purpose: To compare the diagnostic performances and agreements between observers with various degree of experience when applying different guidelines for stratifying thyroid nodules using suspicious ultrasonography (US) features. Material and Methods: This retrospective study included 370 thyroid nodules (≥10 mm). Four observers, grouped as experienced and inexperienced, evaluated the US features and made final assessments according to the Kim criteria, Thyroid Imaging Reporting and Data System (TIRADS) by Kwak et al., and the 2015 American Thyroid Association (ATA) guideline. Diagnostic performances and agreements among the two groups were compared. Results: The Kim criteria shows higher specificity with significantly lower sensitivity when compared to TIRADS and the 2015 ATA guideline (all P < 0.001), regardless of the level of experience. The experienced group showed significantly higher specificity with the Kim criteria and the 2015 ATA guideline compared to the inexperienced group (P < 0.001), and the inexperienced group showed significantly higher sensitivity using the Kim criteria (P = 0.002). The experienced group showed significantly higher agreement than the inexperienced group when using TIRADS while higher agreement was seen when using the 2015 ATA guideline for the inexperienced group. Agreement was not significantly different for the Kim criteria according to observer experience. Conclusion: The diagnostic performances and agreements show significant differences in risk stratification of thyroid nodules according to the three guidelines using suspicious US features and the level of experience of the observer.

Original languageEnglish
Pages (from-to)917-923
Number of pages7
JournalActa Radiologica
Volume59
Issue number8
DOIs
Publication statusPublished - 2018 Aug 1

Fingerprint

Thyroid Nodule
Guidelines
Thyroid Gland
Information Systems
Ultrasonography
Retrospective Studies

All Science Journal Classification (ASJC) codes

  • Radiological and Ultrasound Technology
  • Radiology Nuclear Medicine and imaging

Cite this

@article{8e07ba3374f04895b9d10d640a0c6f6b,
title = "Diagnostic performances and interobserver agreement according to observer experience: a comparison study using three guidelines for management of thyroid nodules",
abstract = "Background: The differences regarding categorization of thyroid nodules among different guidelines may affect the diagnostic performances and agreement among observers. Purpose: To compare the diagnostic performances and agreements between observers with various degree of experience when applying different guidelines for stratifying thyroid nodules using suspicious ultrasonography (US) features. Material and Methods: This retrospective study included 370 thyroid nodules (≥10 mm). Four observers, grouped as experienced and inexperienced, evaluated the US features and made final assessments according to the Kim criteria, Thyroid Imaging Reporting and Data System (TIRADS) by Kwak et al., and the 2015 American Thyroid Association (ATA) guideline. Diagnostic performances and agreements among the two groups were compared. Results: The Kim criteria shows higher specificity with significantly lower sensitivity when compared to TIRADS and the 2015 ATA guideline (all P < 0.001), regardless of the level of experience. The experienced group showed significantly higher specificity with the Kim criteria and the 2015 ATA guideline compared to the inexperienced group (P < 0.001), and the inexperienced group showed significantly higher sensitivity using the Kim criteria (P = 0.002). The experienced group showed significantly higher agreement than the inexperienced group when using TIRADS while higher agreement was seen when using the 2015 ATA guideline for the inexperienced group. Agreement was not significantly different for the Kim criteria according to observer experience. Conclusion: The diagnostic performances and agreements show significant differences in risk stratification of thyroid nodules according to the three guidelines using suspicious US features and the level of experience of the observer.",
author = "Jieun Koh and Kim, {Soo Yeon} and Lee, {Hye Sun} and Kim, {Eun Kyung} and Kwak, {Jin Young} and Moon, {Hee Jung} and Yoon, {Jung Hyun}",
year = "2018",
month = "8",
day = "1",
doi = "10.1177/0284185117744001",
language = "English",
volume = "59",
pages = "917--923",
journal = "Acta Radiologica",
issn = "0284-1851",
publisher = "SAGE Publications Ltd",
number = "8",

}

Diagnostic performances and interobserver agreement according to observer experience : a comparison study using three guidelines for management of thyroid nodules. / Koh, Jieun; Kim, Soo Yeon; Lee, Hye Sun; Kim, Eun Kyung; Kwak, Jin Young; Moon, Hee Jung; Yoon, Jung Hyun.

In: Acta Radiologica, Vol. 59, No. 8, 01.08.2018, p. 917-923.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Diagnostic performances and interobserver agreement according to observer experience

T2 - a comparison study using three guidelines for management of thyroid nodules

AU - Koh, Jieun

AU - Kim, Soo Yeon

AU - Lee, Hye Sun

AU - Kim, Eun Kyung

AU - Kwak, Jin Young

AU - Moon, Hee Jung

AU - Yoon, Jung Hyun

PY - 2018/8/1

Y1 - 2018/8/1

N2 - Background: The differences regarding categorization of thyroid nodules among different guidelines may affect the diagnostic performances and agreement among observers. Purpose: To compare the diagnostic performances and agreements between observers with various degree of experience when applying different guidelines for stratifying thyroid nodules using suspicious ultrasonography (US) features. Material and Methods: This retrospective study included 370 thyroid nodules (≥10 mm). Four observers, grouped as experienced and inexperienced, evaluated the US features and made final assessments according to the Kim criteria, Thyroid Imaging Reporting and Data System (TIRADS) by Kwak et al., and the 2015 American Thyroid Association (ATA) guideline. Diagnostic performances and agreements among the two groups were compared. Results: The Kim criteria shows higher specificity with significantly lower sensitivity when compared to TIRADS and the 2015 ATA guideline (all P < 0.001), regardless of the level of experience. The experienced group showed significantly higher specificity with the Kim criteria and the 2015 ATA guideline compared to the inexperienced group (P < 0.001), and the inexperienced group showed significantly higher sensitivity using the Kim criteria (P = 0.002). The experienced group showed significantly higher agreement than the inexperienced group when using TIRADS while higher agreement was seen when using the 2015 ATA guideline for the inexperienced group. Agreement was not significantly different for the Kim criteria according to observer experience. Conclusion: The diagnostic performances and agreements show significant differences in risk stratification of thyroid nodules according to the three guidelines using suspicious US features and the level of experience of the observer.

AB - Background: The differences regarding categorization of thyroid nodules among different guidelines may affect the diagnostic performances and agreement among observers. Purpose: To compare the diagnostic performances and agreements between observers with various degree of experience when applying different guidelines for stratifying thyroid nodules using suspicious ultrasonography (US) features. Material and Methods: This retrospective study included 370 thyroid nodules (≥10 mm). Four observers, grouped as experienced and inexperienced, evaluated the US features and made final assessments according to the Kim criteria, Thyroid Imaging Reporting and Data System (TIRADS) by Kwak et al., and the 2015 American Thyroid Association (ATA) guideline. Diagnostic performances and agreements among the two groups were compared. Results: The Kim criteria shows higher specificity with significantly lower sensitivity when compared to TIRADS and the 2015 ATA guideline (all P < 0.001), regardless of the level of experience. The experienced group showed significantly higher specificity with the Kim criteria and the 2015 ATA guideline compared to the inexperienced group (P < 0.001), and the inexperienced group showed significantly higher sensitivity using the Kim criteria (P = 0.002). The experienced group showed significantly higher agreement than the inexperienced group when using TIRADS while higher agreement was seen when using the 2015 ATA guideline for the inexperienced group. Agreement was not significantly different for the Kim criteria according to observer experience. Conclusion: The diagnostic performances and agreements show significant differences in risk stratification of thyroid nodules according to the three guidelines using suspicious US features and the level of experience of the observer.

UR - http://www.scopus.com/inward/record.url?scp=85046025348&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85046025348&partnerID=8YFLogxK

U2 - 10.1177/0284185117744001

DO - 10.1177/0284185117744001

M3 - Article

C2 - 29181988

AN - SCOPUS:85046025348

VL - 59

SP - 917

EP - 923

JO - Acta Radiologica

JF - Acta Radiologica

SN - 0284-1851

IS - 8

ER -