Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size

S. J. Park, S. H. Park, Y. J. Choi, D. W. Kim, E. J. Son, H. S. Lee, J. H. Yoon, E. K. Kim, H. J. Moon, J. Y. Kwak

Research output: Contribution to journalArticle

28 Citations (Scopus)

Abstract

Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.

Original languageEnglish
Pages (from-to)E186-E190
JournalUltraschall in der Medizin
Volume33
Issue number7
DOIs
Publication statusPublished - 2012 Oct 30

Fingerprint

Thyroid Nodule
Observer Variation
Research Ethics Committees
Informed Consent
Retrospective Studies
Radiologists

All Science Journal Classification (ASJC) codes

  • Radiology Nuclear Medicine and imaging

Cite this

Park, S. J. ; Park, S. H. ; Choi, Y. J. ; Kim, D. W. ; Son, E. J. ; Lee, H. S. ; Yoon, J. H. ; Kim, E. K. ; Moon, H. J. ; Kwak, J. Y. / Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size. In: Ultraschall in der Medizin. 2012 ; Vol. 33, No. 7. pp. E186-E190.
@article{01e19b30cf064663b46f3c5f37ed9359,
title = "Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size",
abstract = "Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.",
author = "Park, {S. J.} and Park, {S. H.} and Choi, {Y. J.} and Kim, {D. W.} and Son, {E. J.} and Lee, {H. S.} and Yoon, {J. H.} and Kim, {E. K.} and Moon, {H. J.} and Kwak, {J. Y.}",
year = "2012",
month = "10",
day = "30",
doi = "10.1055/s-0032-1325404",
language = "English",
volume = "33",
pages = "E186--E190",
journal = "Ultraschall in der Medizin",
issn = "0172-4614",
publisher = "Georg Thieme Verlag",
number = "7",

}

Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size. / Park, S. J.; Park, S. H.; Choi, Y. J.; Kim, D. W.; Son, E. J.; Lee, H. S.; Yoon, J. H.; Kim, E. K.; Moon, H. J.; Kwak, J. Y.

In: Ultraschall in der Medizin, Vol. 33, No. 7, 30.10.2012, p. E186-E190.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size

AU - Park, S. J.

AU - Park, S. H.

AU - Choi, Y. J.

AU - Kim, D. W.

AU - Son, E. J.

AU - Lee, H. S.

AU - Yoon, J. H.

AU - Kim, E. K.

AU - Moon, H. J.

AU - Kwak, J. Y.

PY - 2012/10/30

Y1 - 2012/10/30

N2 - Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.

AB - Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.

UR - http://www.scopus.com/inward/record.url?scp=84871643783&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84871643783&partnerID=8YFLogxK

U2 - 10.1055/s-0032-1325404

DO - 10.1055/s-0032-1325404

M3 - Article

C2 - 23108925

AN - SCOPUS:84871643783

VL - 33

SP - E186-E190

JO - Ultraschall in der Medizin

JF - Ultraschall in der Medizin

SN - 0172-4614

IS - 7

ER -