Supporting ground-truth annotation of image datasets using clustering

B.J. Boom, P.X. Huang, Jiyin He, R.B. Fisher

Research output: Chapter in Book/Report/Conference proceedingConference contribution


As more subject-specific image datasets (medical images, birds, etc) become available, high quality labels associated with these datasets are essential for building statistical models and method evaluation. Obtaining these annotations is a time-consuming and thus a costly business. We propose a clustering method to support this annotation task, making the task easier and more efficient to perform for users. In this paper, we provide a framework to illustrate how a clustering method can support the annotation task. A large reduction in both the time to annotate images and number of mouse clicks needed for the annotation is achieved. By investigating the quality of the annotation, we show that this framework is affected by the particular clustering method used. This, however, does not have a large influence on the overall accuracy and disappears if the data is annotated by multiple persons.
Original languageEnglish
Title of host publicationPattern Recognition (ICPR), 2012 21st International Conference on
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages4
ISBN (Print)978-1-4673-2216-4
Publication statusPublished - 2012


  • Labeling
  • ground-truth annotation
  • groundtruth classifications
  • visual databases
  • statistical analysis
  • Image color analysis
  • subject-specific image datasets
  • Accuracy
  • statistical models
  • Cleaning
  • Clustering methods
  • method evaluation
  • clustering method
  • Histograms
  • Birds
  • image classification
  • pattern clustering

Fingerprint Dive into the research topics of 'Supporting ground-truth annotation of image datasets using clustering'. Together they form a unique fingerprint.

Cite this