DAGM 2007 competition
2D Polygon
License: Unknown


The data is artificially generated, but similar to real world problems. The first six out of ten datasets, denoted as development datasets, are supposed to be used for algorithm development. The remaining four datasets, which are referred to as competition datasets, can be used to evaluate the performance. Researchers should consider not using or analyzing the competition datasets before the development is completed as a code of honour.
In the following we provide some details about the datasets:

  • Each development (competition) dataset consists of 1000 (2000) 'non-defective' and of 150 (300) 'defective' images saved in grayscale 8-bit PNG format.
  • Each dataset is generated by a different texture model and defect model.
  • 'Non-defective' images show the background texture without defects, 'defective' images have exactly one labelled defect on the background texture.
  • All datasets has been randomly split into a training and testing sub-dataset of equal size.
  • Weak labels are provided as ellipses roughly indicating the defective area. Technically, defective images are augmented with a separate grayscale 8-bit image in the PNG format located in a folder 'Label'. The values 0 and 255 denote background and de
Data Summary
Provided by
(HCI)Heidelberg Collaboratory for Image Processing
The Heidelberg Collaboratory for Image Processing (HCI) is an "Industry on Campus" project and is run jointly by the Heidelberg Universityand the companies: Bosch, Heidelberg Engineering, Volume Graphics and Bayer. The HCI is part of the Interdisciplinary Center for Scientific Computing (IWR).
Start Building AI Now