INRIA Copydays
No Label
Image Search
License: Custom


The Copydays dataset is a set of images composed exclusively of our personal holiday photos. Each image has been subjected to three kinds of artificial attacks: JPEG compression, cropping and "strong". The motivation is to evaluate the behavior of indexing algorithms on the most common kinds of image copies (for video, you may be interested in our video copy generation software). Because the dataset is small, it is evaluated by merging the original images into a large image database.

The dataset can be downloaded from this page; see details below. The material provided includes the images themselves. On request, we may also provide the set of descriptors extracted from these images. For the evaluation, one should ideally use the same sets of distractor images, downloaded from Flickr, as we used. We can provide them on request.
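The evaluation protocol described above (original images merged into a large database of distractors, with attacked images used as queries) can be sketched as follows. This is only an illustration: the text does not name a metric, so average precision is an assumption here, and the function names and image identifiers are hypothetical.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked result list.

    ranked_ids: database image ids returned for a query, best match first.
    relevant_ids: ids of the original image(s) the query was derived from.
    """
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, image_id in enumerate(ranked_ids, start=1):
        if image_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this recall point
    return precision_sum / len(relevant)


def mean_average_precision(results, ground_truth):
    """Mean of the per-query average precision over all queries."""
    aps = [average_precision(results.get(q, []), ground_truth[q])
           for q in ground_truth]
    return sum(aps) / len(aps) if aps else 0.0


# Hypothetical ids: the database mixes originals with Flickr distractors.
results = {
    "query_a_crop50": ["distractor_17", "orig_a", "distractor_3"],
    "query_b_jpeg10": ["orig_b", "distractor_8", "distractor_2"],
}
ground_truth = {
    "query_a_crop50": ["orig_a"],
    "query_b_jpeg10": ["orig_b"],
}
score = mean_average_precision(results, ground_truth)
```

With more distractors in the database, the original tends to be pushed further down the ranking for the harder attacks, which is why using the same distractor sets matters when comparing scores.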


It contains:

  • Original images (208MB)
  • Cropped images, with 10% to 80% of the image surface removed (2.0GB)
  • Scale+JPEG attacked images (77MB), scale: 1/16 (pixels), JPEG quality factors: 75, 50, 30, 20, 15, 10, 8, 5, 3
  • 229 strongly attacked images: print and scan, blur, paint, ...
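To make the attack parameters listed above concrete, the arithmetic behind them can be sketched as below. Two assumptions are made that the description does not confirm: that cropping preserves the aspect ratio, and that "1/16 (pixels)" means the pixel count is divided by 16 (so each side is divided by 4). The function names are hypothetical, and this is not the procedure actually used to generate the dataset.

```python
import math

# JPEG quality factors listed in the dataset description.
JPEG_QUALITIES = [75, 50, 30, 20, 15, 10, 8, 5, 3]


def cropped_size(width, height, removed_fraction):
    """Size left after removing a fraction of the image surface.

    Assumes the crop keeps the aspect ratio, so each side shrinks
    by sqrt(1 - removed_fraction).
    """
    if not 0.0 <= removed_fraction < 1.0:
        raise ValueError("removed_fraction must be in [0, 1)")
    side_scale = math.sqrt(1.0 - removed_fraction)
    return (max(1, round(width * side_scale)),
            max(1, round(height * side_scale)))


def downscaled_size(width, height, pixel_factor=16):
    """Size after dividing the pixel count by pixel_factor.

    Assumes "1/16 (pixels)" refers to the number of pixels,
    so each side is divided by sqrt(pixel_factor).
    """
    side_scale = 1.0 / math.sqrt(pixel_factor)
    return (max(1, round(width * side_scale)),
            max(1, round(height * side_scale)))


crop_example = cropped_size(1000, 1000, 0.75)   # 75% of the surface removed
scale_example = downscaled_size(2048, 1536)     # pixel count divided by 16
```

Under these assumptions, removing 75% of the surface of a 1000x1000 image leaves 500x500 pixels, and a 2048x1536 image scaled to 1/16 of its pixels becomes 512x384 before JPEG compression is applied.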



Data Summary
Provided by
National Institute for Research in Digital Science and Technology