GOT-10k
License:
CC BY-NC-SA 4.0
Overview
A large, high-diversity, one-shot database for generic object tracking in the wild
Key Features
- Large-Scale
The dataset contains more than 10,000 video segments of real-world moving objects and over 1.5 million manually labeled bounding boxes. - Generic Classes
The dataset is backboned by WordNet and it covers a majority of 560+ classes of real-world moving objects and 80+ classes of motion patterns. - One-Shot
The dataset encourages the development of generic purposed trackers by following the one-shot rule that object classes between train and test sets are zero-overlapped. - Unified Training Data
The fair comparison of deep trackers is ensured with the protocol that all approaches are using the same training data provided by the dataset. - Extra Labeling
The dataset provides extra labels including object visible ratios and motion classes as additional supervision for handling specific challenges. - Efficient Evaluation
The test set embodies 84 object classes and 32 motion classes with only 180 video segments, allowing for efficient evaluation.
Paper
Please cite this paper if GOT-10k helps your research. [PDF] [BibTex]]
Data Annotation
Each sequence folder contains 4 annotation files and 1 meta file. A brief description of these files follows (let N denotes sequence length):
- groundtruth.txt -- An N×4 matrix with each line representing object location [xmin, ymin, width, height] in one frame.
- cover.label -- An N×1 array representing object visible ratios, with levels ranging from 0~8.
- absense.label -- An binary N×1 array indicating whether an object is absent or present in each frame.
- cut_by_image.label -- An binary N×1 array indicating whether an object is cut by image in each frame.
- meta_info.ini -- Meta information about the sequence, including object and motion classes, video URL and more.
Values 0~8 in file cover.label correspond to ranges of object visible ratios: 0%, (0%, 15%], (15%~30%], (30%, 45%], (45%, 60%], (60%, 75%], (75%, 90%], (90%, 100%) and 100% respectively.
Data Format
The downloaded and extracted full dataset should follow the file structure:
|-- GOT-10k/
|-- train/
| |-- GOT-10k_Train_000001/
| | ......
| |-- GOT-10k_Train_009335/
| |-- list.txt
|-- val/
| |-- GOT-10k_Val_000001/
| | ......
| |-- GOT-10k_Val_000180/
| |-- list.txt
|-- test/
| |-- GOT-10k_Test_000001/
| | ......
| |-- GOT-10k_Test_000180/
| |-- list.txt
Instruction
Code
The benchmark offers light-weighted and compile-free toolkits written in pure Python and MATLAB. You will find tutorials and examples in the corresponding repositories.
- Python repository ($ pip install got10k)
- MATLAB repository
Citation
@article{Huang_2019,
title={GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild},
ISSN={1939-3539},
url={http://dx.doi.org/10.1109/TPAMI.2019.2957464},
DOI={10.1109/tpami.2019.2957464},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
publisher={Institute of Electrical and Electronics Engineers (IEEE)},
author={Huang, Lianghua and Zhao, Xin and Huang, Kaiqi},
year={2019},
pages={1–1}
}