graviti
Products
Resources
About us
Facial Keypoint Detection
2D Keypoints
Face
|...
License: Unknown

Overview

Detecing facial keypoints is a very challenging problem. Facial features vary greatly from
one individual to another, and even for a single individual, there is a large amount of variation
due to 3D pose, size, position, viewing angle, and illumination conditions. Computer vision
research has come a long way in addressing these difficulties, but there remain many opportunities
for improvement.This dataset is from competition of kaggle which is aimed to encourage development
of facial detection algorithm.

Data Format

Each predicted keypoint is specified by an (x,y) real-valued pair in the space of pixel indices.
There are 15 keypoints, which represent the following elements of the face:

left_eye_center,
right_eye_center, left_eye_inner_corner, left_eye_outer_corner, right_eye_inner_corner, right_eye_outer_corner,
left_eyebrow_inner_end, left_eyebrow_outer_end, right_eyebrow_inner_end, right_eyebrow_outer_end,
nose_tip, mouth_left_corner, mouth_right_corner, mouth_center_top_lip, mouth_center_bottom_lip

Left and right here refers to the point of view of the subject.

In some examples, some of the target keypoint positions are misssing (encoded as missing entries
in the csv, i.e., with nothing between two commas).

The input image is given in the last field
of the data files, and consists of a list of pixels (ordered by row), as integers in (0,255).
The images are 96x96 pixels.

Data files

  • training.csv: list of training 7049 images. Each row contains
    the (x,y) coordinates for 15 keypoints, and image data as row-ordered list of pixels.
  • test.csv:
    list of 1783 test images. Each row contains ImageId and image data as row-ordered list of pixels
  • submissionFileFormat.csv: list of 27124 keypoints to predict. Each row contains a RowId,
    ImageId, FeatureName, Location. FeatureName are "left_eye_center_x," "right_eyebrow_outer_end_y,"
    etc. Location is what you need to predict.
Data Summary
Type
Image,
Amount
--
Size
77.11MB
Provided by
Kaggle
Kaggle is the world's largest data science community with powerful tools and resources to help you achieve your data science goals.
| Amount -- | Size 77.11MB
Facial Keypoint Detection
2D Keypoints
Face
License: Unknown

Overview

Detecing facial keypoints is a very challenging problem. Facial features vary greatly from
one individual to another, and even for a single individual, there is a large amount of variation
due to 3D pose, size, position, viewing angle, and illumination conditions. Computer vision
research has come a long way in addressing these difficulties, but there remain many opportunities
for improvement.This dataset is from competition of kaggle which is aimed to encourage development
of facial detection algorithm.

Data Format

Each predicted keypoint is specified by an (x,y) real-valued pair in the space of pixel indices.
There are 15 keypoints, which represent the following elements of the face:

left_eye_center,
right_eye_center, left_eye_inner_corner, left_eye_outer_corner, right_eye_inner_corner, right_eye_outer_corner,
left_eyebrow_inner_end, left_eyebrow_outer_end, right_eyebrow_inner_end, right_eyebrow_outer_end,
nose_tip, mouth_left_corner, mouth_right_corner, mouth_center_top_lip, mouth_center_bottom_lip

Left and right here refers to the point of view of the subject.

In some examples, some of the target keypoint positions are misssing (encoded as missing entries
in the csv, i.e., with nothing between two commas).

The input image is given in the last field
of the data files, and consists of a list of pixels (ordered by row), as integers in (0,255).
The images are 96x96 pixels.

Data files

  • training.csv: list of training 7049 images. Each row contains
    the (x,y) coordinates for 15 keypoints, and image data as row-ordered list of pixels.
  • test.csv:
    list of 1783 test images. Each row contains ImageId and image data as row-ordered list of pixels
  • submissionFileFormat.csv: list of 27124 keypoints to predict. Each row contains a RowId,
    ImageId, FeatureName, Location. FeatureName are "left_eye_center_x," "right_eyebrow_outer_end_y,"
    etc. Location is what you need to predict.
0
Start building your AI now
graviti
wechat-QR
Long pressing the QR code to follow wechat official account

Copyright@Graviti