Bonk-pose Dataset

Overview

The Bonk-pose dataset is a novel, publicly available 3D bounding box estimation dataset for marine vessels, created by fusing monocular RGB images with Automatic Identification System (AIS) data. It addresses the gap in maritime datasets by providing fully annotated 3D bounding boxes for vessel pose estimation.

6D pose annotation showcase

Annotations

6D Pose Estimation

3,753 images with 3D bounding box annotations including location, vessel dimensions and orientation.
3,829 vessel annotations

Object Detection

1,000 images with 2D bounding box annotations for vessel detection.
1463 Vessels annotated in the foreground
3957 Vessels and vessel like objects annotated

6D pose estimation

Created without human annotation cost

Annotations are created by an automated data fusion approach, making time-consuming human pose annotations unnecessary.
Annotations include vessel centroid location in the real world, vessel dimensions and the directions of the vessel coordinite system axes in the real world.
The intrinsic camera matrix is provided to project this data into the image.

Automatic Annotation Quality

86.4% of vessels labeled on par with human annotations.
94.5% of annotations are deemed of acceptable quality.

Object detection

Human annotations

Object detection bounding boxes were created by humans. Vessels are annotated in five classes. The classes allow differentiation between vessels in the foreground, moving vessels and vessels part of the background.

Object classes

Ship – Full vessels in the frame.
Ship Leaving Frame – Vessels partially out of the frame.
Ship Moored – Vessels anchored and stationary in the background.
Ship Partial – Partially visible or occluded vessels.
Subvessel – Independent units within a multi-vessel setup.

Performance of MSCOCO trained detectors on the object detection dataset.

Detector performance on vessels in the foreground

YOLOX-X

mAP@0.5:95: 0.626
mAP@0.5: 0.805
AR: 0.756

YOLOX-L

mAP@0.5:95: 0.603
mAP@0.5: 0.794
AR: 0.746

YOLOX-S

mAP@0.5:95: 0.542
mAP@0.5: 0.764
AR: 0.698

YOLOv3

mAP@0.5:95: 0.379
mAP@0.5: 0.659
AR: 0.549

DETR

mAP@0.5:95: 0.451
mAP@0.5: 0.696
AR: 0.645

Def. DETR

mAP@0.5:95: 0.510
mAP@0.5: 0.737
AR: 0.703

Def. DETR 2-Stage

mAP@0.5:95: 0.585
mAP@0.5: 0.775
AR: 0.761

Cascade R-CNN r50

mAP@0.5:95: 0.511
mAP@0.5: 0.725
AR: 0.680

Detector performance on All Vessel-like Objects