Probabilistic Object Detection
1. Overview
- Contributions of this work:
- Introduce a new robotic vision task: probabilistic object detection
- Define a new performance measure for probabilistic object detection: probabilistic detection quality (PDQ)
- Evaluate PDQ
- Evaluate state-of-the-art detectors using PDQ
- Questions:
- Is probabilistic object detection necessary?
- Can/should this be applied in SLAM?
2. Motivation
- Current CV paradigm for object detection: deterministic bounding box and semantic class with confidence score/label distribution
- Evaluation metrics based on this idea of object detection, influence training
- Conventional object detectors can over confidently assign incorrect label
- Need spatial and label uncertainty estimates
data:image/s3,"s3://crabby-images/50410/5041061cc3738fcac6ad58be21372b283b715076" alt=""
3. Background & Related Work
3.1. Evaluation Metrics
- Detections classified with threshold on Intersection over Union (IoU)
- Average precision (AP): sort detections by confidence, compute area under precision-recall curve
- Mean average precision (mAP) - vary IoU threshold from 0.5:0.05:0.95, take mean of APs
data:image/s3,"s3://crabby-images/87297/87297984536a1d2124cc98a06d08e7e4fb0e35f6" alt=""
3.2. Conventional Object Detection
- Typical detector output: bounding boxes and class label scores
- CNNs rapidly improving in accuracy and speed
- Spatial and semantic uncertainty not typically provided
data:image/s3,"s3://crabby-images/45370/45370e4ac158e4aa64d3a6753309c214724007af" alt=""
3.3. Uncertainty Estimation in Object Detection
- Semantic Segmentation of Small Objects and Modeling of Uncertainty in Urban
Remote Sensing Images Using Deep Convolutional Neural Networks (Kampffmeyer, 2016)
- Monte Carlo dropout approximates Bayesian inference
- Pixel-wise classification uncertainty for semantic labels
- Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding (Kendall, 2017)
- Bayesian model outputs pixel-wise semantic segmentation with model uncertainty per class
- What Uncertainties Do We Need in Bayesian Deep
Learning for Computer Vision? (Kendall and Gal, 2017)
- Examine aleatoric (observation) and epistemic (model) uncertainty
- Evaluating Merging Strategies for Sampling-based Uncertainty
Techniques in Object Detection (Miller, 2019)
- Estimate spatial and classification uncertainties for object detection
- Use uncertainty to accept/reject detections in near open-set conditions
4. Probabilistic Object Detection
- Object detection consists of:
- Probability distribution over known labels
- Bounding box: Gaussian distributions for corner positions
data:image/s3,"s3://crabby-images/90ec1/90ec13b7f8737a8f3384e60c4a66c606634b1a97" alt=""
data:image/s3,"s3://crabby-images/1376b/1376bec7cad5b7bfad641e057b24ae380afd539d" alt=""
5. Probability-based Detection Quality (PDQ)
- Frame , evaluate detections with ground truth objects
data:image/s3,"s3://crabby-images/d75d9/d75d94ecc0e976e67542f5597f5e670a944159d5" alt=""
- Foreground Loss
data:image/s3,"s3://crabby-images/5048a/5048a51650dc46484cafd344c2f090c02c466ea2" alt=""
- Background Loss
data:image/s3,"s3://crabby-images/ef424/ef42419db3e5888c502c72b0d52124c80ce1b8e7" alt=""
data:image/s3,"s3://crabby-images/d0a1b/d0a1bb78b74299527977a38000373e85f92967f7" alt=""
- Spatial Quality
data:image/s3,"s3://crabby-images/f6123/f61230423c02b7f547515b506427bc1c031589c5" alt=""
- Label Quality
data:image/s3,"s3://crabby-images/1c5c3/1c5c3ec4aa7e786162f453a2557ebbe189e7391d" alt=""
- Pairwise PDQ
data:image/s3,"s3://crabby-images/5be8a/5be8ad5d98dfec1d90e92081432447fc17fe6987" alt=""
- PDQ Score
data:image/s3,"s3://crabby-images/926cb/926cb688485ad31e0d6e27955baec510e78e3479" alt=""
- Example PDQ scores
data:image/s3,"s3://crabby-images/72f3b/72f3bbae84aa8711951e427d1cafb8d177b02139" alt=""
data:image/s3,"s3://crabby-images/6ccc6/6ccc65d5355c02a404fff66fe8e9ba8b27aeb68b" alt=""
6. PDQ Evaluation
- Perform experiments with simulated object detectors
- For all ground truth objects, simulate detector by adding true variance to detections
- Independent of true variance, simulated detector gives observations with some random reported variance
- PDQ score maximized when reported variance best matches true variance
- PDQ directly affected by label probability, vs mAP which only affected by dominant class
data:image/s3,"s3://crabby-images/6afeb/6afebc24e8e8614bba2e9a44d10770e45e35530c" alt=""
7. Evaluation of Object Detectors
- Evaluate state-of-the-art detectors
- Convert standard detections to probabilistic detections, assuming for pixels inside bounding box, for pixels outside
- Probabilistic detectors
- MC-Dropout SSD: use MC Dropout with SSD-400 detector, BSAS clustering to estimate Gaussians
- probFRCNN: find detections suppressed by nonmax supression with IoU > 0.75, cluster and estimate Gaussians for corners
- Observations
- Probabilistic detectors perform best in PDQ
- Top performing standard detectors have poor spatial quality eg YOLOv3
- mAP measure does not penalise high false positives eg FRCNN X+FPN(0.05)
data:image/s3,"s3://crabby-images/3a7dc/3a7dc3af626b068e09fa2c3a282ab8c50a0810f8" alt=""
data:image/s3,"s3://crabby-images/22ab1/22ab1706a786c622bb8214a20f97d03fe69c6e22" alt=""
data:image/s3,"s3://crabby-images/ab2bc/ab2bcb957ba6114da4c701f5170a896d1a2f1d6b" alt=""
8. Conclusions and Future Work
- Performance metrics strongly influence how object detectors are designed and trained to perform
- New performance measure for probabilistic object detection will steer work towards better spatial uncertainty estimation
- Extend to probabilistic instance segmentation - replace bounding box spatial probability density with segmentation mask density
9. Discussion
- Probabilistic object detection better suited to robotic deployment in real world
- How should probabilistic object detection be used in SLAM?
- PDQ metric, probabilistic approach to object detection will steer object detection field towards characterising uncertainty
- Add uncertain object labels to factor graph
- Future SLAM maps should incorporate instance segmentations - these need accurate uncertainty estimates