3D

Published: 28 Jul 2021 Category: deep_learning

Papers

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Collaborative Regression of Expressive Bodies using Moderation

Hand Image Understanding via Deep Multi-Task Learning

VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild

  • arxiv: https://arxiv.org/abs/2108.02452

EventHPE: Event-based 3D Human Pose and Shape Estimation

  • intro: ICCV 2021
  • intro: University of Alberta & Shandong University & CelePixel Technology & University of Guelph & Nanyang Technological University
  • arxiv: https://arxiv.org/abs/2108.06819

Monocular 3D Object Detection

Monocular 3D Object Detection and Box Fitting Trained End-to-End Using Intersection-over-Union Loss

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Learning Depth-Guided Convolutions for Monocular 3D Object Detection

RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving

SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation

Center3D: Center-based Monocular 3D Object Detection with Joint Depth Understanding

Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

M3DSSD: Monocular 3D Single Stage Object Detector

  • intro: CVPR 2021
  • intro: Zhejiang University & Mohamed bin Zayed University of Artificial Intelligence & Inception Institute of Artificial Intelligence
  • arxiv: https://arxiv.org/abs/2103.13164

Delving into Localization Errors for Monocular 3D Object Detection

Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

Objects are Different: Flexible Monocular 3D Object Detection

Geometry-based Distance Decomposition for Monocular 3D Object Detection

  • arxiv: https://arxiv.org/abs/2104.03775

Geometry-aware data augmentation for monocular 3D object detection

  • arxiv: https://arxiv.org/abs/2104.05858

OCM3D: Object-Centric Monocular 3D Object Detection

  • arxiv: https://arxiv.org/abs/2104.06041

Exploring 2D Data Augmentation for 3D Monocular Object Detection

  • arxiv: https://arxiv.org/abs/2104.10786

Progressive Coordinate Transforms for Monocular 3D Object Detection

AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Categorical Depth Distribution Network for Monocular 3D Object Detection

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection

SGM3D: Stereo Guided Monocular 3D Object Detection

MonoDistill: Learning Spatial Features for Monocular 3D Object Detection

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection

MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer

MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection

Homography Loss for Monocular 3D Object Detection

Towards Model Generalization for Monocular 3D Object Detection

Delving into the Pre-training Paradigm of Monocular 3D Object Detection

MonoGround: Detecting Monocular 3D Objects from the Ground

Densely Constrained Depth Estimator for Monocular 3D Object Detection

Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection

DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Monocular 3D Object Detection with Depth from Motion

MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones

SEFormer: Structure Embedding Transformer for 3D Object Detection

Multi-Modal 3D Object Detection

AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection

  • intro: IJCAI 2022
  • intro: University of Science and Technology of China & Harbin Institute of Technology & SenseTime Research & The Chinese University of Hong Kong & Tsinghua University
  • arxiv: https://arxiv.org/abs/2201.06493

AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection

Monocular 3D Detection and Tracking

Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving

Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

Multi-Camera 3D Object Detection

PETR: Position Embedding Transformation for Multi-View 3D Object Detection

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Sparse4D

Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion

Sparse4D v2: Recurrent Temporal Fusion with Sparse Model

Sparse4D v3: Advancing End-to-End 3D Detection and Tracking

Multi-Camera Multiple 3D Object Tracking

Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles

SRCN3D: Sparse R-CNN 3D Surround-View Camera Object Detection and Tracking for Autonomous Driving