MP3: A Unified Model to Map, Perceive, Predict and Plan(Finalist) Multi-Modal Fusion Transformer for End-to-End Autonomous DrivingLearning by Watching
仿真
GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving (Finalist) 参考论文作者总结:https://zhuanlan.zhihu.com/p/377570852
场景生成
SceneGen: Learning to Generate Realistic Traffic Scenes Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view TransformationAdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles
地图
HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps
预测
Shared Cross-Modal Trajectory Prediction for Autonomous Driving Pedestrian and Ego-vehicle Trajectory Prediction from Monocular CameraSGCN:Sparse Graph Convolution Network for Pedestrian Trajectory PredictionInterpretable Social Anchors for Human Trajectory Forecasting in CrowdsIntrovert: Human Trajectory Prediction via Conditional 3D AttentionFocus on Local: Detecting Lane Marker from Bottom Up via Key PointDivide-and-Conquer for Lane-Aware Diverse Trajectory PredictionLaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of Dynamic Agents
场景识别
Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
感知
Exploring Intermediate Representation for Monocular Vehicle Pose Estimation Delving into Localization Errors for Monocular 3D Object DetectionRobust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar SignalsViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic Segmentation
导航
Binary TTC: A Temporal Geofence for Autonomous Navigation
运动估计
Self-Supervised Pillar Motion Learning for Autonomous Driving
RAD: Realtime and Accurate 3D Object Detection on Embedded Systems Latent Space Regularization for Unsupervised Domain Adaptation in Semantic SegmentationLearning Depth-Guided Convolutions for Monocular 3D Object DetectionAccurate 3D Object Detection using Energy-Based ModelsSemi-synthesis: A fast way to produce effective datasets for stereo matchingMulti-task Learning with Attention for End-to-end Autonomous DrivingMVFuseNet: Improving End-to-End Object Detection and Motion Forecasting through Multi-View Fusion of LiDAR DataLCCNet: LiDAR and Camera Self-Calibration using Cost Volume NetworkSoft Cross Entropy Loss and Bottleneck Tri-Cost Volume For Efficient Stereo Depth PredictionOcclusion Guided Scene Flow Estimation on 3D Point CloudsVideo Class Agnostic Segmentation Benchmark for Autonomous DrivingRethinking of Radar’s Role: A Camera-Radar Dataset and Systematic Annotator via Coordinate Alignment
Waymo Open Dataset Challenge
MOTION PREDICTION CHALLENGE:
第一名:DenseTNT Waymo Open Dataset Motion Prediction Challenge 1 st Place Solution第二名:ReCoAt A Deep Learning Framework with Attention Mechanism for Multi-Modal Motion Prediction
INTERACTION PREDICTION CHALLENGE:
第一名:Multi-Modal Interactive Agent Trajectory Prediction Using Heterogeneous Edge-Enhanced Graph Attention Network荣誉奖:AIR 2 for Interaction Prediction
REAL-TIME 3D CHALLENGE:
第一名:1 st Place Solutions to the Real-time 3D Detection and the Most Efficient Model of the Waymo Open Dataset Challenges 2021第二名:CenterPoint++ submission to the Waymo Real-time 3D Detection Challenge第三名:3rd Place Solution of Waymo Open Dataset Challenge 2021 Real-time 3D Detection Track荣誉奖:Real-time 3D Object Detection using Feature Map Flow
REAL-TIME 2D CHALLENGE:
第一名:1st Place Solution for Waymo Open Dataset Challenge 2021 Real-time 2D Detection第二名:2nd Place Solution for Waymo Open Dataset Challenge — Real-time 2D Object Detection第三名:3rd place waymo real-time 2D object detection: yolov5 self-ensemble.荣誉奖:Object Detection with Camera-wise Training荣誉奖:Waymo Open Dataset Real-Time 2D Object Detection Challenge