Ping Tan, Professor |
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT |
SkillMimic: Learning Reusable Basketball Skills from Demonstrations |
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner |
RaDe-GS: Rasterizing Depth in Gaussian Splatting |
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints |
SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D |
High-resolution Volumetric Reconstruction for Clothed Humans |
Streaming Radiance Fields for 3D Video Synthesis |
QuadTree Attention for Vision Transformers |
RAGO: Recurrent Graph Optimizer For Multiple Rotation Averaging |
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation |
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings |
OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Grasping and Manipulation |
CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution |
FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting |