Rohit Girdhar
Rohit Girdhar
Home
Projects
Light
Dark
Automatic
paper-conference
Omnivore: A Single Model for Many Visual Modalities
A single model for images, video and single-view 3D.
Rohit Girdhar
,
Mannat Singh
,
Nikhila Ravi
,
Laurens van der Maaten
,
Armand Joulin
,
Ishan Misra
PDF
Cite
Code
Ego4D: Around the World in 3,000 Hours of Egocentric Video
The largest egocentric video dataset.
Kristen Grauman
,
Andrew Westbury
,
Rohit Girdhar
,
et al
PDF
Cite
Video
Code
Detecting Twenty-thousand Classes using Image-level Supervision
Leverages image classification data to build an object detector
Xingyi Zhou
,
Rohit Girdhar
,
Armand Joulin
,
Philipp Krähenbühl
,
Ishan Misra
PDF
Cite
Colab
Code
Masked-attention Mask Transformer for Universal Image Segmentation
Single architecture state-of-the-art in instance, semantic and panoptic segmentation.
Bowen Cheng
,
Ishan Misra
,
Alexander G. Schwing
,
Alexander Kirillov
,
Rohit Girdhar
PDF
Cite
Code
3DETR: An End-to-End Transformer Model for 3D Object Detection
First Transformer based detection architecture for 3D data.
Ishan Misra
,
Rohit Girdhar
,
Armand Joulin
PDF
Cite
Code
Anticipative Video Transformer
An autoregressive video transformer architecture for action anticipation in videos.
Rohit Girdhar
,
Kristen Grauman
PDF
Cite
Code
3D Spatial Recognition without Spatially Labeled 3D
WyPR can detect and segment objects in a 3D scene without needing any spatial labels at all!
Zhongzheng Ren
,
Ishan Misra
,
Alexander G. Schwing
,
Rohit Girdhar
PDF
Cite
Slides
Code
Self-Supervised Pretraining of 3D Features on any Point-Cloud
SOTA 3D detection/segmentation results by learning contrastive representations on 3D data
Zaiwei Zhang
,
Rohit Girdhar
,
Armand Joulin
,
Ishan Misra
PDF
Cite
Code
Physical Reasoning Using Dynamics Aware Embeddings
Self-supervised representations for physical reasoning.
Eltayeb Ahmed
,
Anton Bakhtin
,
Laurens van der Maaten
,
Rohit Girdhar
PDF
Cite
Code
Forward Prediction for Physical Reasoning
Forward prediction for PHYRE benchmark.
Rohit Girdhar
,
Laura Gustafson
,
Aaron Adcock
,
Laurens van der Maaten
PDF
Cite
Code
«
»
Cite
×