Rohit Girdhar
paper-conference
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
A dataset to evaluate temporal reasoning in video models.
Rohit Girdhar, Deva Ramanan
MetaPix: Few-Shot Video Retargeting
Retargeting human actions from one video to another using only a few examples.
Jessica Lee, Deva Ramanan, Rohit Girdhar
DistInit: Learning Video Representations Without a Single Labeled Video
Distilling representations from image models to video models.
Rohit Girdhar, Du Tran, Lorenzo Torresani, Deva Ramanan
Video Action Transformer Network
Among the first applications of Transformers to video modeling. State-of-the-art results, and a close second place in the AVA Challenge at CVPR'18.
Rohit Girdhar, João Carreira, Carl Doersch, Andrew Zisserman
Detect-and-Track: Efficient Pose Estimation in Videos
A human keypoint tracking approach that ranked first in the ICCV 2017 PoseTrack keypoint tracking challenge!
Rohit Girdhar, Georgia Gkioxari, Lorenzo Torresani, Manohar Paluri, Du Tran
Attentional Pooling for Action Recognition
Among the first applications of attention for contemporary video/action understanding.
Rohit Girdhar, Deva Ramanan
ActionVLAD: Learning spatio-temporal aggregation for action classification
Aggregating visual features for action recognition.
Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell
Binge Watching: Scaling Affordance Learning from Sitcoms
Learning how humans interact with their environment by watching TV.
Xiaolong Wang, Rohit Girdhar, Abhinav Gupta
Learning a Predictable and Generative Vector Representation for Objects
A single embedding space, good for both generating and understanding 3D models.
Rohit Girdhar, David F. Fouhey, Mikel Rodriguez, Abhinav Gupta