I am a fifth-year PhD student advised by Devi Parikh at Georgia Tech. I also work closely with Dhruv Batra. Prior to starting my PhD, I was a Research Assistant at the Center for Visual Information Technology (CVIT), IIIT-Hyderabad, India, working under the joint supervision of Prof. C.V. Jawahar and Gaurav Sharma. Before moving to Hyderabad, I was a Software Engineer at Media.net in Mumbai, India. I completed my dual degree (B. Tech + M. Tech) in Computer Science and Engineering at IIT Roorkee in 2015.
My research interests lie broadly at the intersection of vision, language and actions. I am interested in training AI agents embodied in realistic simulated environments to perform tasks that require spatial, semantic and temporal understanding.
Updates
Publications
Episodic Memory Question Answering
Samyak Datta,
Sameer Dharur,
Vincent Cartillier,
Ruta Desai,
Mukul Khanna,
Dhruv Batra and
Devi Parikh
Computer Vision and Pattern Recognition (CVPR), 2022 (Oral)
Coming soon!
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Samyak Datta,
Oleksandr Maksymets,
Judy Hoffman,
Stefan Lee,
Dhruv Batra and
Devi Parikh
Conference on Robot Learning (CoRL), 2020
arxiv
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Samyak Datta,
Karan Sikka,
Anirban Roy,
Karuna Ahuja,
Devi Parikh and
Ajay Divakaran
International Conference on Computer Vision (ICCV), 2019
arxiv
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Erik Wijmans*,
Samyak Datta*,
Oleksandr Maksymets*,
Abhishek Das,
Georgia Gkioxari,
Stefan Lee,
Irfan Essa,
Devi Parikh and
Dhruv Batra
Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)
arxiv
Embodied Question Answering
Abhishek Das,
Samyak Datta,
Georgia Gkioxari,
Stefan Lee,
Devi Parikh and
Dhruv Batra
Computer Vision and Pattern Recognition (CVPR), 2018 (Oral)
arxiv / project page
Unsupervised Learning of Face Representations
Samyak Datta,
Gaurav Sharma and
C.V. Jawahar
Automatic Face and Gesture Recognition (FG), 2018 (Oral, Best Paper Award)
arxiv
Learning OpenCV 3 Application Development
Samyak Datta
Packt Publishers, ISBN-13: 9781784391454
Build, create and deploy your own computer vision applications with OpenCV.
Thanks to visualdialog.org for the webpage format.