phone: (617) 953 1244
e-mail:
French citizen, Green card holder
CURRICULUM VITAE
RESEARCH EXPERIENCE
2009 – present Vecna Robotics,
Senior Research Scientist, R&D computer vision & autonomous robotics
· Research supervision, project management, grant proposal writing
· Projects:
  - Manipulation (object detection, 3D pose estimation)
  - Human-machine interaction (human pose estimation, gesture recognition)
  - Navigation (SLAM, visual odometry, scene recognition)
  - Sensor fusion (multimodal fusion, auto-calibration)
Additional affiliations:
- MIT Media Lab (Prof. Cynthia Breazeal), project advisor in machine vision
- MIT CSAIL (Prof. Randy Davis), research affiliate
2008 – 2009 Toyota Research Institute,
Principal Research Scientist, R&D automotive safety applications
· Research in computer vision and machine learning
· Project management (2 engineers), technical advisor
2002 – 2008 MIT Computer Science and Artificial Intelligence Laboratory,
Research scientist in computer vision, human-machine interaction (with Prof.
· Research in real-time human pose estimation and gesture recognition:
  - Computer vision: markerless motion capture, tracking, object recognition and scene understanding, affective computing, multiview geometry, stereo
  - Machine learning: temporal event classification, gesture recognition
  - Multimodal interfaces (gesture & speech), ubiquitous computing
· Major supervis
  - Interactive Wall: Real-time 3D gesture-speech interface system
  - CALO (DARPA): gaze and gesture analysis for meeting understanding
  - Spaulding: gait analysis for biomedical applications
  - Ford: facial expression recognition for communication-error detection
  - EStereo: real-time stereo C++/SSE library (open source, SourceForge)
Demonstrations can be view
2001 – 2009 Consulting services in real-time computer vision, human-machine interaction: Newt Global (
2000 – 2002 MIT Artificial Intelligence Laboratory,
Postdoctoral researcher in Vision Interface Group (Prof.
Research and software development on smart environments and perceptual user interfaces
EDUCATION
1997 – 2000 Ph.D. in Computer Science at INRIA (Institut National de Recherche en Informatique et en Automatique),
1994 – 1997 Engineering degree from ENSTA (Ecole Nationale Supérieure de Techniques Avancées),
REFERENCES
Prof. Trevor J. Darrell (trevor@eecs.berkeley.edu) Associate Professor at UC Berkeley
Dr. Liu Qiao (liu.qiao@tema.toyota.com) General Manager, Technical Research Department, Toyota Research Institute.
Prof. Eric L. Grimson (welg@csail.mit.edu) Head of the Department of Electrical Engineering and Computer Science, MIT
SKILLS
Languages: C/C++, Java, Matlab, MMX/SSE assembly
Programming: GPU (CUDA), Multithreading, ROS, OpenCV, OpenGL, Qt
Motion Capture Systems: VICON
Operating Systems: Linux, Windows, ROS
RESEARCH INTERESTS
Robotics: grasping and manipulation, situational awareness, SLAM, human-machine interfaces
Computer vision: 2D/3D object recognition and pose estimation, gesture recognition, affective computing, motion estimation, multi-view geometry, stereo and autocalibration.
SERVICE
Member of IEEE Boston section
Invit
Program committee for the IEEE International Conference on Multim
Program committee for the ACM Multim
Program committee for the AAAI Symposium on Human Behavior Modeling 2009
Reviewer for CVPR, ICCV, ECCV since 2005
Reviewer for the ACM International Conference on Multimodal Interaction since 2005
Reviewer for 3DTV-CON 2009, Symposium on User Interface Software and Technology 2006, International Symposium on Wearable Computers 2007
Session chair at ICVS 2009, ICMI 2009
Frequent reviewer for major international journals:
IEEE Transactions on Pattern Analysis and Machine Intelligence, International Journal of Computer Vision, Computer Vision and Image Understanding, Journal of Computer Science and Technology, IEEE Transactions on Robotics, International Journal of Social Robotics, Intelligent Transport Systems
ADVISING
Current Master’s theses supervised
Yale Song, “A Framework for Recognizing Director Gestures on a Flight Carrier”, with Prof. R. Davis, 2010 (MIT, CSAIL)
PhD student supervised (MIT, CSAIL)
Sy Bor Wang, “Detecting Communication Errors using Facial Expressions”, with Prof. T. Darrell, 2008
Master’s theses supervised (MIT, CSAIL)
Jasper Vicenti, “Aural Imaging from 3D Vision”, 2006 (now with iRobot)
Petch Manoharn, “Reasoning Agents for an Activity Recognition System”, 2005
Teresa Ko, “Untether
PUBLICATIONS
International conferences
· D. Demirdjian and C. Varri. Recognizing Events with Temporal Random Forests. Proce
· D. Demirdjian and C. Varri. Driver Pose Estimation with a 3D Time-of-Flight Sensor. IEEE Workshop on Computational Intelligence in Vehicles and Vehicular Systems,
· D. Demirdjian and C. Varri. Recognizing Gestures for Virtual and Real World Interaction. Proce
· D. Demirdjian and S. Wang. Recognition of Temporal Events using Multiscale Bags of Features. IEEE Workshop on Computational Intelligence for Visual Intelligence, 2009
· D. Demirdjian and R. Urtasun. Patch-bas
· S. Wang, D. Demirdjian and T. Darrell. Detecting communication errors from visual cues during the system's conversational turn. Proce
· S. Wang, D. Demirdjian, H. Kjellstrom and T. Darrell. Multimodal Communication Error Detection for Driver-Car Interaction. In the 4th International Conference on Informatics in Control, Automation and Robotics, 2007.
· S. Wang, A. Quattoni, L. Morency, D. Demirdjian, T. Darrell, Hidden Conditional Random Fields for Gesture Recognition, Proce
· L. Taycher, G. Shakhnarovich, D. Demirdjian, T. Darrell, Conditional Random People: Tracking Humans with CRFs and Grid Filters, Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2006.
· D. Demirdjian, L. Taycher, G. Shakhnarovich, K. Grauman, T. Darrell, Avoiding the Streetlight Effect: Tracking by Exploring Likelihood Modes, Proceedings of the International Conference on Computer Vision, 2005.
· S. Wang, D. Demirdjian, Inferring Body Pose using Speech Content, Proceedings of the International Conference on Multimodal Interfaces, 2005.
· D. Demirdjian, Combining Geometric and View-Based Approaches for Articulated Pose Estimation, Proceedings of the European Conference on Computer Vision, 2004.
· K. Tollmar, D. Demirdjian and T. Darrell. Navigating in Virtual Environments using a Vision-bas
· D. Demirdjian. Enforcing Constraints for Human Body Tracking. In Workshop on Multi-Object Tracking, June 2003.
· K. Tollmar, D. Demirdjian and T. Darrell. Gesture + Play: Full Body Interaction for Virtual Environments. In Proceedings of CHI’03,
· D. Demirdjian and T. Darrell. 3-D Articulat
· D. Demirdjian, K. Tollmar, K. Koile, N. Checka and T. Darrell. Activity maps for location-aware computing. In Proc. IEEE Workshop on Applications of Computer Vision (WACV2002),
· D. Demirdjian and T. Darrell. Motion Estimation from Disparity Images. In Proce
· T. Darrell, D. Demirdjian, N. Checka and P. Felzenszwalb. Plan-view trajectory estimation with dense stereo background models. In Proce
· D. Demirdjian, A. Zisserman and R. Horaud. Stereo Autocalibration from One Plane. In Proce
· D. Demirdjian and R. Horaud. A Projective Framework for Scene Segmentation in the Presence of Moving Objects. In Proce
· D. Demirdjian, G. Csurka, and R. Horaud. Autocalibration in the presence of critical motions. Proce
· G. Csurka, D. Demirdjian, A. Ruf, and R. Horaud. Clos
Refereed journal articles
· D. Demirdjian, T. Ko, T. Darrell. Untether
· D. Demirdjian and T. Darrell. Using Multiple-Hypothesis Disparity Maps and Image Velocity for 3D Motion Estimation. International Journal of Computer Vision, 2001.
· D. Demirdjian and R. Horaud. Motion-Egomotion Discrimination and Motion Segmentation from Image-pair Streams. In Computer Vision and Image Understanding, 78(1), p. 53-68, April 2000.
· R. Horaud, G. Csurka, and D. Demirdjian. Stereo Calibration Using Rigid Motions, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12), December 2000, p. 1446-1452.
· G. Csurka, D. Demirdjian, and R. Horaud. Finding the Collineation between two Projective Reconstructions. Computer Vision and Image Understanding, 75(3), p. 260-269, September 1999.
PATENTS
· Real-time stereo-based body tracking system, under MIT license.
· 5 patents under submission to USPTO, in machine learning and computer vision
GRANTS
· Dexterous object manipulation, with Vecna Technologies. STTR DARPA 2010, $100k.
· Framework for pallet loading with a robotic arm, with Vecna Technologies. STTR Army 2010, $100k.
· Anomalous activity recognition, with Vecna Technologies. STTR Air Force 2010, $90k.
· Development of a markerless system for clinical analysis of gait abnormalities, with P. Bonato. CIMIT grant, 2006-2007, $50k.
· Expressive interaction for information and driving assistance, with T. Darrell, J. Glass. Ford, 2006-2007, $120k.
· 3-D tracking of human motion with stereo cameras, with T. Darrell,
· Modeling and tracking people using digital cameras for virtual studio application, with C. Schmid. MIT-France Grant, 2003-2004.
· Markerless gesture recognition system for unmanned air vehicle traffic control, with Energid. STTR, 2001.
INVITED TALKS
Imaging Technology for Driver Awareness (Executive Toyota workshop,
Pose Estimation using Local Features (International Workshop on Object Recognition,
Visual Perception (Toyota Motor Company,
Autocalibration of Cameras in Meeting Environments (INRIA, France, 2006)
Stereo-bas
Physical Awareness for Multimodal Human-Computer Interaction:
·
· NTT,
· ATR,
Constraining Articulat
MISCELLANEOUS
Open source software
Creator of EStereo, an open-source C++/SSE library for real-time stereo map estimation (repository on sourceforge.net, ~300 downloads/month).
Community Service