Multi-View Latent Variable Discriminative Models for Action Recognition

Presented at CVPR 2012

Yale Song1, Louis-Philippe Morency2, Randall Davis1
1MIT Computer Science and Artificial Intelligence Laboratory
2USC Institute for Creative Technology


Figure 1. Graphical representations of multi-view latent variable discriminative models.


Abstract

Many human action recognition tasks involve data that can be factorized into multiple views such as body postures and hand shapes. These views often interact with each other over time, providing important cues to understanding the action. We present multi-view latent variable discriminative models that jointly learn both view-shared and view-specific sub-structures to capture the interaction between views. Knowledge about the underlying structure of the data is formulated as a multi-chain structured latent conditional model, explicitly learning the interaction between multiple views using disjoint sets of hidden variables in a discriminative manner. The chains are tied using a predetermined topology that repeats over time. We present three topologies --linked, coupled, and linked-coupled-- that differ in the type of interaction between views that they model. We evaluate our approach on both segmented and unsegmented human action recognition tasks, using the ArmGesture, the NATOPS, and the ArmGesture-Continuous data. Experimental results show that our approach outperforms previous state-of-the-art action recognition models.


Publications


Codes


Datasets


References

[1] Ariadna Quattoni, Sy Bor Wang, Louis-Philippe Morency, Michael Collins, Trevor Darrell: Hidden Conditional Random Fields. TPAMI 2007
[2] Louis-Philippe Morency, Ariadna Quattoni, Trevor Darrell: Latent-Dynamic Discriminative Models for Continuous Gesture Recognition. CVPR 2007
[3] Yale Song, Louis-Philippe Morency, Randall Davis: Action Recognition by Hierarchical Sequence Summarization. CVPR 2013


Acknowledgements

This work was funded by the Office of Naval Research Science of Autonomy program #N000140910625, the National Science Foundation #IIS-1018055, and the U.S. Army Research, Development, and Engineering Command (RDECOM).

Last update: May 30, 2012