This work proposes a novel convolutional neural network approach to multi-view dynamic facial action unit detection, a fine-grained recognition problem. The task of predicting the presence or absence of a specific action unit in a still image of a human face is formulated as holistic classification.