The eNTERFACE'05 EMOTION Database
The eNTERFACE'05 EMOTION Database is an audio-visual emotion database that can be
used as a reference database for testing and evaluating video, audio or joint
audio-visual emotion recognition algorithms. Additional uses may include the
evaluation of algorithms performing other multimodal signal processing tasks, such
as multimodal person identification or audio-visual speech recognition. It was
designed during the eNTERFACE'05 workshop, in the framework of a project on
multimodal caricature [1,2].
CONTENTS
The final version of the database thus contains 42 subjects, coming from 14
different nationalities. Among the 42 subjects, a percentage of 81% were men, while
the remaining 19% were women. A percentage of 31% of the total set wore glasses,
while 17% of the subjects had a beard.
The recordings lasted for two weeks. All the experiments were driven in English.
Each subject was told to listen to six successive short stories, each of them
eliciting a particular emotion. They had then to react to each of the situations and
two human experts judged whether the reaction expressed the emotion in an
unambiguous way. If this was the case, the sample was added to the database. If not,
it was discarded.
TECHNICAL ASPECTS
The database was recorded using a standard mini-DV digital video camera. The
resolution of the camera was 800.000 pixels. The recording of the speech signal was
realized through the use of a high-quality microphone, specially conceived for
speech recordings. The microphone was situated roughly 30cm below the subject's
mouth, outside of the camera field.
The background consists of a monochromatic dark gray panel that covered the entire
area behind the subject, to allow easier face detection and tracking. Illumination
was made constant through the use of a set of occultation panels, placed in front of
every window. Lighting material consisted of a strong spotlight (500 watts),
situated right behind the camera, facing the user. The spotlight was covered with a
semi-transparent plastic film to soften the light, decrease the shadows and protect
the subject from the very intense source of light. Two additional directional spots
were situated between the subject and the background panel, so as to cancel shadows
produced on the background panel by the main spotlight. The two additional spots
were also covered with a semi-transparent plastic film.
The doors remained closed at all time to prevent external sound to interfere with
the experiments.
LICENSE
This database is available under MIT license conditions (the terms of this very open
license are provided with the database).
DOWNLOAD
Please note that the database is very big (0.8 GB).
Click HERE to proceed.
[1] : Martin, O., et al., "Multimodal Caricatural Mirror", in in Proc. eNTERFACE
2005, July 18th-August 12th, Mons, Belgium - available on the eNTERFACE '05 website
: http://www.tcts.fpms.ac.be/enterface//enterface05/docs/results/reports/project2.pdf.
[2] : O. Martin, I. Kotsia, B. Macq and I. Pitas : "The eNTERFACE'05 Audio-Visual
Emotion Database", Proceedings of the First IEEE Workshop on Multimedia Database
Management, Atlanta, April 2006.
|