3D Lip Event Detection via Interframe Motion Divergence at Multiple Temporal Resolutions

Jie Zhang, Robert B Fisher

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The lip is a dominant dynamic facial unit when a person is speaking. Detecting lip events is beneficial to speech analysis and support for the hearing impaired. This paper proposes a 3D lip event detection pipeline that automatically determines the lip events from a 3D speaking lip sequence. We define a motion divergence measure using 3D lip landmarks to quantify the interframe dynamics of a 3D speaking lip. Then, we cast the interframe motion detection in a multi-temporal-resolution framework that allows the detection to be applicable to different speaking speeds. The experiments on the S3DFM Dataset investigate the overall 3D lip dynamics based on the proposed motion divergence. The proposed 3D pipeline is able to detect opening and closing lip events across 100 sequences, achieving a state-of-the-art performance.
Original languageEnglish
Title of host publicationProceedings of the 9th International Conference on 3D Vision (3DV 2021)
Pages423-431
Number of pages9
ISBN (Electronic)978-1-6654-2688-6
DOIs
Publication statusPublished - 6 Jan 2022
Event9th International Conference on 3D Vision - Virtual, London, United Kingdom
Duration: 1 Dec 20213 Dec 2021
https://3dv2021.surrey.ac.uk/

Publication series

Name2021 International Conference on 3D Vision (3DV)
PublisherIEEE
ISSN (Print)2378-3826
ISSN (Electronic)2475-7888

Conference

Conference9th International Conference on 3D Vision
Abbreviated title3DV 2021
Country/TerritoryUnited Kingdom
CityLondon
Period1/12/213/12/21
Internet address

Fingerprint

Dive into the research topics of '3D Lip Event Detection via Interframe Motion Divergence at Multiple Temporal Resolutions'. Together they form a unique fingerprint.

Cite this