Abstract
The lip is a dominant dynamic facial unit when a person is speaking. Detecting lip events is beneficial to speech analysis and support for the hearing impaired. This paper proposes a 3D lip event detection pipeline that automatically determines the lip events from a 3D speaking lip sequence. We define a motion divergence measure using 3D lip landmarks to quantify the interframe dynamics of a 3D speaking lip. Then, we cast the interframe motion detection in a multi-temporal-resolution framework that allows the detection to be applicable to different speaking speeds. The experiments on the S3DFM Dataset investigate the overall 3D lip dynamics based on the proposed motion divergence. The proposed 3D pipeline is able to detect opening and closing lip events across 100 sequences, achieving a state-of-the-art performance.
Original language | English |
---|---|
Title of host publication | Proceedings of the 9th International Conference on 3D Vision (3DV 2021) |
Pages | 423-431 |
Number of pages | 9 |
ISBN (Electronic) | 978-1-6654-2688-6 |
DOIs | |
Publication status | Published - 6 Jan 2022 |
Event | 9th International Conference on 3D Vision - Virtual, London, United Kingdom Duration: 1 Dec 2021 → 3 Dec 2021 https://3dv2021.surrey.ac.uk/ |
Publication series
Name | 2021 International Conference on 3D Vision (3DV) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2378-3826 |
ISSN (Electronic) | 2475-7888 |
Conference
Conference | 9th International Conference on 3D Vision |
---|---|
Abbreviated title | 3DV 2021 |
Country/Territory | United Kingdom |
City | London |
Period | 1/12/21 → 3/12/21 |
Internet address |