Abstract
In this paper, we develop a real-time lip-synch system that animates a 2-D avatar's lip motion in synch with an incoming speech utterance. To realize real-time operation, we bound the processing time by invoking a merge-and-split procedure that performs coarse-to-fine phoneme classification. At each stage of the classification, we apply a support vector machine (SVM) to constrain the computational load while attaining the desired accuracy. The coarse-to-fine phoneme classification proceeds in two stages of feature extraction: each speech frame is first classified into three classes of lip opening using MFCC features, and then further refined into a detailed lip shape using formant information. We implemented the system with a 2-D lip animation, demonstrating the effectiveness of the proposed two-stage procedure in accomplishing the real-time lip-synch task.
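The coarse-to-fine scheme described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature dimensions, class counts, and synthetic data are assumptions made for the example, and scikit-learn's `SVC` stands in for whatever SVM implementation the system actually used.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Synthetic stand-ins for per-frame acoustic features (assumed shapes):
# 13-dim MFCC vectors for the coarse stage, 3 formant frequencies for the fine stage.
n = 300
mfcc = rng.normal(size=(n, 13))
formants = rng.normal(size=(n, 3))
coarse_labels = rng.integers(0, 3, size=n)   # 3 lip-opening classes (from the abstract)
fine_labels = rng.integers(0, 4, size=n)     # finer lip-shape classes (count is illustrative)

# Stage 1: coarse SVM over MFCC features, deciding the lip-opening class.
coarse_svm = SVC(kernel="rbf").fit(mfcc, coarse_labels)

# Stage 2: one refined SVM per coarse class, trained on formant features only.
fine_svms = {}
for c in range(3):
    mask = coarse_labels == c
    fine_svms[c] = SVC(kernel="rbf").fit(formants[mask], fine_labels[mask])

def classify_frame(mfcc_vec, formant_vec):
    """Coarse-to-fine: pick a lip-opening class, then refine the lip shape."""
    c = int(coarse_svm.predict(mfcc_vec.reshape(1, -1))[0])
    s = int(fine_svms[c].predict(formant_vec.reshape(1, -1))[0])
    return c, s

c, s = classify_frame(mfcc[0], formants[0])
```

Running the fine classifier only within the coarse class keeps each SVM small, which is how a coarse-to-fine cascade bounds the per-frame computation for real-time use.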