Publication | Closed Access
Research on Visual Speech Feature Extraction
23
Citations
7
References
2009
Year
Dct+pca AlgorithmEngineeringBiometricsFeature ExtractionSpeech RecognitionImage AnalysisPattern RecognitionRobust Speech RecognitionLip Contour InformationVoice RecognitionHealth SciencesComputer VisionSpeech AnalysisDct CoefficientsSpeech CommunicationSpeech ProcessingSpeech InputSpeech PerceptionLinguisticsSpeaker Recognition
To solve the problem of extracting visual feature in lipreading, a new method based on DCT+LDA is proposed in this paper. First, region of interest (ROI) is located based on the lip contour information, and then discrete cosine transformation (DCT) is performed on ROI. In order to extract the most discriminative feature vectors from the DCT coefficients and further reduce the feature dimensionality, linear discriminative analysis (LDA) is then introduced. Experiments were performed on speaker-dependent (SD) and speaker-independent (SI) bimodal database respectively, the experimental results showed that this algorithm achieved high recognition accuracy than traditional Zig-Zag DCT coefficients selection method and DCT+PCA algorithm. finally, this algorithm is also justified on our real-time lipreading platform.
| Year | Citations | |
|---|---|---|
Page 1
Page 1