In ACS Applied Materials & Interfaces; h5-index 147.0
Silent communication based on biosignals from facial muscles requires accurate detection of their directional movement, and therefore optimal positioning of a minimal number of sensors, to achieve high speech recognition accuracy with minimal person-to-person variation. Previous approaches based on electromyogram or pressure sensors have been ineffective at detecting the directional movement of facial muscles. In this study, high-performance strain sensors are therefore used to detect x- and y-axis strain separately. Directional strain distribution data of the facial muscles are obtained by three-dimensional digital image correlation, and deep learning analysis is used to identify the optimal positions for the directional strain sensors. The recognition system, with four directional strain sensors conformably attached to the face, achieves silent vowel recognition with 85.24% accuracy, and 76.95% even for completely unobserved subjects. These results show that detecting the directional strain distribution at the optimal facial points will be the key enabling technology for highly accurate silent speech recognition.
Yoo Hyunjun, Kim Eunji, Chung Jong Won, Cho Hyeon, Jeong Sujin, Kim Heeseung, Jang Dongju, Kim Hayun, Yoon Jinsu, Lee Gae Hwang, Kang Hyunbum, Kim Joo-Young, Yun Youngjun, Yoon Sungroh, Hong Yongtaek
2022-Nov-22
deep learning, facial strain distribution, silent speech recognition, soft device, strain sensor, three-dimensional digital image correlation