Dataset Information

Pushing the limits of remote RF sensing by reading lips under the face mask.


ABSTRACT: Lip-reading, the recognition of speech from lip movements, has become an important research challenge in recent years. Most lip-reading technologies developed so far are camera-based and require video recording of the target. These technologies have well-known limitations: they are sensitive to occlusion and ambient lighting, and they raise serious privacy concerns. Furthermore, vision-based technologies are not useful for multi-modal hearing aids in the coronavirus (COVID-19) environment, where face masks have become the norm. This paper aims to overcome the fundamental limitations of camera-based systems by proposing a radio frequency (RF) based lip-reading framework that can read lips under face masks. The framework employs Wi-Fi and radar technologies as enablers of RF sensing based lip-reading. A dataset comprising the vowels A, E, I, O, U and an empty class (static/closed lips) is collected with both technologies while a face mask is worn. The collected data is used to train machine learning (ML) and deep learning (DL) models. A high classification accuracy of 95% is achieved on the Wi-Fi data using neural network (NN) models. Similar accuracy is achieved by the VGG16 deep learning model on the collected radar-based dataset.

SUBMITTER: Hameed H 

PROVIDER: S-EPMC9452506 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature

Publications

Pushing the limits of remote RF sensing by reading lips under the face mask.

Hameed, Hira; Usman, Muhammad; Tahir, Ahsen; Hussain, Amir; Abbas, Hasan; Cui, Tie Jun; Imran, Muhammad Ali; Abbasi, Qammer H.

Nature Communications, 2022-09-07, issue 1


Similar Datasets

| S-EPMC4650618 | biostudies-other
| S-EPMC7928787 | biostudies-literature
| S-EPMC10700517 | biostudies-literature
| S-EPMC10322844 | biostudies-literature
| S-EPMC8294878 | biostudies-literature
| S-EPMC7543707 | biostudies-literature
| S-EPMC10423210 | biostudies-literature
| S-EPMC7361913 | biostudies-literature
| S-EPMC3866607 | biostudies-other
| S-EPMC6385557 | biostudies-literature