Deep-HR : fast heart rate estimation from face video under realistic conditions

Sabokrou, Mohammad; Pourreza, Masoud; Li, Xiaobai; Fathy, Mahmood; Zhao, Guoying

Deep-HR : fast heart rate estimation from face video under realistic conditions

Sabokrou, Mohammad; Pourreza, Masoud; Li, Xiaobai; Fathy, Mahmood; Zhao, Guoying (2021-07-31)

Avaa tiedosto

nbnfi-fe2022021018477.pdf (5.685Mt)

nbnfi-fe2022021018477_meta.xml (38.47Kt)

nbnfi-fe2022021018477_solr.xml (32.02Kt)

Lataukset:

URL:

https://doi.org/10.1016/j.eswa.2021.115596

Sabokrou, Mohammad

Pourreza, Masoud

Li, Xiaobai

Fathy, Mahmood

Zhao, Guoying

Elsevier

31.07.2021

Sabokrou, M., Pourreza, M., Li, X., Fathy, M., & Zhao, G. (2021). Deep-HR: Fast heart rate estimation from face video under realistic conditions. Expert Systems with Applications, 186, 115596. https://doi.org/10.1016/j.eswa.2021.115596

https://creativecommons.org/licenses/by-nc-nd/4.0/
© 2021 Published by Elsevier Ltd. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/.
https://creativecommons.org/licenses/by-nc-nd/4.0/

doi:https://doi.org/10.1016/j.eswa.2021.115596

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022021018477

Tiivistelmä

Abstract

This paper presents a novel method for remote heart rate (HR) estimation. Recent studies have proved that blood pumping by the heart is highly correlated to the intense color of face pixels, and surprisingly can be utilized for remote HR estimation. Researchers successfully proposed several methods for this task, but making it work in realistic situations is still a challenging problem in computer vision community. Furthermore, learning to solve such a complex task on a dataset with very limited annotated samples is not reasonable. Consequently, researchers do not prefer to use the deep learning approaches for this problem. In this paper, we propose a simple yet efficient approach to benefit the advantages of the Deep Neural Network (DNN) by simplifying HR estimation from a complex task to learning from very correlated representation to HR. Inspired by previous work, we learn a component called Front-End (FE) to provide a discriminative representation of face videos, afterward a light deep regression auto-encoder as Back-End (BE) is learned to map the FE representation to HR. Regression task on the informative representation is simple and could be learned efficiently on limited training samples. Beside of this, to be more accurate and work well on low-quality videos, two deep encoder–decoder networks are trained to refine the output of FE. We also introduce a challenging dataset (HR-D) to show that our method can efficiently work in realistic conditions. Experimental results on HR-D and MAHNOB datasets confirm that our method could run as a real-time method and estimate the average HR better than state-of-the-art ones.

Kokoelmat

Avoin saatavuus [32010]

Ellei muuten mainita, aineiston lisenssi on https://creativecommons.org/licenses/by-nc-nd/4.0/