Contrastive context-aware learning for 3D high-fidelity mask face presentation attack detection

Liu, Ajian; Zhao, Chenxu; Yu, Zitong; Wan, Jun; Su, Anyang; Liu, Xing; Tan, Zichang; Escalera, Sergio; Xing, Junliang; Liang, Yanyan; Guo, Guodong; Lei, Zhen; Li, Stan Z.; Zhang, Du

URL:

https://doi.org/10.1109/tifs.2022.3188149

Institute of Electrical and Electronics Engineers

2022-07-04

A. Liu et al., "Contrastive Context-Aware Learning for 3D High-Fidelity Mask Face Presentation Attack Detection," in IEEE Transactions on Information Forensics and Security, vol. 17, pp. 2497-2507, 2022, doi: 10.1109/TIFS.2022.3188149

https://rightsstatements.org/vocab/InC/1.0/
© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

The permanent address of this publication is
https://urn.fi/URN:NBN:fi-fe2023040635314

Abstract

Face presentation attack detection (PAD) is essential to secure face recognition systems, primarily against high-fidelity mask attacks. Most existing 3D mask PAD benchmarks suffer from several drawbacks: 1) a limited number of mask identities, sensor types, and videos overall; 2) low-fidelity facial masks. Basic deep models and remote photoplethysmography (rPPG) methods achieve acceptable performance on these benchmarks but remain far from the needs of practical scenarios. To bridge the gap to real-world applications, we introduce a large-scale High-Fidelity Mask dataset, namely HiFiMask. Specifically, a total of 54,600 videos are recorded from 75 subjects with 225 realistic masks using 7 new kinds of sensors. Along with the dataset, we propose a novel Contrastive Context-aware Learning (CCL) framework. CCL is a new training methodology for supervised PAD tasks, which learns by accurately leveraging rich contexts (e.g., subjects, mask material, and lighting) among pairs of live faces and high-fidelity mask attacks. Extensive experimental evaluations on HiFiMask and three additional 3D mask datasets demonstrate the effectiveness of our method. The codes and dataset will be released soon.
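
The abstract does not specify the CCL loss or its pair-sampling rules, so the following is only a minimal sketch of the general idea of contrastive learning over context-sharing pairs. It assumes the classic pairwise contrastive loss with a margin, and it assumes that "context" means only the subject identity. The names `pair_contrastive_loss`, `make_context_pairs`, the sample fields, and the toy embeddings are all hypothetical and are not taken from the paper or its code.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pair_contrastive_loss(emb_a, emb_b, same_class, margin=1.0):
    """Classic pairwise contrastive loss (an assumption, not the CCL loss).

    Pairs of the same class (live/live or mask/mask) are pulled together;
    pairs of different classes (live/mask) are pushed at least `margin` apart.
    """
    d = euclidean(emb_a, emb_b)
    if same_class:
        return d ** 2
    return max(0.0, margin - d) ** 2


def make_context_pairs(samples):
    """Build pairs only among samples that share a context.

    Here the shared context is the subject identity (hypothetical choice);
    mask material or lighting could be used in the same way.
    """
    pairs = []
    for i, a in enumerate(samples):
        for b in samples[i + 1:]:
            if a["subject"] == b["subject"]:
                pairs.append((a["emb"], b["emb"], a["is_live"] == b["is_live"]))
    return pairs


# Toy 2-D embeddings, invented purely for illustration.
samples = [
    {"subject": 1, "is_live": True, "emb": [0.0, 0.0]},
    {"subject": 1, "is_live": True, "emb": [0.3, 0.4]},
    {"subject": 1, "is_live": False, "emb": [0.6, 0.8]},
    {"subject": 2, "is_live": False, "emb": [5.0, 5.0]},
]

pairs = make_context_pairs(samples)
losses = [pair_contrastive_loss(a, b, same) for a, b, same in pairs]
mean_loss = sum(losses) / len(losses)
print(len(pairs), mean_loss)
```

In this toy setup the subject-2 sample forms no pair because nothing else shares its context, which illustrates why restricting pairs to a shared context forces the loss to focus on the live-versus-mask difference rather than on identity differences.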
