Beyond vanilla convolution : random pixel difference convolution for face perception

Liu, Wenzhe; Su, Zhuo; Liu, Li

URL:

https://doi.org/10.1109/ACCESS.2021.3117955

Institute of Electrical and Electronics Engineers

04.10.2021

W. Liu, Z. Su and L. Liu, "Beyond Vanilla Convolution: Random Pixel Difference Convolution for Face Perception," in IEEE Access, vol. 9, pp. 139248-139259, 2021, doi: 10.1109/ACCESS.2021.3117955

© The Authors 2021. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.

The permanent address of this publication is
https://urn.fi/URN:NBN:fi-fe2021111154658

Abstract

Face perception is an essential problem in pattern recognition; it includes Face Recognition (FR), Facial Expression Recognition (FER), and Race Categorization (RC). Although handcrafted features perform well on face images, Deep Convolutional Neural Networks (DCNNs) have recently brought new vitality to the field. Vanilla DCNNs are powerful at learning high-level semantic features but weak at capturing low-level changes in illumination, intensity, and texture, which are regarded as key traits in facial processing and feature extraction and which are, by contrast, the strength of human-designed feature descriptors. To combine the best of both worlds, we propose a novel Random Pixel Difference Convolution (RPDC), an efficient alternative to the vanilla convolutional layers in standard CNNs that promotes the extraction of discriminative and diverse facial features. Using an efficient RPDC found by search, we build S-RaPiDiNet and obtain promising results in extensive experiments: compared with a baseline network built on vanilla convolution, it improves FR by ≈0.5%, FER by over 1%, and RC by 0.25%–3%, showing the strong generalization of RPDC.
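
The abstract does not spell out how RPDC is computed, so the following is only a rough illustration of the general idea of a pixel difference convolution, not the authors' implementation. It assumes that each kernel weight multiplies the difference between two randomly chosen pixels of the local patch instead of a single pixel intensity. The function names `random_pixel_pairs` and `pixel_difference_conv2d`, the 3x3 patch size, and the pair-sampling scheme are all hypothetical choices made for this sketch.

```python
import random


def random_pixel_pairs(kernel_size=3, num_pairs=9, seed=0):
    """Sample pairs of distinct positions inside a kernel_size x kernel_size patch.

    Assumption for this sketch: pairs are drawn uniformly at random once
    and then reused at every spatial location.
    """
    rng = random.Random(seed)
    positions = [(r, c) for r in range(kernel_size) for c in range(kernel_size)]
    return [tuple(rng.sample(positions, 2)) for _ in range(num_pairs)]


def pixel_difference_conv2d(image, weights, pairs, kernel_size=3):
    """Single-channel, stride-1, unpadded pixel difference convolution.

    Each output value is sum_k weights[k] * (patch[p_k] - patch[q_k]),
    where (p_k, q_k) is the k-th sampled position pair.
    """
    height, width = len(image), len(image[0])
    output = []
    for top in range(height - kernel_size + 1):
        row = []
        for left in range(width - kernel_size + 1):
            total = 0.0
            for w, ((pr, pc), (qr, qc)) in zip(weights, pairs):
                diff = image[top + pr][left + pc] - image[top + qr][left + qc]
                total += w * diff
            row.append(total)
        output.append(row)
    return output


if __name__ == "__main__":
    pairs = random_pixel_pairs()
    weights = [0.5] * len(pairs)
    image = [[r * 5 + c for c in range(5)] for r in range(5)]
    for out_row in pixel_difference_conv2d(image, weights, pairs):
        print(out_row)
```

Because every term is a difference of two pixels, adding a constant brightness offset to the whole image leaves the output of this sketch unchanged, which is one way such an operator can respond to local intensity and texture changes rather than to absolute intensity.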
