Rotation invariant local binary convolution neural networks

Zhang, Xin; Liu, Li; Xie, Yuxiang; Chen, Jie; Wu, Lingda; Pietikäinen, Matti



URL:

https://doi.org/10.1109/ICCVW.2017.146


Institute of Electrical and Electronics Engineers

23.01.2018

X. Zhang, L. Liu, Y. Xie, J. Chen, L. Wu and M. Pietikäinen, "Rotation Invariant Local Binary Convolution Neural Networks," 2017 IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, 2017, pp. 1210-1219. doi: 10.1109/ICCVW.2017.146

https://rightsstatements.org/vocab/InC/1.0/
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.



The permanent address of this publication is
https://urn.fi/URN:NBN:fi-fe202003057306

Abstract

Although Convolution Neural Networks (CNNs) are unprecedentedly powerful at learning effective representations, they remain parameter-expensive and are limited by their inability to handle orientation transformations of the input data. To alleviate these problems, we propose a deep architecture named Rotation Invariant Local Binary Convolution Neural Network (RI-LBCNN). RI-LBCNN is a deep convolution neural network built from Local Binary orientation Modules (LBoMs). An LBoM is a three-layer steerable module composed of two parts (two layers in the first part and one in the second), and it combines Local Binary Convolution (LBC) [19] with Active Rotating Filters (ARFs) [38]. By replacing the basic convolution layers in a DCNN with LBoMs, RI-LBCNN can be easily implemented, and LBoMs can be naturally inserted into other popular models without any extra modification to the optimisation process. The proposed RI-LBCNN can therefore be trained end to end. Extensive experiments show that replacing convolution layers with the proposed LBoMs leads to a significant reduction in learnable parameters and a reasonable performance improvement on three benchmarks.
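The parameter saving mentioned in the abstract can be illustrated with a rough sketch of the Local Binary Convolution idea from [19]: a bank of fixed, sparse filters with entries in {-1, 0, +1} is followed by a learnable 1x1 convolution, so only the 1x1 weights are trained. The function names, the `sparsity` parameter (taken here as the fraction of non-zero entries) and the channel sizes below are illustrative assumptions, not values from the paper. Biases are ignored, and the rotation part (ARFs) is not modelled.

```python
import random


def make_binary_filters(num_filters, in_channels, k, sparsity, seed=0):
    """Build fixed (non-learnable) filters with entries in {-1, 0, +1}.

    `sparsity` is assumed here to be the probability that an entry is non-zero.
    Each filter is returned as a flat list of in_channels * k * k weights.
    """
    rng = random.Random(seed)
    filters = []
    for _ in range(num_filters):
        weights = [
            rng.choice((-1, 1)) if rng.random() < sparsity else 0
            for _ in range(in_channels * k * k)
        ]
        filters.append(weights)
    return filters


def standard_conv_params(in_channels, out_channels, k):
    """Learnable weights of an ordinary k x k convolution (bias ignored)."""
    return in_channels * out_channels * k * k


def lbc_learnable_params(num_binary_filters, out_channels):
    """Learnable weights of an LBC layer: only the 1x1 convolution that
    linearly combines the responses of the fixed binary filters."""
    return num_binary_filters * out_channels


# Hypothetical layer sizes, chosen only for illustration.
in_ch, out_ch, k, n_binary = 64, 64, 3, 64
fixed_bank = make_binary_filters(n_binary, in_ch, k, sparsity=0.5)
dense = standard_conv_params(in_ch, out_ch, k)
lbc = lbc_learnable_params(n_binary, out_ch)
print(f"standard conv: {dense} learnable weights")
print(f"LBC layer:     {lbc} learnable weights ({dense // lbc}x fewer)")
```

With as many binary filters as input channels and 3x3 kernels, the learnable weight count drops by a factor of k*k = 9 in this sketch. The savings reported for RI-LBCNN itself depend on the LBoM design described in the paper.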
