In the last decade deep learning techniques have strongly influenced many aspects of computational vision. Many difficult vision tasks can now be performed by deploying a properly tailored and trained deep network. The enthusiasm for deep learning is unfortunately paired by the present lack of a clear understanding of how they work and why they provide such brilliant performance. The same applies to biometric systems. Deep learning has been successfully applied to several biometric recognition tasks, including face recognition. VGG-face is possibly the first deep convolutional network designed to perform face recognition, obtaining unsurpassed performance at the time it was firstly proposed. Over the last years, several and more complex deep convolutional networks, trained on very large, mainly private, datasets, have been proposed still elevating the performance bar also on quite challenging public databases, such as the Janus IJB-A and IJB-B. Despite of the progress in the development of such networks, and the advance in the learning algorithms, the insight on these networks is still very limited. For this reason, in this paper we analyse a biologically-inspired network based on the HMAX model, not with the aim of pushing the recognition performance further, but to better understand the representation space produced by including the retino-cortical mapping performed by the log-polar image resampling.
Foveated Vision for Deepface Recognition / Khellat-Kihel, S.; Lagorio, A.; Tistarelli, M.. - 11896:(2019), pp. 31-41. (Intervento presentato al convegno 24th Iberoamerican Congress on Pattern Recognition, CIARP 2019 tenutosi a cub nel 2019) [10.1007/978-3-030-33904-3_3].
Foveated Vision for Deepface Recognition
Khellat-Kihel S.
;Lagorio A.;Tistarelli M.
2019-01-01
Abstract
In the last decade deep learning techniques have strongly influenced many aspects of computational vision. Many difficult vision tasks can now be performed by deploying a properly tailored and trained deep network. The enthusiasm for deep learning is unfortunately paired by the present lack of a clear understanding of how they work and why they provide such brilliant performance. The same applies to biometric systems. Deep learning has been successfully applied to several biometric recognition tasks, including face recognition. VGG-face is possibly the first deep convolutional network designed to perform face recognition, obtaining unsurpassed performance at the time it was firstly proposed. Over the last years, several and more complex deep convolutional networks, trained on very large, mainly private, datasets, have been proposed still elevating the performance bar also on quite challenging public databases, such as the Janus IJB-A and IJB-B. Despite of the progress in the development of such networks, and the advance in the learning algorithms, the insight on these networks is still very limited. For this reason, in this paper we analyse a biologically-inspired network based on the HMAX model, not with the aim of pushing the recognition performance further, but to better understand the representation space produced by including the retino-cortical mapping performed by the log-polar image resampling.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.