Single-shot face and landmarks detector

Vizilter Yuri Valentinovich; Gorbatsevich Vladimir Sergeevich; Moiseenko Anastasiia Sergeevna

doi:10.18287/2412-6179-CO-674

Journal: Computer Optics

Section: Image processing, pattern recognition

Issue: Vol. 44, No. 4, 2020.

Free access

Facial landmark detection is an important subtask in a number of face biometrics problems. In face recognition systems, the biometric template is built from a pre-aligned (normalized) face image, and the normalization stage requires locating the main facial keypoints. The central issue in this task is the trade-off between the accuracy of the landmark detector and its speed. This paper proposes a single-shot face and landmark detector based on deep convolutional neural networks (CNNs) that runs in real time and achieves high quality on several well-known test sets (AFLW2000, COFW, Menpo2D). The proposed detector builds on the single-shot object detector SSD, which is established as an algorithm that combines high speed with high detection quality. ShuffleNet V2 is used as the backbone CNN architecture. An important feature of the proposed algorithm is that face detection and keypoint localization are performed in a single forward pass of the CNN, which saves considerable time at the deployment stage. This multitask formulation also reduces the error rate in landmark detection, which in turn improves the quality of the final face recognition algorithm.
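The single-pass idea in the abstract can be illustrated with a minimal sketch of how the outputs of a joint SSD-style head might be decoded. This is not the authors' implementation: the anchor parameterization (center offsets scaled by anchor size, log-scale width and height), the landmark encoding, the 0.5 score threshold, and all function names are assumptions made for illustration. Non-maximum suppression, which a real detector would normally apply afterwards, is omitted.

```python
import math
from typing import Dict, List, Sequence, Tuple

# Hypothetical anchor format: (center_x, center_y, width, height), normalized to [0, 1].
Anchor = Tuple[float, float, float, float]


def decode_box(anchor: Anchor, deltas: Sequence[float]) -> Tuple[float, float, float, float]:
    """Convert SSD-style offsets (dx, dy, dw, dh) into a corner box (x1, y1, x2, y2)."""
    acx, acy, aw, ah = anchor
    dx, dy, dw, dh = deltas
    cx = acx + dx * aw        # center shift is expressed in anchor widths/heights
    cy = acy + dy * ah
    w = aw * math.exp(dw)     # size is predicted as a log-scale factor
    h = ah * math.exp(dh)
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)


def decode_landmarks(anchor: Anchor, offsets: Sequence[float]) -> List[Tuple[float, float]]:
    """Convert flat (dx0, dy0, dx1, dy1, ...) offsets into absolute (x, y) keypoints."""
    acx, acy, aw, ah = anchor
    return [
        (acx + offsets[i] * aw, acy + offsets[i + 1] * ah)
        for i in range(0, len(offsets), 2)
    ]


def decode_detections(
    anchors: Sequence[Anchor],
    scores: Sequence[float],
    box_deltas: Sequence[Sequence[float]],
    landmark_offsets: Sequence[Sequence[float]],
    score_threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[Dict]:
    """Decode per-anchor outputs that one forward pass would produce together."""
    detections = []
    for anchor, score, deltas, offsets in zip(anchors, scores, box_deltas, landmark_offsets):
        if score < score_threshold:
            continue  # anchor is treated as background
        detections.append({
            "score": score,
            "box": decode_box(anchor, deltas),
            "landmarks": decode_landmarks(anchor, offsets),
        })
    return detections


if __name__ == "__main__":
    # Two anchors with made-up network outputs; only the first passes the threshold.
    anchors = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1)]
    scores = [0.9, 0.2]
    box_deltas = [[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
    landmark_offsets = [[0.5, -0.5, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
    for det in decode_detections(anchors, scores, box_deltas, landmark_offsets):
        print(det)
```

Because the box and the keypoints are decoded from the same anchor in the same loop, no second network has to be run on a cropped face, which is where the time saving mentioned in the abstract would come from.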

Keywords: biometrics, face detection, facial landmark detection

Short URL: https://sciup.org/140250027

IDR: 140250027 | DOI: 10.18287/2412-6179-CO-674

