Development of a Python application for recognizing gestures from a video stream of RGB and RGBD cameras
DOI: https://doi.org/10.32523/bulmathenu.2021/3.1

Keywords: depth camera, gesture recognition, convolutional neural network, RealSense, OpenCV, Python, VGG-16

Abstract
Gesture recognition systems have evolved considerably in recent years, driven by modern data capture devices (sensors) and new recognition algorithms. The article presents the results of a study on recognizing static and dynamic hand gestures from the video streams of RGB and RGBD cameras, namely the Logitech HD Pro Webcam C920 and the Intel RealSense D435 depth camera. The software is implemented in Python 3.6; open-source Python libraries provide robust implementations of image processing and segmentation algorithms. The feature extraction and gesture classification subsystem is based on the VGG-16 neural network architecture and implemented with the TensorFlow and Keras deep learning frameworks. The technical characteristics of the cameras are given, and the algorithm of the application is described. Experiments comparing the data capture devices under varying distance and illumination conditions show that the Intel RealSense D435 depth camera provides more accurate gesture recognition.
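
As a rough illustration of the kind of pipeline the abstract describes (OpenCV frame capture feeding a VGG-16-based classifier built with TensorFlow/Keras), the following minimal sketch is provided. The gesture class count, classification head, weight file, and webcam source are assumptions for illustration, not the authors' actual configuration; reading from the RealSense D435 would instead use the pyrealsense2 library.

```python
# Minimal sketch: VGG-16 feature extractor with a small classification head,
# applied to frames from an RGB webcam. Not the authors' exact pipeline.
import cv2
import numpy as np
from tensorflow.keras import layers, Model
from tensorflow.keras.applications import VGG16
from tensorflow.keras.applications.vgg16 import preprocess_input

NUM_CLASSES = 10          # assumed size of the gesture vocabulary
INPUT_SIZE = (224, 224)   # VGG-16 expects 224x224 RGB input

# VGG-16 convolutional base as a frozen feature extractor plus a dense head.
base = VGG16(weights="imagenet", include_top=False, input_shape=(*INPUT_SIZE, 3))
base.trainable = False
x = layers.GlobalAveragePooling2D()(base.output)
x = layers.Dense(256, activation="relu")(x)
outputs = layers.Dense(NUM_CLASSES, activation="softmax")(x)
model = Model(base.input, outputs)
# model.load_weights("gesture_head.h5")  # hypothetical weights trained on gesture data

cap = cv2.VideoCapture(0)  # RGB webcam (e.g. Logitech C920)
while True:
    ok, frame_bgr = cap.read()
    if not ok:
        break
    # Resize, convert OpenCV's BGR to RGB, and apply VGG-16 preprocessing.
    rgb = cv2.cvtColor(cv2.resize(frame_bgr, INPUT_SIZE), cv2.COLOR_BGR2RGB)
    batch = preprocess_input(rgb.astype(np.float32)[np.newaxis])
    probs = model.predict(batch, verbose=0)[0]
    label = int(np.argmax(probs))
    cv2.putText(frame_bgr, f"gesture {label}: {probs[label]:.2f}", (10, 30),
                cv2.FONT_HERSHEY_SIMPLEX, 1.0, (0, 255, 0), 2)
    cv2.imshow("gesture recognition", frame_bgr)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```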