Human gesture recognition under degraded environments using 3D-integral imaging and deep learning
Ver/ Abrir
Impacto
Scholar |
Otros documentos de la autoría: Krishnan, Gokul; Joshi, Rakesh; O’Connor, Timothy; Pla, Filiberto; Javidi, Bahram
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadatos
Título
Human gesture recognition under degraded environments using 3D-integral imaging and deep learningFecha de publicación
2020Editor
Optical Society of AmericaISSN
1094-4087Cita bibliográfica
Gokul Krishnan, Rakesh Joshi, Timothy O’Connor, Filiberto Pla, and Bahram Javidi, "Human gesture recognition under degraded environments using 3D-integral imaging and deep learning," Opt. Express 28, 19711-19725 (2020)Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
https://www.osapublishing.org/oe/abstract.cfm?uri=oe-28-13-19711&origin=searchVersión
info:eu-repo/semantics/publishedVersionResumen
In this paper, we propose a spatio-temporal human gesture recognition algorithm under degraded conditions using three-dimensional integral imaging and deep learning. The proposed algorithm leverages the advantages of ... [+]
In this paper, we propose a spatio-temporal human gesture recognition algorithm under degraded conditions using three-dimensional integral imaging and deep learning. The proposed algorithm leverages the advantages of integral imaging with deep learning to provide an efficient human gesture recognition system under degraded environments such as occlusion and low illumination conditions. The 3D data captured using integral imaging serves as the input to a convolutional neural network (CNN). The spatial features extracted by the convolutional and pooling layers of the neural network are fed into a bi-directional long short-term memory (BiLSTM) network. The BiLSTM network is designed to capture the temporal variation in the input data. We have compared the proposed approach with conventional 2D imaging and with the previously reported approaches using spatio-temporal interest points with support vector machines (STIP-SVMs) and distortion invariant non-linear correlation-based filters. Our experimental results suggest that the proposed approach is promising, especially in degraded environments. Using the proposed approach, we find a substantial improvement over previously published methods and find 3D integral imaging to provide superior performance over the conventional 2D imaging system. To the best of our knowledge, this is the first report that examines deep learning algorithms based on 3D integral imaging for human activity recognition in degraded environments. [-]
Publicado en
Optics Express 28, 19711-19725 (2020)Proyecto de investigación
FA9550-18-1-0338, N000141712405, N000141712561Derechos de acceso
© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
Aparece en las colecciones
- INIT_Articles [754]