Deep Learning-Based Sign Language Recognition Using Efficient Multi-Feature Attention Mechanism

dc.authorid: 0000-0001-6757-5990
dc.contributor.author: Yenisari, Esma
dc.contributor.author: Yavuz, Sirma
dc.date.accessioned: 2026-02-03T12:00:44Z
dc.date.available: 2026-02-03T12:00:44Z
dc.date.issued: 2025
dc.department: Çanakkale Onsekiz Mart Üniversitesi
dc.description.abstract: Sign language is a communication system used by Deaf and hard-of-hearing people and serves as a bridge between Deaf and hearing communities. Since sign language uses numerous visuomotor elements that include both visual perception (hand shapes, facial expressions) and physical movements (hand and arm movements), it represents a multimodal input source for Sign Language Recognition (SLR) systems. In this study, a novel deep learning-based architecture using EfficientNet and a multi-feature attention mechanism is proposed to accurately recognize SL signs. Initially, general visual features are acquired through the EfficientNet model, leveraging the transfer learning paradigm. Subsequently, dataset-specific contextual features are extracted using distinct network types: spatial dependencies are modeled via Convolutional Neural Networks (CNNs), whereas temporal dynamics are learned through Recurrent Neural Networks (RNNs). These features are adaptively weighted by an attention mechanism that focuses on the information most critical to the classification task. This approach ensures that the most information-rich and useful components of both methods are emphasized, leading to a significant increase in final performance. Using RGB video, the proposed model achieved accuracies of 99.01% and 96.84% for 50 and 174 sign classes, respectively, on the BosphorusSign22k General dataset of Turkish Sign Language (TSL) signs. Furthermore, the generalization ability of the model was demonstrated by its high accuracy of 99.84% on the Argentinian Sign Language dataset (LSA64) and 98.41% on the Indian Sign Language dataset (INCLUDE50). Experimental results indicated that the proposed architecture is competitive with existing SLR models reviewed in the literature.
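The abstract describes adaptively weighting spatial (CNN) and temporal (RNN) feature streams with an attention mechanism. The paper's exact scoring function is not reproduced in this record, so the sketch below is only an illustrative stand-in: it scores each stream with a hypothetical learned projection vector (`w_s`, `w_t`), normalizes the scores with a softmax, and forms the attention-weighted combination.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def attention_fuse(spatial_feat, temporal_feat, w_s, w_t):
    """Fuse two feature streams via scalar attention weights.

    Illustrative sketch only: the scoring vectors w_s and w_t are
    hypothetical stand-ins for whatever learned scoring function the
    paper's attention module actually uses.
    """
    scores = np.array([spatial_feat @ w_s, temporal_feat @ w_t])
    alphas = softmax(scores)  # attention weights, sum to 1
    fused = alphas[0] * spatial_feat + alphas[1] * temporal_feat
    return fused, alphas

# Toy usage with random 8-dimensional features.
rng = np.random.default_rng(0)
d = 8
f_s, f_t = rng.standard_normal(d), rng.standard_normal(d)
w_s, w_t = rng.standard_normal(d), rng.standard_normal(d)
fused, alphas = attention_fuse(f_s, f_t, w_s, w_t)
```

In a trained model the scoring vectors would be learned end-to-end, so streams carrying more class-discriminative information receive larger weights.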
dc.description.sponsorship: Scientific and Technological Research Council of Turkiye (TUBITAK) [125E318]
dc.description.sponsorship: This work was supported by the Scientific and Technological Research Council of Turkiye (TUBITAK) under Project 125E318.
dc.identifier.doi: 10.1109/ACCESS.2025.3586096
dc.identifier.endpage: 126699
dc.identifier.issn: 2169-3536
dc.identifier.scopus: 2-s2.0-105009969779
dc.identifier.scopusquality: Q1
dc.identifier.startpage: 126684
dc.identifier.uri: https://doi.org/10.1109/ACCESS.2025.3586096
dc.identifier.uri: https://hdl.handle.net/20.500.12428/34691
dc.identifier.volume: 13
dc.identifier.wos: WOS:001534536400047
dc.identifier.wosquality: Q2
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: Scopus
dc.language.iso: en
dc.publisher: IEEE-Inst Electrical Electronics Engineers Inc
dc.relation.ispartof: IEEE Access
dc.relation.publicationcategory: Article - International Peer-Reviewed Journal - Institutional Faculty Member
dc.rights: info:eu-repo/semantics/openAccess
dc.snmz: KA_WOS_20260130
dc.subject: Sign language
dc.subject: Hands
dc.subject: Systematic literature review
dc.subject: Feature extraction
dc.subject: Attention mechanisms
dc.subject: Sensors
dc.subject: Deep learning
dc.subject: Cameras
dc.subject: Accuracy
dc.subject: Deafness
dc.subject: Attention mechanism
dc.subject: computer vision
dc.subject: deep learning
dc.subject: sign language recognition
dc.subject: SLR datasets
dc.subject: vision-based recognition
dc.title: Deep Learning-Based Sign Language Recognition Using Efficient Multi-Feature Attention Mechanism
dc.type: Article

Files