Score level multi cue fusion for sign language recognition

dc.contributor: Graduate Program in Computer Engineering.
dc.contributor.advisor: Akarun, Lale.
dc.contributor.author: Gökçe, Çağrı.
dc.date.accessioned: 2023-03-16T10:04:39Z
dc.date.available: 2023-03-16T10:04:39Z
dc.date.issued: 2020.
dc.description.abstract: In this thesis, we propose a Score-Level Multi Cue Fusion approach that improves the sign language recognition performance of three-dimensional convolutional neural networks (3D CNNs). Sign language is the language of Deaf and hearing-impaired individuals and is performed using hand movements, facial gestures, and body alignment. Sign Language Recognition (SLR) is the task of understanding sign language, and it is gaining popularity as the efficiency of neural networks makes the task feasible. Previous work uses 3D CNN variants to inspect sign language properties in different settings. The vanilla 3D variant uses 3D kernels with high processing cost, the mixed convolution variant applies both 3D and 2D kernels, and the R(2+1)D variants use bottleneck connections to exploit the bottleneck dimension. Various studies use these networks to build end-to-end frameworks for tasks such as sign classification and translation. To achieve better performance, 3D CNN methods use complicated neural network architectures that have a branch for every cue. We evaluate the performance of the 3D networks and propose a more straightforward approach that adopts only a single neural network that can process multiple cues at test time. We exploit the hand, body, and face cues by training individual networks and fuse their results using weighted score fusion. We test our method on the recently published Turkish Isolated SLR dataset. Despite the simple architecture, our method achieves a 94% classification rate on 744 different sign glosses. We hope that the multi-cue approach can help with other SLR tasks such as translation, which is left as future work.
dc.format.extent: 30 cm.
dc.format.pages: xvii, 59 leaves ;
dc.identifier.other: CMPE 2020 G75
dc.identifier.uri: https://digitalarchive.library.bogazici.edu.tr/handle/123456789/12427
dc.publisher: Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020.
dc.subject.lcsh: Sign language.
dc.title: Score level multi cue fusion for sign language recognition
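
The weighted score fusion mentioned in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes each cue (hand, body, face) yields a vector of per-class scores, and the function names, weights, and score values below are hypothetical.

```python
from typing import Dict, List


def fuse_scores(cue_scores: Dict[str, List[float]],
                weights: Dict[str, float]) -> List[float]:
    """Return the weighted sum of per-class scores across cues.

    Weights are normalized to sum to 1, so the fused vector stays on the
    same scale as the inputs. Cue names and weights are illustrative only.
    """
    total = sum(weights[cue] for cue in cue_scores)
    if total <= 0:
        raise ValueError("cue weights must sum to a positive value")

    num_classes = len(next(iter(cue_scores.values())))
    fused = [0.0] * num_classes
    for cue, scores in cue_scores.items():
        if len(scores) != num_classes:
            raise ValueError(f"cue '{cue}' has a different number of classes")
        w = weights[cue] / total
        for i, s in enumerate(scores):
            fused[i] += w * s
    return fused


def predict(fused: List[float]) -> int:
    """Return the index of the class with the highest fused score."""
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical scores for three sign classes from three cue networks.
example_scores = {
    "hand": [0.6, 0.3, 0.1],
    "body": [0.2, 0.5, 0.3],
    "face": [0.3, 0.4, 0.3],
}
# Hypothetical weights; in practice they would be tuned on validation data.
example_weights = {"hand": 0.5, "body": 0.3, "face": 0.2}

example_fused = fuse_scores(example_scores, example_weights)
example_label = predict(example_fused)
```

In this toy case the body and face cues each favor the second class, but the higher weight on the hand cue lets the first class win after fusion.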

Files

Original bundle
Name:
b2714059.035491.001.PDF
Size:
3.92 MB
Format:
Adobe Portable Document Format
Name:
b2714059.035492.001.rar
Size:
2.43 MB
Format:
Unknown data format
