Neural Sign Language Translation by learning tokenization

dc.contributorGraduate Program in Computer Engineering.
dc.contributor.advisorAkarun, Lale.
dc.contributor.authorOrbay, Alptekin.
dc.date.accessioned2023-03-16T10:04:40Z
dc.date.available2023-03-16T10:04:40Z
dc.date.issued2020.
dc.description.abstractIn this thesis, we propose a multitask learning based method to improve Neural Sign Language Translation (NSLT) consisting of two parts, a tokenization layer and Neural Machine Translation (NMT). The tokenization part focuses on how Sign Language (SL) videos should be represented to be fed into the other part. It has not been studied elaborately whereas NMT research has attracted several researchers contributing enormous advancements. Up to now, there are two main input tokenization levels, namely frame-level and gloss-level tokenization. Glosses are world-like intermediate presentation and unique to SLs. Therefore, we aim to develop a generic sign-level tokenization layer so that it is applicable to other domains without further e ort. We begin with investigating current tokenization approaches and explain their weaknesses with several experiments. To provide a solution, we adapt Transfer Learning, Multitask Learning and Unsupervised Domain Adaptation into this research to leverage additional supervision. We succeed in enabling knowledge transfer between SLs and improve translation quality by 5 points in BLEU-4 and 8 points in ROUGE scores. Secondly, we show the e ects of body parts by extensive experiments in all the tokenization approaches. Apart from these, we adopt 3D-CNNs to improve e ciency in terms of time and space. Lastly, we discuss the advantages of sign-level tokenization over gloss-level tokenization. To sum up, our proposed method eliminates the need for gloss level annotation to obtain higher scores by providing additional supervision by utilizing weak supervision sources.
dc.format.extent30 cm.
dc.format.pagesxiii, 71 leaves ;
dc.identifier.otherCMPE 2020 O73
dc.identifier.urihttps://digitalarchive.library.bogazici.edu.tr/handle/123456789/12428
dc.publisherThesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020.
dc.subject.lcshSign language -- Study and teaching.
dc.subject.lcshLearning strategies.
dc.titleNeural Sign Language Translation by learning tokenization

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
b2714074.035499.001.PDF
Size:
13.48 MB
Format:
Adobe Portable Document Format

Collections