Affect analysis of communication texts in Turkish language
dc.contributor | Graduate Program in Computer Engineering. | |
dc.contributor.advisor | Salah, Albert Ali. | |
dc.contributor.author | Oktay, Eda Aydın. | |
dc.date.accessioned | 2023-03-16T10:02:16Z | |
dc.date.available | 2023-03-16T10:02:16Z | |
dc.date.issued | 2016. | |
dc.description.abstract | As digital technologies expanded our range of online communication, textual a ect analysis become the focus of considerable interest in several disciplines. While many studies have been conducted in English language, there are only a few tools speci c for the Turkish language. One reason for that is the lack of lexical resources for a ect analysis and annotated corpora for an accurate evaluation. In this thesis, we develop an approach for continuous and dimensional a ect analysis of Turkish communications by combining several tools. We conduct experiments on various text corpora in Turkish including online multi-party chat records, psychotherapy records, Twitter data, movie reviews, and teachers' comments on high school students. Analyzing such texts brings challenges like non-standard word usage, grammatical irregularities, abbreviation usage, and spelling mistakes. We propose several pre-processing steps to deal with these. Then, we adapt an a ective word dictionary from English to Turkish, and by expanding it with synsets, obtain 15,200 words with annotations for valence, arousal, and dominance. We also employ a list of frequently used abbreviations, emoticons, interjections, modi ers (intensi ers and diminishers), and other linguistic indicators to capture the overall a ective state at the sentence level. We recruit and train annotators to obtain a ective ground truth speci cally for multi-party chat records. Our results show that the proposed system is useful, yet there is much room for improvement at di erent stages. | |
dc.format.extent | 30 cm. | |
dc.format.pages | xiv, 91 leaves ; | |
dc.identifier.other | CMPE 2016 O57 | |
dc.identifier.uri | https://digitalarchive.library.bogazici.edu.tr/handle/123456789/12309 | |
dc.publisher | Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2016. | |
dc.subject.lcsh | Natural language processing (Computer science) | |
dc.subject.lcsh | Turkish language -- Texts. | |
dc.title | Affect analysis of communication texts in Turkish language |