Drug-target affinity prediction using a graph-based approach enriched with molecule words

dc.contributorGraduate Program in Computer Engineering.
dc.contributor.advisorÖzgür, Arzucan.
dc.contributor.authorYılmaz, Cansu Damla.
dc.date.accessioned2025-04-14T12:09:51Z
dc.date.available2025-04-14T12:09:51Z
dc.date.issued2023
dc.description.abstractWet-lab experiments to predict the affinity of drugs for their targets are costly and time consuming. Computational methods can provide an alternative to early stage experiments and guide the research process. Recently, the use of natural language processing techniques to represent molecules has become popular and has led to successful results. In our work, we assume that proteins and ligands, like human languages, have their own languages and that these languages consist of meaningful smaller parts that we call words. We identify protein and ligand words based on their 1D sequences using a subword tokenization method and represent protein-ligand interactions with a heterogeneous graph consisting of four different node types corresponding to proteins, ligands, protein words, and ligand words. A graph-based approach is used to learn embeddings for the nodes in the graph. These embeddings are fed into a deep learning model for predicting protein-ligand binding affinity. We show that using their word embeddings to represent novel proteins and/or ligands not present in the training set improves the results compared to the case where no words are used. Using pre-trained word embeddings for previously unknown molecules is also efficient in terms of complexity, as we do not need to re-train the input graph to learn the embeddings for these new molecules.
dc.format.pagesxiii, 50 leaves
dc.identifier.otherGraduate Program in Computer Engineering. TKL 2023 U68 PhD (Thes TKL 2023 K75
dc.identifier.urihttps://digitalarchive.library.bogazici.edu.tr/handle/123456789/21494
dc.publisherThesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2023.
dc.subject.lcshDrug targeting.
dc.subject.lcshMolecules.
dc.subject.lcshCombinatorial analysis.
dc.titleDrug-target affinity prediction using a graph-based approach enriched with molecule words

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
b2795769.038436.001.PDF
Size:
16.1 MB
Format:
Adobe Portable Document Format

Collections