Empowering heterogeneous networks for drug-target affinity prediction

dc.contributorGraduate Program in Computer Engineering.
dc.contributor.advisorÖzgür, Arzucan.
dc.contributor.advisorÖzkırımlı, Elif.
dc.contributor.authorParlar, Selen.
dc.date.accessioned2023-10-15T06:58:15Z
dc.date.available2023-10-15T06:58:15Z
dc.date.issued2022
dc.description.abstractPredicting drug-target binding affinity is a critical phase in computer-aided drug design, which can help accelerate the drug development process and reduce experimen tal validation costs caused by the significant false-positive rates. Hence, developing in-silico computational algorithms to predict drug- target binding affinity values has become an important research area. Machine learning approaches have been pro posed for this task, including models that use readily available biomolecule sequences and heterogeneous networks enriched with drug and target-related information. We present WideDeepDTA, the first study that leverages both text-based and network based approaches and predicts drug-target binding affinities. Given homogeneous and heterogeneous networks containing multiple types of biological entities, relationships between these entities, and pre-trained language models for biomolecular language, WideDeepDTA first learns the low-dimensional feature representation of drugs and targets using the node embedding technique Metapath2Vec. Then, it predicts affinity values based on the learned features. WideDeepDTA demonstrates its ability to cre ate rich representations in the drug-target affinity prediction task compared to one of the state-of-the-art methods, DeepDTA, on the BDB dataset in terms of concordance index and mean squared error. Experiments indicate that integrating pre-trained lan guage models with heterogeneous information improves model performance, especially while predicting the affinity values between proteins and unseen ligands. Moreover, the results show that the model performance improves when heterogeneous graphs are empowered with the information extracted from text-based representations.
dc.format.pagesxiv, 85 leaves
dc.identifier.otherCMPE 2022 P37
dc.identifier.urihttps://digitalarchive.library.bogazici.edu.tr/handle/123456789/19714
dc.publisherThesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2022.
dc.subject.lcshHeterogeneous catalysis.
dc.subject.lcshDrugs -- Design -- Data processing.
dc.subject.lcshComputer-aided design.
dc.titleEmpowering heterogeneous networks for drug-target affinity prediction

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
b2778391.037769.001.PDF
Size:
555.09 KB
Format:
Adobe Portable Document Format

Collections