Date of Completion
6-20-2020
Embargo Period
9-16-2020
Keywords
Drug-Target Interaction, Singular Value Decomposition, Time Series Analysis, Motif Search, Jacobi SVD, Autoencoder, Ensemble Methods, Edit Distance
Major Advisor
Dr. Sanguthevar Rajasekaran
Associate Advisor
Dr. Reda Ammar
Associate Advisor
Dr. Ion Mandoiu
Field of Study
Computer Science and Engineering
Degree
Doctor of Philosophy
Open Access
Campus Access
Abstract
In this dissertation we propose novel approaches for data mining and machine learn- ing for some of the fundamental and advanced problems in the areas of data analysis and bioinformatics. A number of problems such as Drug-Target Interaction (DTI) pre- diction, Singular Value Decomposition (SVD), Time series Analysis (TSA) and motif search has been studied in this thesis. We have developed algorithms that outperformed the state of art in all of the above mentioned areas. We proposed algorithms for DTI prediction that outperformed all prior algorithms over benchmark datasets under mul- tiple scenarios. Our proposed approaches for Jacobi based SVD, both sequential and parallel, improves the computation time and work compared to the state of art. We also proposed ensemble based TSA models that improves classification accuracy sta- tistically significantly compared to all prior algorithm on benchmark dataset. We also introduced novel motif search algorithms for DNA and protein motifs.
Recommended Citation
Pathak, Sudipta, "Novel Algorithms and Applications for Data Mining and Machine Learning" (2020). Doctoral Dissertations. 2594.
https://digitalcommons.lib.uconn.edu/dissertations/2594