Gökçe Abay, Biological Data Integration and Relation Prediction by Matrix Factorization
In this study, we propose integrating large-scale gene/protein annotation data using non-negative matrix factorization (NMF). The ultimate aim is twofold: to predict unknown binary relationships between biological entities (i.e., proteins, functions, and disease entries), and to represent these entities as informative, non-redundant quantitative feature vectors derived from the low-rank feature matrices that the factorization produces. These vectors can then be used in diverse data mining and machine learning tasks, such as the automated annotation of proteins or the construction of biological knowledge graphs.
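As a rough illustration of the idea, the sketch below factorizes a small binary protein-function relation matrix and reads the reconstructed scores as relation predictions. It is a minimal example under stated assumptions, not the method used in the study: the matrix `R` is made-up toy data, the helper `nmf` and its parameters are hypothetical names, and the Lee-Seung multiplicative update rule is one common NMF solver chosen here for brevity.

```python
import numpy as np

def nmf(V, rank, n_iter=500, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (m x n) as W @ H.

    Uses Lee-Seung multiplicative updates for the Frobenius objective.
    This solver is an assumption of the sketch, not the study's method.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, rank)) + eps   # row-entity features (e.g. proteins)
    H = rng.random((rank, n)) + eps   # column-entity features (e.g. functions)
    for _ in range(n_iter):
        # Multiplicative updates keep every entry non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Toy binary relation matrix (made-up data): rows are proteins,
# columns are function terms, 1 marks a known annotation.
R = np.array([
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 0],
], dtype=float)

W, H = nmf(R, rank=2)
scores = W @ H  # reconstructed relation scores

# Rank the pairs with no known relation by their reconstructed score;
# high-scoring pairs are candidate predictions.
unknown = np.argwhere(R == 0)
ranked = sorted(unknown.tolist(), key=lambda ij: -scores[ij[0], ij[1]])
```

Each row of `W` and each column of `H` serves as a low-dimensional feature vector for the corresponding entity, which is the sense in which the factor matrices could feed later machine learning tasks.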
Date: 30.01.2020 / 15:30 Place: A-212