Predicting missing annotations in Gene Ontology with Knowledge Graph Embeddings and True Path Rule

conference paper
Gene Ontology (GO) and its Annotations (GOA) provide a controlled and evolving vocabulary for gene products and gene functions widely used in molecular biology. GO & GOA are updated and maintained both automatically from biological publications and manually by curators. These knowledge bases however are often incomplete for two reasons: 1) Research in biological domain itself is still ongoing; 2) The amount of experimental evidence might not be yet sufficient to validate annotations. In this paper, we address the gap in evidence between gene products and their annotations by making link predictions using Knowledge Graph Embedding (KGE) methods. Through the application of the True Path Rule (TPR) in the training stage of KGE, we were able to improve the performance of traditional KGE methods. We report two experimental scenarios with GO and GO Chicken Annotation datasets to show the contribution of embedding TPR to prediction accuracy. (C) 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
TNO Identifier
987818
ISSN
16130073
Publisher
CEUR-WS
Source title
CEUR Workshop Proceedings
Editor(s)
Yamaguchi A.Splendiani A.Marshall M.S.Baker C.Bolleman J.Burger A.Castro L.J.Eigenbrod O.Osterle S.Romacker M.Waagmeester A.
Pages
82-86
Files
To receive the publication files, please send an e-mail request to TNO Repository.