PeerJ Computer Science | 2021

Fusion of text and graph information for machine learning problems on networks

Abstract

Today, increased attention is drawn towards network representation learning, a technique that maps nodes of a network into vectors of a low-dimensional embedding space. A network embedding constructed this way aims to preserve nodes similarity and other specific network properties. Embedding vectors can later be used for downstream machine learning problems, such as node classification, link prediction and network visualization. Naturally, some networks have text information associated with them. For instance, in a citation network, each node is a scientific paper associated with its abstract or title; in a social network, all users may be viewed as nodes of a network and posts of each user as textual attributes. In this work, we explore how combining existing methods of text and network embeddings can increase accuracy for downstream tasks and propose modifications to popular architectures to better capture textual information in network embedding and fusion frameworks.

PeerJ Computer Science | 2021

Fusion of text and graph information for machine learning problems on networks

Abstract

Volume 7

Pages None

DOI 10.7717/peerj-cs.526

Language English

Journal PeerJ Computer Science

Full Text