Learning network embeddings using small graphlets
le Gorrec, Luce and Knight, Philip A. and Caen, Auguste (2021) Learning network embeddings using small graphlets. Social Network Analysis and Mining, 12 (1). 20. ISSN 1869-5450 (https://doi.org/10.1007/s13278-021-00846-9)
Preview |
Text.
Filename: le_Gorrec_etal_SNAM_Learning_network_embeddings_using_small_graphlets.pdf
Final Published Version License: Download (2MB)| Preview |
Abstract
Techniques for learning vectorial representations of graphs (graph embeddings) have recently emerged as an effective approach to facilitate Machine Learning on graphs. Some of the most popular methods involve sophisticated features such as graph kernels or convolutional networks. In this work, we introduce two straightforward supervised learning algorithms based on small-size graphlet counts, combined with a dimension reduction step. The first relies on a classic feature extraction method powered by Principal Component Analysis (PCA). The second is a feature selection procedure also based on PCA. Despite their conceptual simplicity, these embeddings are arguably more meaningful than some popular alternatives and at the same time are competitive with state-of-the-art methods. We illustrate this second point on a downstream classification task. We then use our algorithms in a novel setting, namely to conduct an analysis of author relationships in Wikipedia articles, for which we present an original dataset. Finally, we provide empirical evidence suggesting that our methods could also be adapted to unsupervised learning algorithms.
ORCID iDs
le Gorrec, Luce, Knight, Philip A. ORCID: https://orcid.org/0000-0001-9511-5692 and Caen, Auguste;-
-
Item type: Article ID code: 78820 Dates: DateEvent15 December 2021Published15 September 2021AcceptedSubjects: Science > Mathematics Department: Faculty of Science > Mathematics and Statistics Depositing user: Pure Administrator Date deposited: 08 Dec 2021 11:24 Last modified: 11 Nov 2024 13:19 Related URLs: URI: https://strathprints.strath.ac.uk/id/eprint/78820