Recognizing semantic relations by combining transformers and fully connected models

Tools

Roussinov, Dmitri and Sharoff, Serge and Puchnina, Nadezhda; Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe and Choukri, Khalid and Cieri, Christopher and Declerck, Thierry and Goggi, Sara and Isahara, Hitoshi and Maegaard, Bente and Mariani, Joseph and Mazo, Hélène and Moreno, Asuncion and Odijk, Jan and Piperidis, Stelios, eds. (2020) Recognizing semantic relations by combining transformers and fully connected models. In: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings. LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings . European Language Resources Association (ELRA), FRA, pp. 5838-5845. ISBN 9791095546344 (https://www.aclweb.org/anthology/2020.lrec-1.715/)

[thumbnail of Roussinov-etal-LREC-2021-Recognizing-semantic-relations-by-combining-transformers-and-fully-connected-models]

Preview

Text. Filename: Roussinov_etal_LREC_2021_Recognizing_semantic_relations_by_combining_transformers_and_fully_connected_models.pdf
Final Published Version
License:

Download (295kB)| Preview

Abstract

Automatically recognizing an existing semantic relation (e.g. "is a", "part of", "property of", "opposite of" etc.) between two words (phrases, concepts, etc.) is an important task affecting many NLP applications and has been subject of extensive experimentation and modeling. Current approaches to automatically telling if a relation exists between two given concepts X and Y can be grouped into two types: 1) those modeling word-paths connecting X and Y in text and 2) those modeling distributional properties of X and Y separately, not necessary in the proximity to each other. Here, we investigate how both types can be improved and combined. We suggest a distributional approach that is based on an attention-based transformer. We have also developed a novel word path model that combines useful properties of a convolutional network with a fully connected language model. While our transformer-based approach works better, both our models significantly outperform the state-of-the-art within their classes of approaches. We also demonstrate that combining the two approaches results in additional gains since they use somewhat different data sources.

ORCID iDs

Roussinov, Dmitri

, Sharoff, Serge and Puchnina, Nadezhda; Calzolari, Nicoletta, Béchet, Frédéric, Blache, Philippe, Choukri, Khalid, Cieri, Christopher, Declerck, Thierry, Goggi, Sara, Isahara, Hitoshi, Maegaard, Bente, Mariani, Joseph, Mazo, Hélène, Moreno, Asuncion, Odijk, Jan and Piperidis, Stelios

Share and Export

Item metadata

Item type:	Book Section
ID code:	75898
Dates:	Date Event 31 May 2020 Published 13 February 2020 Accepted
Subjects:	Language and Literature > Philology. Linguistics
Department:	Faculty of Science > Computer and Information Sciences
Depositing user:	Pure Administrator
Date deposited:	23 Mar 2021 10:20
Last modified:	03 Feb 2025 23:11
Related URLs:	Scopus publication
URI:	https://strathprints.strath.ac.uk/id/eprint/75898

CORE (COnnecting REpositories)