Changing the Geometry of Representations: α-Embeddings for NLP Tasks

Riccardo Volpi, Uddhipan Thakur, Luigi Malagò

Entropy, 23(3), 287 (2021) .

Full text: http://dx.doi.org/10.3390/e23030287

Abstract

Word embeddings based on a conditional model are commonly used in Natural Language Processing (NLP) tasks to embed the words of a dictionary in a low dimensional linear space. Their computation is based on the maximization of the likelihood of a conditional probability distribution for each word of the dictionary. These distributions form a Riemannian statistical manifold, where word embeddings can be interpreted as vectors in the tangent space of a specific reference measure on the manifold. A novel family of word embeddings, called α-embeddings have been recently introduced as deriving from the geometrical deformation of the simplex of probabilities through a parameter α, using notions from Information Geometry. After introducing the α-embeddings, we show how the deformation of the simplex, controlled by α, provides an extra handle to increase the performances of several intrinsic and extrinsic tasks in NLP. We test the α-embeddings on different tasks with models of increasing complexity, showing that the advantages associated with the use of α-embeddings are present also for models with a large number of parameters. Finally, we show that tuning α allows for higher performances compared to the use of larger models in which additionally a transformation of the embeddings is learned during training, as experimentally verified in attention models.

Add this to the list of publications that I have authored

Save to my library

Add your rating and review

If all scientific publications that you have read were ranked according to their scientific quality and importance from 0% (worst) to 100% (best), where would you place this publication? Please rate by selecting a range.

0% - 100%

This publication ranks between % and % of publications that I have read in terms of scientific quality and importance.

Review title (optional)

Keep my rating and review anonymous
Show publicly that I gave the rating and I wrote the review

Export to:

Notice: Undefined index: publicationsCaching in /www/html/epistemio/application/controllers/PublicationController.php on line 2240

Changing the Geometry of Representations: α-Embeddings for NLP Tasks

Abstract

Add your rating and review

Sign up / Log in

Services

Company

Legal info

Blog & newsletter

Follow us

Changing the Geometry of Representations: α-Embeddings for NLP Tasks

Abstract

Add your rating and review

Embed publication

Save publication

Share comment

Sign up / Log in

Services

Company

Legal info

Blog & newsletter

Follow us

Log in