Publications | Edoardo M. Ponti

Zero-Shot Tokenizer Transfer

Language models (LMs) are bound to their tokenizer, which maps raw text to a sequence of vocabulary items (tokens). This restricts …

Benjamin Minixhofer, Edoardo M. Ponti, Ivan Vulić

PDF Code

Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference

Transformers have emerged as the backbone of large language models (LLMs). However, generation remains inefficient due to the need to …

Piotr Nawrot, Adrian Łańcucki, Marcin Chochowski, David Tarjan, Edoardo M. Ponti

PDF

Scaling Sparse Fine-Tuning to Large Language Models

Large Language Models (LLMs) are difficult to fully fine-tune (e.g., with instructions or human feedback) due to their sheer number of …

Alan Ansell, Ivan Vulić, Hannah Sterz, Anna Korhonen, Edoardo M. Ponti

PDF Code

Combining Modular Skills in Multitask Learning

A modular design encourages neural models to disentangle and recombine different facets of knowledge to generalise more systematically …

Edoardo M. Ponti, Alessandro Sordoni, Yoshua Bengio, Siva Reddy

PDF Code

Visually Grounded Reasoning across Languages and Cultures

The design of widespread vision-and-language datasets and pre-trained encoders directly adopts, or draws inspiration from, the concepts …

Fangyu Liu, Emanuele Bugliarello, Edoardo M. Ponti, Siva Reddy, Nigel Collier, Desmond Elliott

PDF Code Dataset Project

Modelling Latent Translations for Cross-Lingual Transfer

While achieving state-of-the-art results in multiple tasks and languages, translation-based cross-lingual transfer is often overlooked …

Edoardo M. Ponti, Julia Kreutzer, Ivan Vulić, Siva Reddy

PDF Code

Composable Sparse Fine-Tuning for Cross-Lingual Transfer

Fine-tuning all parameters of a pre-trained model has become the mainstream approach for transfer learning. To increase its efficiency …

Alan Ansell, Edoardo M. Ponti, Anna Korhonen, Ivan Vulić

PDF Code

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning

In order to simulate human language capacity, natural language processing systems must complement the explicit information derived from …

Edoardo M. Ponti, Goran Glavaš, Olga Majewska, Qianchu Liu, Ivan Vulić, Anna Korhonen

PDF Dataset

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse …

Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

PDF Dataset

Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages

Most combinations of NLP tasks and language varieties lack in-domain examples for supervised training because of the paucity of …

Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parovic, Roi Reichart, Anna Korhonen

PDF Code

Towards Zero-shot Language Modeling

Can we construct a neural language model which is inductively biased towards learning human language? Motivated by this question, we …

Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Roi Reichart, Anna Korhonen

PDF Video

Cross-lingual Semantic Specialization via Lexical Relation Induction

Semantic specialization integrates structured linguistic knowledge from external resources (such as lexical relations in WordNet) into …

Edoardo M. Ponti, Ivan Vulić, Goran Glavaš, Roi Reichart, Anna Korhonen

PDF Code

Informing Unsupervised Pretraining with External Linguistic Knowledge

Unsupervised pretraining models have been shown to facilitate a wide range of downstream applications. These models, however, still …

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen, Goran Glavaš

PDF

Specializing Distributional Vectors of All Words for Lexical Entailment

Semantic specialization methods fine-tune distributional word vectors using lexical knowledge from external resources (e.g., WordNet) …

Aishwarya Kamath, Jonas Pfeiffer, Edoardo Ponti, Goran Glavaš, Ivan Vulić

PDF

Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing

Linguistic typology aims to capture structural and semantic variation across the world’s languages. A large-scale typology could …

Edoardo Maria Ponti, Helen O’Horan, Yevgeni Berzak, Ivan Vulić, Roi Reichart, Thierry Poibeau, Ekaterina Shutova, Anna Korhonen

PDF

Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization

Semantic specialization is the process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (eg, …

Edoardo M. Ponti, Ivan Vulić, Goran Glavaš, Nikola Mrkšić, Anna Korhonen

PDF Code