Edoardo M. Ponti
News
Publications
FAQ
Resume
Publications
Type
Conference paper
Journal article
Preprint
Date
2024
2022
2021
2020
2019
2018
Zero-Shot Tokenizer Transfer
Language models (LMs) are bound to their tokenizer, which maps raw text to a sequence of vocabulary items (tokens). This restricts …
Benjamin Minixhofer
,
Edoardo M. Ponti
,
Ivan Vulić
PDF
Code
Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference
Transformers have emerged as the backbone of large language models (LLMs). However, generation remains inefficient due to the need to …
Piotr Nawrot
,
Adrian Łańcucki
,
Marcin Chochowski
,
David Tarjan
,
Edoardo M. Ponti
PDF
Scaling Sparse Fine-Tuning to Large Language Models
Large Language Models (LLMs) are difficult to fully fine-tune (e.g., with instructions or human feedback) due to their sheer number of …
Alan Ansell
,
Ivan Vulić
,
Hannah Sterz
,
Anna Korhonen
,
Edoardo M. Ponti
PDF
Code
Combining Modular Skills in Multitask Learning
A modular design encourages neural models to disentangle and recombine different facets of knowledge to generalise more systematically …
Edoardo M. Ponti
,
Alessandro Sordoni
,
Yoshua Bengio
,
Siva Reddy
PDF
Code
Visually Grounded Reasoning across Languages and Cultures
The design of widespread vision-and-language datasets and pre-trained encoders directly adopts, or draws inspiration from, the concepts …
Fangyu Liu
,
Emanuele Bugliarello
,
Edoardo M. Ponti
,
Siva Reddy
,
Nigel Collier
,
Desmond Elliott
PDF
Code
Dataset
Project
Modelling Latent Translations for Cross-Lingual Transfer
While achieving state-of-the-art results in multiple tasks and languages, translation-based cross-lingual transfer is often overlooked …
Edoardo M. Ponti
,
Julia Kreutzer
,
Ivan Vulić
,
Siva Reddy
PDF
Code
Composable Sparse Fine-Tuning for Cross-Lingual Transfer
Fine-tuning all parameters of a pre-trained model has become the mainstream approach for transfer learning. To increase its efficiency …
Alan Ansell
,
Edoardo M. Ponti
,
Anna Korhonen
,
Ivan Vulić
PDF
Code
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
In order to simulate human language capacity, natural language processing systems must complement the explicit information derived from …
Edoardo M. Ponti
,
Goran Glavaš
,
Olga Majewska
,
Qianchu Liu
,
Ivan Vulić
,
Anna Korhonen
PDF
Dataset
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse …
Ivan Vulić
,
Simon Baker
,
Edoardo Maria Ponti
,
Ulla Petti
,
Ira Leviant
,
Kelly Wing
,
Olga Majewska
,
Eden Bar
,
Matt Malone
,
Thierry Poibeau
,
Roi Reichart
,
Anna Korhonen
PDF
Dataset
Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages
Most combinations of NLP tasks and language varieties lack in-domain examples for supervised training because of the paucity of …
Edoardo M. Ponti
,
Ivan Vulić
,
Ryan Cotterell
,
Marinela Parovic
,
Roi Reichart
,
Anna Korhonen
PDF
Code
Towards Zero-shot Language Modeling
Can we construct a neural language model which is inductively biased towards learning human language? Motivated by this question, we …
Edoardo M. Ponti
,
Ivan Vulić
,
Ryan Cotterell
,
Roi Reichart
,
Anna Korhonen
PDF
Video
Cross-lingual Semantic Specialization via Lexical Relation Induction
Semantic specialization integrates structured linguistic knowledge from external resources (such as lexical relations in WordNet) into …
Edoardo M. Ponti
,
Ivan Vulić
,
Goran Glavaš
,
Roi Reichart
,
Anna Korhonen
PDF
Code
Informing Unsupervised Pretraining with External Linguistic Knowledge
Unsupervised pretraining models have been shown to facilitate a wide range of downstream applications. These models, however, still …
Anne Lauscher
,
Ivan Vulić
,
Edoardo Maria Ponti
,
Anna Korhonen
,
Goran Glavaš
PDF
Specializing Distributional Vectors of All Words for Lexical Entailment
Semantic specialization methods fine-tune distributional word vectors using lexical knowledge from external resources (e.g., WordNet) …
Aishwarya Kamath
,
Jonas Pfeiffer
,
Edoardo Ponti
,
Goran Glavaš
,
Ivan Vulić
PDF
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
Linguistic typology aims to capture structural and semantic variation across the world’s languages. A large-scale typology could …
Edoardo Maria Ponti
,
Helen O’Horan
,
Yevgeni Berzak
,
Ivan Vulić
,
Roi Reichart
,
Thierry Poibeau
,
Ekaterina Shutova
,
Anna Korhonen
PDF
Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization
Semantic specialization is the process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (eg, …
Edoardo M. Ponti
,
Ivan Vulić
,
Goran Glavaš
,
Nikola Mrkšić
,
Anna Korhonen
PDF
Code
Cite
×