neuralnoise.com

March 2025 in Research

Thu, 17 Apr 2025 02:00:00 +0200

We have been working on language model evaluation, knowledge utilization, efficiency, and multimodal reasoning. We had papers at ICLR 2025, NAACL 2025 (x3), AAAI 2025, and others, along with several ongoing works.

NAACL 2025 – Controlling Knowledge & Reasoning

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering, by Yu Zhao et al. – We introduce SpARE, a training‑free method to control whether an LLM relies on its internal parametric knowledge or given context when conflicts arise. By analyzing mid‑layer activations with sparse autoencoders, SpARE identifies conflict signals and manipulates them to steer the model at inference time, significantly improving performance on open‑domain QA compared to prior methods. (Oral presentation)
Are We Done with MMLU?, by Aryo Gema and many others – We analyze the Massive Multitask Language Understanding benchmark, uncovering a high error rate (57% of sampled questions had at least one ground‑truth mistake). We introduce MMLU‑Redux, a cleaned subset of 3,000 expert‑verified questions, and show that corrected evaluations can substantially alter model rankings. MMLU‑Redux is open‑sourced also adopted for example by DeepSeek and Qwen!
Self-Training Large Language Models for Tool-Use Without Demonstrations, based on Ne Luo’s MSc project – We explore whether LLMs can learn tool usage (e.g., search engines, calculators) without hand‑crafted examples. Starting with zero‑shot prompts, we generate synthetic tool‑using traces and then fine‑tune the model with them. On PopQA, the self‑trained model gains +3.7% accuracy, though results vary on other datasets, highlighting both promise and challenges in autonomous tool‑use learning. Ne Luo is looking for a PhD position, contact her if you are interested in working with her!

ICLR 2025 – Learning & Evaluation

An Auditing Test to Detect Behavioral Shift in Language Models, by the amazing Leo Richter – We propose a method for continual Behavioral Shift Auditing (BSA) of LLMs. This statistical test monitors an LLM’s outputs for significant deviations from a reference model’s behavior, with theoretical guarantees on detecting genuine shifts while avoiding false alarms. Our BSA approach relies on catching subtle changes in a model’s toxicity and translation performance after fine-tuning, using only a few hundred examples, offering a practical tool to ensure that an LLM remains aligned during its deployment/lifetime.

AAAI 2025 – Efficient Inference

Adaptive Computation Modules: Granular Conditional Computation for Efficient Inference, by Bartosz Wójcik, Alessio Devoto, et al. – We propose Adaptive Computation Modules (ACMs) for dynamic, per‑token computation in Transformers. ACMs consist of cascaded sub‑modules with gating functions that allow easy tokens to exit early. Our distillation method retrofits pre‑trained models with ACMs, cutting inference cost without accuracy loss in vision and speech tasks, offering a plug‑and‑play approach to green AI.

COLING 2025 – Multilingual Resources

SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages, by Gayane Ghazaryan, Erik Arakelyan et al. – SynDARin synthesizes QA datasets in low‑resource languages (e.g., Armenian) by generating English questions via LLMs from parallel corpora, translating and validating them. The resulting 16,000+ QA pairs produce a challenging benchmark where models often perform near chance, highlighting critical gaps and enabling rapid evaluation in languages lacking resources.

Frontiers in AI 2025 – Human-AI Collaboration

Fostering Effective Hybrid Human-LLM Reasoning and Decision Making – We examine frameworks combining LLMs and human judgment for complex tasks, offering design principles for AI‑assisted decision systems. Through case studies, we show that integrating LLM‑generated insights with human oversight yields more reliable and interpretable outcomes than either alone, providing guidelines for principled human‑in‑the‑loop systems.

What’s Brewing

Noiser: Bounded Input Perturbations for Attributing Large Language Models, by Reza Madani et al. – Noiser perturbs input embeddings to attribute token importance, introducing an “answerability” check to validate attributions. Outperforming gradients and attention, Noiser offers robust post‑hoc explanations for LLM predictions.
An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering, by Alex, Sanad, and others amazing students at the UoE – We analyse how faithfulness‑enhancing decoding (e.g., DeCoRe) within the ReAct agent framework improves multi‑hop QA, boosting HotpotQA F1 from 19.5 to 32.6, underscoring the role of decoding in reliable LLM reasoning.
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression, led by Nathan Godey – Q-Filters uses query‑key geometric projections to filter past tokens on the fly, compressing KV cache without retraining and matching attention‑based methods like SnapKV, enabling long‑context generation with minimal memory.
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization, by the amazing Rohit Saxena – PosterSum offers 16,000+ re search posters paired with abstracts for evaluating vision‑language summarization. Our “Segment & Summarize” approach secures a 3.1% ROUGE‑L gain, highlighting this benchmark’s challenge.
Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs, also by Rohit Saxena et al. – We introduce ClockQA and CalendarQA for testing multimodal LLMs’ temporal reasoning from images, revealing widespread failures and motivating models with better time‑date understanding.

Postdoc Position in Multimodal Foundation Models

Sat, 01 Mar 2025 01:00:00 +0100

Amazing opportunity to join our team at the School of Informatics, University of Edinburgh! The School of Informatics is seeking a Postdoctoral Research Associate to work on evaluating and improving multimodal foundation models, with a particular focus on Vision-Language Models (VLMs).

About the Position

This is a full-time position running until January 2029, fully funded by the AI Hub in Generative Models. The successful candidate will join the Edinburgh NLP Group, one of the best NLP research groups in the world!

Key Details:

Duration: Fixed-term contract until January 2029
Application Deadline: April 8th, 2025, 12:59 AM (UK time)

For more details or to apply, visit https://edin.ac/3DDQK1o

For informal enquiries, feel free to reach out to me directly at p.minervini@ed.ac.uk

November 2024 in Research

Fri, 01 Nov 2024 01:00:00 +0100

My amazing collaborators will be presenting three papers at EMNLP 2024 (main track), a leading conference in natural language processing, happening in Miami later this month! A few weeks ago I also blogged about our ACL 2024, ICML 2024, and CoLM 2024 papers – you can check the post here.

Our work at EMNLP 2024

We will be presenting three papers this year at EMNLP, a flagship NLP conference:

A Simple and Effective $L_{2}$ Norm-Based Strategy for KV Cache Compression, by Yu Zhao, Alessio Devoto, et al. – we introduce a simple strategy for compressing the Key-Value (KV) cache in large language models by utilizing the $L_{2}$ norm of key embeddings; specifically, we found a correlation between low $L_{2}$ norms and high attention scores, allowing them to identify influential KV pairs before querying. For example, here we can see the attention distributions for five heads at layer 9 in Llama2-7B – we can see that the attention scores (top) and the key $L_{2}$ norms (bottom) are highly correlated. — Our method effectively reduces KV cache size by up to 90% without loss of accuracy and is compatible with FlashAttention. This paper will be presented as an Oral – top 8% of the accepted papers! An extended version of this paper will also be presented at the Efficient Natural Language and Speech Processing workshop at NeurIPS 2024! EMNLP Poster
Atomic Inference for NLI with Generated Facts as Atoms, by Joe Stacey et al. – we propose an atomic inference approach for Natural Language Inference (NLI) that decomposes inputs into individual facts or atoms, and explicitly models the entailment relationships between such atoms. Furthermore, we propose a multi-stage fact generation process and a specialized training regime for incorporates such facts, achieving state-of-the-art results in several hard NLI tasks. Our best system, FGLR, produces significantly more robust and accurate results than large-scale language models while providing clear interpretability guarantees by identifying the specific atoms responsible for each prediction! Joe wrote an amazing blog post on this work, check it out!
Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs, by Xin Zhou, Ping Nie et al. – we analyse Mixture-of-Expert (MoE)-based Large Language Models (LLMs) in the context of Retrieval-Augmented Generation (RAG); we identify the groups of experts that are primarily responsible for RAG-related behaviors, such as identifying whether the parametric knowledge is sufficient to solve a given knowledge-intensive task; assessing the quality of retrieved documents; and improving the utilisation of context. Based on these findings, we propose several strategies to improve the efficiency and effectiveness of RAG systems by adjusting expert activations.

What’s brewing

We have several super-interesting works in the pipeline! Here are some of them:

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering, by Yu Zhao et al. – we introduce SpARE, a training-free method that leverages pre-trained sparse auto-encoders (SAEs) to control the knowledge selection behavior of large language models (LLMs) when faced with conflicts between their internal (parametric) knowledge and external (contextual) information. By identifying and manipulating functional features within the LLMs’ internal activations, SpARE can steer the model to prioritize either parametric or contextual knowledge during inference. We show that SpARE is surprisngly effective at resolving knowledge conflicts in open-domain question-answering tasks, producing significantly better results than existing representation engineering and contrastive decoding methods. The insights in this paper are based on another paper, Analysing the Residual Stream of Language Models Under Knowledge Conflicts also by Yu Zhao et al. that will appear in the Workshop on Foundation Model Interventions @ NeurIPS 2024.
Mixtures of In-Context Learners, by Giwon Hong et al. – we propose Mixtures of In-Context Learners (MoICL), a method that trains a set of experts via in-context learning, and learns a weighting function to merge their outputs, addressing many of the limitations of standard in-context learning (ICL). MoICL yields significantly more accurate results than many strong baselines (up to +13% compared to ICL and LENS); reduces inference time by achieving similar performance with fewer demonstrations; and shows greater robustness to out-of-domain, imbalanced, or noisy demonstrations.
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations, by Aryo Gema et al. – we introduce DeCoRe (Decoding by Contrasting Retrieval Heads), a novel, training-free decoding strategy designed to mitigate hallucinations in large language models (LLMs). DeCoRe works by masking specific retrieval heads — attention heads responsible for extracting relevant contextual information — to induce hallucinations, and then contrast the outputs of the base LLM and the masked LLM, using conditional entropy as a guide. DeCoRe significantly improves performance on tasks requiring high contextual faithfulness, such as summarization, instruction following, and open-book question answering, and surprisingly (to us), it also helps with factual recall!
FLARE: Faithful Logic-Aided Reasoning and Exploration, by Erik Arakelyan et al. – we introduce FLARE (Faithful Logic-Aided Reasoning and Exploration), a framework designed to improve the reasoning abilities of LLMs in knowledge-intensive reasoning tasks. FLARE use an intermediate logic programming-inspired representation of the reasoning process by generating Prolog code and simulating a program execution, ensuring that the reasoning process remains faithful and interpretable without relying on external solvers. FLARE achieves state-of-the-art results on seven out of nine diverse reasoning benchmarks, and we identify a strong correlation between the faithfulness of the reasoning process and the downstream model accuracy.

July 2024 in Research

Mon, 01 Jul 2024 02:00:00 +0200

My amazing collaborators will be presenting several works at ACL 2024, ICML 2024, and CoLM 2024 in the upcoming weeks/months!

Our work at ACL 2024

We will be presenting four papers this year at ACL, the flagship NLP conference:

Analysing The Impact of Sequence Composition on Language Model Pre-Training, by Yu Zhao et al. – we analyse several language model pre-training schemes and find out that, e.g., intra-document causal masking helps both in terms of pre-training dynamics, and accuracy on a wide array of downstream tasks! This approach was later adopted by Llama 3, Meta’s flagship language model family. This paper will be presented as an Oral – top 8% of the accepted papers!
SparseFit: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations, by Jesus Solano, Mardhiyah Sanni et al. – we introduce SparseFit, a sparse fine-tuning method that uses few-shot prompting and discrete prompts to efficiently generate both predictions and natural language explanations (NLEs) with large pre-trained language models, achieving competitive performance while significantly reducing the number of fine-tuning parameters! [poster]
Using Natural Language Explanations to Improve Robustness of In-context Learning, by Xuanli He et al. – we found that integrating NLEs into in-context learning significantly improves the robustness of large language models against adversarial inputs, and show that generating NLEs with frontier models in a few-shot setting can significantly improve accuracy on challenging natural language inference tasks compared to traditional in-context learning and human-generated NLEs.
Probing the Emergence of Cross-lingual Alignment during LLM Training, by Hetong Wang et al. – we analyse how cross-lingual alignment emerges during the training of multilingual large language models by probing neuron activity in different languages. We find that higher neuron overlap between languages correlates strongly with improved zero-shot cross-lingual transfer performance, but also identifies phases during training where both alignment and performance degrade, offering new insights into the dynamics of multilingual model training! [poster]

Our work at ICML 2024

We will be presenting three works this year at ICML – one in the main conference and two in co-located workshops:

On the Independence Assumption in Neurosymbolic Learning, by Emile van Krieken et al. – we analyse the common assumption in neurosymbolic learning that symbols are conditionally independent given the input, and argue that this assumption biases models towards deterministic solutions and limits their ability to express uncertainty.
Attention Is All You Need But You Don’t Need All Of It For Inference of Large Language Models, by Georgy Tyukin et al. – we investigate the effects of removing MLP and attention layers in large language models during inference, finding that removing deeper attention layers can only marginally reduce performance while significantly improving inference speed!
An Auditing Test to Detect Behavioral Shift in Language Models, by Leo Richter et al. – we propose a continuous online auditing framework to detect behavioural shifts in language models, ensuring that deployed models remain aligned with societal values and preventing vendors or attackers from covertly deploying unaligned models for malicious purposes.

Our work at CoLM 2024

The Conference on Language Modeling (CoLM) is a very new thing. I have been area-chairing for CoLM this year, and I’m really impressed by the quality of all submissions! We will be presenting two papers:

Forklift: An Extensible Neural Lifter, by Jordi Armengol-Estapé et al. – we introduce Forklift, a framework that uses neural models to translate assembly code across different instruction set architectures by “lifting” source assembly code into an intermediate representation, thereby reducing the engineering effort required for cross-architecture software migration!
Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models, by Simon Yu, Jie He et al. – we analyse the adversarial robustness of retrieval-based in-context learning, finding that while retrieval-augmented methods improve robustness against test sample attacks, they increase vulnerability to adversarially perturbed demonstrations; to address this, we propose a new training-free defence method, which significantly improves adversarial robustness.

Looking for Postdocs, June 2024 Edition

Sat, 01 Jun 2024 02:00:00 +0200

We have an opening for a 3-year postdoc – more details are available here – on a project funded by Huawei via the Huawei-Edinburgh Joint Lab initiative, with me as the Principal Investigator (PI).

The researcher will work on projects involving the design and application of improving the robustness and trustworthiness of Large Language Models when solving complex reasoning tasks, while improving their explainability and generalisation properties. They will be part of the Edinburgh NLP Group, a world-leading research group in Natural Language Processing.

Looking for Postdocs!

Tue, 01 Nov 2022 01:00:00 +0100

We have an opening for a 2-year postdoc – more details are available here – on a project titled Gradient-based Learning of Complex Latent Structures, with me as the Principal Investigator (PI), and Antonio Vergari (IANC) and Edoardo Ponti (ILCC) as co-PIs. The position is entirely funded by the Edinburgh Laboratory for Integrated Artificial Intelligence (ELIAI) – if you want to know more, feel free to reach out!

You can apply at this link.

Project description

Imposing structural constraints on the latent representations learned by deep neural models has several applications, which can improve their explainability, their robustness, and their ability to generalise to out-of-domain distributions. For example, we can learn more explainable models by making them selectively decide which parts of the input to consider; and we can improve their generalisation properties by learning representations suitable for reasoning tasks, such as deductive reasoning and planning, and comply with any desired constraints. For instance, the intermediate structure can represent a relational graph between objects in the world; the relationships between multiple sub-questions in a complex question; or computation graphs which can be executed to produce a prediction.

In this project, we aim to investigate how we can derive better methods for back-propagating through mixed continuous-discrete complex latent structures, and how we can leverage them for learning more explainable, data-efficient, and robust deep neural models. The reason why discrete latent representations are not widely adopted by deep neural models is that they tend to not interact well with gradient-based optimisation methods, but this started to change recently (e.g., see Niepert et al., 2021; Minervini et al. 2022), enabling a wide range of applications and use cases.

Position

The post holder will work on projects involving the design and application of deep learning models with discrete latent structures for improving their explainability, generalisation, and robustness properties. They will be part of the new Edinburgh Laboratory for Integrated Artificial Intelligence and the Edinburgh NLP Group, a world-leading research group in Natural Language Processing.

The School of Informatics is one of the largest research centres in Computer Science in Europe, and it has been ranked #1 in the UK in terms of research power by a large margin. The Edinburgh NLP Group is consistently ranked among the world’s leading research groups in Natural Language Processing. We are offering an exciting opportunity to work in an interdisciplinary, collaborative, friendly, and supportive environment, integrating different sub-fields of Computer Science and Artificial Intelligence.

PhD Projects

Sat, 01 Oct 2022 02:00:00 +0200

As mentioned here, in September 2022 I joined the Institute for Language, Cognition and Communication (ILCC) at the School of Informatics, University of Edinburgh, one of the world’s best schools in NLP and related areas, as a faculty member in NLP! If you are interested in working with me, I have funding for multiple PhD students: make sure to apply either to the UKRI CDT in Natural Language Processing or to the ILCC 3-year PhD program!

Some more details on the ILCC PhD program – there are two deadlines for applying: the first round is on 25th November 2022, and the second round is on 27th January 2023. I strongly recommend that non-UK applicants submit their applications in the first round, to maximise their chances of funding.

Regarding the NLP CDT program – there are also two deadlines for applying: the first round is on 25th November 2022, and the second round is on 27th January 2023. Likewise, I strongly recommend that non-UK applicants submit their applications in the first round, to maximise their chances of funding.

If you are interested in working with me, you can apply via the ILCC PhD program’s and the NLP CDT program’s application portals. You will be asked to submit a research proposal: this is mostly used for assessing candidate PhD students and for matching them with potential faculty supervisor, and you can decide to work on different problems during your PhD. If you would like some feedback on your research proposal, get in touch!

In the following there’s a (non-exhaustive but fairly up-to-date) list of PhD topics we may decide to work on – this list is also available on the Possible PhD topics in ILCC page. An older list of possible research topics is also available at this link, and feel free to propose new project topics that intest you! I’m always happy to explore new directions!

Open-Domain Complex Question Answering at Scale

Open-Domain Question Answering (ODQA) is a task where a system needs to generate the answer to a given general-domain question, and the evidence is not given as input to the system. A core limitation of modern ODQA models (and, more generally, of all models for solving knowledge-intensive tasks) is that they remain limited to answering simple, factoid questions, where the answer to the question is explicit in a single piece of evidence. In contrast, complex questions involve aggregating information from multiple documents, requiring some form of logical reasoning and sequential, multi-hop processing in order to generate the answer. Projects in this area involve proposing new ODQA models for answering complex questions, for example, by taking inspiration from models for answering complex queries in Knowledge Graphs (Arakaleyan et al., 2021; Minervini et al., 2022a) and Neural Theorem Provers (Minervini et al., 2020a; Minervini et al., 2020b) and proposing methods by which neural ODQA models can learn to search in massively large text corpora, such as the entire Web.

Neuro-Symbolic and Hybrid Discrete-Continuous Natural Language Processing Models

Incorporating discrete components, such as discrete decision steps and symbolic reasoning algorithms, in neural models can significantly improve their interpretability, data efficiency, and predictive properties — for example, see (Niepert et al., 2021; Minervini et al., 2022b; Minervini et al., 2020a; Minervini et al., 2020b). However, approaches in this space rely either on ad-hoc continuous relaxations (e.g., Minervini et al., 2020a, Minervini et al., 2020b) or on gradient estimation techniques that require some assumptions on the distributions of the discrete variables (Niepert et al., 2021; Minervini et al., 2022b). Projects in this area involve devising neuro-symbolic approaches for solving NLP tasks that require some degree of reasoning and compositionality and identifying gradient estimation techniques (for back-propagating through discrete decision steps) that are both data-efficient, hyperparameter-free, accurate, and require fewer assumptions on the distribution of the discrete variables.

Learning from Graph-Structured Data

Graph-structured data is everywhere – e.g. consider Knowledge Graphs, social networks, protein and drug interaction networks, and molecular profiles. In this project, we aim to improve models for learning from graph-structured data and their evaluation protocols. Projects in this area involve incorporating invariances and constraints in graph machine learning models (e.g., see Minervini et al., 2017), proposing methods of transferring knowledge between graph representations, automatically identifying functional inductive biases for learning from graphs from a given domain (such as Knowledge Graphs – for example, see our NeurIPS 2022 paper on incorporating the inductive biases used by factorisation-based models into GNNs) and proposing techniques for explaining the output of black-box graph machine learning methods (such as graph embeddings).

Call for PhD Students

Fri, 01 Oct 2021 02:00:00 +0200

From September 2022, I will join the Institute for Language, Cognition and Communication (ILCC) at the School of Informatics, University of Edinburgh!

And there is more! I have funding for multiple PhD students: if you are interested in working with me, make sure to apply either to the UKRI CDT in Natural Language Processing or to the ILCC 3-year PhD program.

In general, I care about anything that can help Deep Learning models become more data-efficient, statistically robust, and explainable. As Artificial Intelligence and Machine Learning systems become more pervasive in areas like critical infrastructures, education, and healthcare, there is an increasing need of AI-based systems that we can trust. For example, the European Union is working on a new set of regulations that will enforce AI-based systems used in high-risk areas to be able to produce high-quality explanations to their users and high levels of robustness and accuracy, among other things. This will automatically exclude the vast majority of the Deep Learning systems that we love and work with on a daily basis.

My research focuses about filling this gap, and developing Deep Learning systems that can produce faithful explanations, that can learn from fewer examples (e.g. thanks to stronger inductive biases), and that can work even on out-of-distribution samples (such as adversarial inputs).

Probably you may want to know a bit more about my research so far in these directions – here are some pointers. Let me now if any of these clicks with you, and feel free to reach out!

Bridging Neural and Symbolic Computation

One way I am trying to address some of the limitations of modern Deep Learning models is by designing hybrid approaches, that inheret the strength of both neural and symbolic systems.

For example, let’s consider the problem of answering complex symbolic queries on (potentially very large) Knowledge Graphs. In our paper Complex Query Answering with Neural Link Predictors, presented at ICLR 2021, we presented an hybrid approach where the query answering task is reduced to solving an optimisation problem whose structure follows the compositional logic structure of the query. Using orders of magnitude less training data, our approach obtains significant improvements in comparison with the purely-neural state-of-the-art models developed in this space, while also being able to produce faithful explanations to its users. This paper obtained an Outstanding Paper Award at ICLR 2021.

Or, for example, let’s consider the problem of deductive reasoning – i.e. deriving logical conclusions. Previous research shows that even BERT-based models do not generalise properly when required to perform reasoning tasks that differ from these observed during training – e.g. because they require composing multiple reasoning patterns, that were never observed together at training time. We proposed several approaches for solving this problem, by designing neural models whose behaviour mimics the behaviour of logic deductive reasoners. Our approaches enable neural models to perform multi-hop reasoning over multiple documents (ACL 2019), and learn logic rules from graph-structured data (ICML 2020 and AAAI 2020).

More recently, we were wondering whether it could be possible to incorporate black-box algorithmic components, like Dijkstra’s shortest path algorithm or any ILP solver in a neural model. In our paper Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions, presented at NeurIPS 2021, we developed a very general (and extremely simple!) method for back-propagating through a massive variety of algorithmic components, effectively allowing neural models to use them as off-the-shelf components. See our presentation of this paper, as well as Yannic Kilcher’s explanation.

Incorporating Constraints in Neural Models

Some other times, we would like a neural model to comply with a given set constraints. For example, we would like that, when our model predicts that $X$ is a parent of $Y$ *, and *$Y$ is a parent of $Z$, we would also like it to predict that $X$ is a grandparent of $Z$. Constraints are key for developing statistically robust model – for example, think of adversarial perturbations in computer vision. In the case of adversarial perturbations, the model is essentially violating a single constraint, i.e. given an image $X$, if $Y$ is a semantically-invariant perturbation of $X$, the model should produce the same output for both $X$ and $Y$.

In our paper Adversarial Sets for Regularising Neural Link Predictors, presented at UAI 2017, we presented the first method for incorporating arbitrary constraints encoded in the form of First-Order Logic rules in a wide class of neural models. Our idea is very simple and general: during training, at each step, we can define an adversary that finds on which inputs the model maximally violates a given constraint, and then require the model to reduce the degree of such violations. We also show that, for a wide class of models and constraint types, we can have efficient and globally-optimal solutions to the problem of finding where the model maximally violates a constraint. This is pretty amazing, since (1) it makes the training procedure extremely efficient, adding very little overhead, and (2) if the search process does not return any significant violation of a constraint, it means that the model will never violate that constraint, for every possible input it may encounter. This provides a way of producing some kind of safety guarantees for a large set of neural models, which are very desirable in a lot of high-risk settings.

We explored further applications of these ideas in several settings. For example, in Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge (CoNLL 2018), we show that some common-sense reasoning patterns can also be represented as constraints, and incporporating these in neural Natural Language Inference (NLI) models yields improvements both on in-distribution and out-of-distribution data. In Gone At Last: Removing the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training (EMNLP 2020), we show that we can use ensembles of adversaries for de-biasing neural NLI models. In Undersensitivity in Neural Reading Comprehension (Findings of EMNLP 2020), we found that neural Question Answering (QA) models can often ignore semantically meaningful variations in the input questions, and proposed a related training process for correcting such behaviour. In Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations (ACL 2020), we identified that models for producing natural language explanations often violate self-consistency constraints, and can produce mutually inconsistent explanations.

Some notes on Gaussian Fields and Label Propagation

Sun, 01 Jan 2017 01:00:00 +0100

In several occasions, we find ourselves in need of propagating information among nodes in an undirected graph.

For instance, consider graph-based Semi-Supervised Learning (SSL): here, labeled and unlabeled examples are represented by an undirected graph, referred to as the similarity graph.

The task consists in finding a label assignment to all examples, such that:

The final labeling is consistent with training data (e.g. positive training examples are still classified as positive at the end of the learning process), and
Similar examples are assigned similar labels: this is referred to as the semi-supervised smoothness assumption.

Similarly, in networked data such as social networks, we might assume that related entities (such as friends) are associated to similar attributes (such as political and religious views, musical tastes and so on): in social network analysis, this phenomenon is commonly referred to as homophily (love of the same).

In both cases, propagating information from a limited set of nodes in a graph to all nodes provides a method for predicting the attributes of such nodes, when this information is missing.

In the following, we introduce a really clever method for efficiently propagating information about nodes in undirected graphs, known as the Gaussian Fields method.

Propagation as a Cost Minimization Problem

We now cast the propagation problem as a binary classification task. Let $X = \{ x_{1}, x_{2}, \ldots, x_{n} \}$ be a set of $n$ instances, of which only $l$ are labeled: $X^{+}$ are positive examples, while $X^{-}$ are negative examples

Similarity relations between instances can be represented by means of an undirected similarity graph having adjacency matrix $\mathbf{W} \in \mathbb{R}^{n \times n}$: if two instances are connected in the similarity graph, it means that they are considered similar, and should be assigned the same label. Specifically, $\mathbf{W}_{ij} > 0$ iff the instances $x_{i}, x_{j} \in X$ are connected by an edge in the similarity graph, and $\mathbf{W}_{ij} = 0$ otherwise.

Let $y_{i} \in \{ \pm 1 \}$ be the label assigned to the $i$-th instance $x_{i} \in X$. We can encode our assumption that similar instances should be assigned similar labels by defining a quadratic cost function over labeling functions in the form $f : X \mapsto \{ \pm 1 \}$:

\[E(f) = \frac{1}{2} \sum_{x_{i} \in X} \sum_{x_{j} \in X} \mathbf{W}_{ij} \left[ f(x_{i}) - f(x_{j}) \right]^{2}.\]

Given an input labeling function $f$, the cost function $E(\cdot)$ associates, for each pair of instances $x_{i}, x_{j} \in X$, a non-negative cost $\mathbf{W}_{ij} \left[ f(x_{i}) - f(x_{j}) \right]$: this quantity is $0$ when $\mathbf{W}_{ij} = 0$ (i.e. $x_{i}$ and $X_{j}$ are not linked in the similarity graph), or when $f(x_{i}) = f(x_{j})$ (i.e. they are assigned the same label).

For such a reason, the cost function $E(\cdot)$ favors labeling functions that are more likely to assign the same labels to instances that are linked by an edge in the similarity graph.

Now, the problem of finding a labeling function that is both consistent with training labels, and assigns similar labels to similar instances, can be cast as a cost minimization problem. Let’s represent a labeling function $f$ by a vector $\mathbf{f} \in \mathbb{R}^{n}$, $L \subset X$ denote labeled instances, and $\mathbf{y}_{i} \in \{ \pm 1 \}$ denote the label of the $x_{i}$-th instance. The optimization problem can be defined as follows:

\[\begin{aligned} & \underset{\mathbf{f} \in \{ \pm 1 \}^{n}}{\text{minimize}} & & E(\mathbf{f}) \\ & \text{subject to} & & \forall x \in L: \; \mathbf{f}_{i} = \mathbf{y}_{i}. \end{aligned}\]

The constraint $\forall x \in L : \mathbf{f}_{i} = \mathbf{y}_{i}$ enforces the label of each labeled example $x_{i} \in L$ to $\mathbf{f}_{i} = +1$ if the instance has a positive label, and to $\mathbf{f}_{i} = -1$ if the instance has a negative label, so to achieve consistency with training labels.

However, constraining labeling functions $f$ to only take discrete values has two main drawbacks:

Each function $f$ can only provide hard classifications, without yielding any measure of confidence in the provided classification.
The cost term $E(\cdot)$ can be hard to optimize in a multi-label classification setting.

For overcoming such limitations, Zhu et al. propose a continuous relaxation of the previous optimization problem:

\[\begin{aligned} & \underset{\mathbf{f} \in \mathbb{R}^{n}}{\text{minimize}} & & E(\mathbf{f}) \\ & \text{subject to} & & \forall x \in L: \; \mathbf{f}_{i} = \mathbf{y}_{i}, \end{aligned}\]

where the term $\sum_{x_{i} \in X} \mathbf{f}_{i}^{2} = \mathbf{f}^{T} \mathbf{f}$ is a $L_{2}$ regularizer over $\mathbf{f}$, weighted by a parameter $\epsilon > 0$ which ensures that the optimization problem has a unique global solution.

The parameter $\epsilon$ can be interpreted as the decay of the propagation process: as the distance from a labeled instance within the similarity graph increases, the confidence in the classification (as measured by the continuous label) gets closer to zero.

This optimization problem has a unique, global solution that can be calculated in closed-form. Specifically, the optimal (relaxed) discriminant function $f : X \mapsto \mathbb{R}$ is given by $\mathbf{\hat{f}} = \left[ \mathbf{f}_{L}, \mathbf{f}_{U} \right]^{T}$, where $\mathbf{\hat{f}}_{L} = \mathbf{y}_{L}$ (i.e. labels for labeled examples in $L$ coincide with training labels), while $\mathbf{\hat{f}}_{U}$ is given by:

\[\mathbf{\hat{f}}_{U} = (\mathbf{L}_{UU} + \epsilon \mathbf{I})^{-1} \mathbf{W}_{UL} \mathbf{\hat{f}}_{L},\]

where $\mathbf{L} = \mathbf{D} - \mathbf{W}$ is the graph Laplacian of the similarity graph with adjacency matrix $\mathbf{W}$, and $\mathbf{D}$ is a diagonal matrix such that $\mathbf{D}_{ii} = \sum_{j} \mathbf{W}_{ij}$.