JMS 630 (please watch for possible last-minute room changes onsite)
James McCune Smith Learning Hub, University of Glasgow, Glasgow, Scotland
Sunday, 10 August, 2025
11 AM to 5 PM
Invited Talks Workshop

Co-chairs

Vijay Ganesh
Vijay Ganesh
Professor
Georgia Tech
Stefan Szeider
Stefan Szeider
Professor
TU Wien

Invited Talks

Martin Suda
Martin Suda
Senior Researcher
CTU/CIIRC
Nguyen Dang
Nguyen Dang
Lecturer
University of St Andrews
Mikoláš Janota
Mikoláš Janota
Senior Researcher
CTU/CIIRC
Sean Holden
Sean Holden
University Associate Professor
University of Cambridge
Bistra Dilkina
Bistra Dilkina
Associate Professor
University of Southern California

Overview

Learning and reasoning have been the two foundational pillars of AI since its inception, yet it is only in the past decade that interactions between these fields have become increasingly prominent. In particular, machine learning (ML) has had a substantial impact on SAT/SMT and CP solvers, as well as automated theorem provers. Recent advances have demonstrated the power of ML to inform solver heuristics, guide proof search, and optimize algorithm portfolios.

Despite growing interest in this direction, work on ML for solvers and provers is often scattered across multiple research communities — SAT, SMT, CP, theorem proving, formal methods, and machine learning - with few opportunities for focused interaction. This workshop aims to bring together researchers and practitioners working at the intersection of machine learning and formal reasoning systems. It provides a forum for the presentation of recent work, the exchange of ideas, and the fostering of collaboration between these communities.


Schedule

Time Event Speaker
11:00 - 11:45 Machine Learning Guidance for an Automatic Theorem Prover Martin Suda
11:45 - 12:30 Automated Streamliner Selection via Algorithm Configuration and Selection Nguyen Dang
12:30 - 13:30 Lunch Break  
13:30 - 14:15 Machine Learning for Quantifiers Mikoláš Janota
14:15 - 15:00 Applying Machine Learning to Improve SAT Solvers: Some Highlights Sean Holden
15:00 - 15:30 Coffee Break  
15:30 - 16:15 ML-guided search for Mixed Integer Linear Programming Bistra Dilkina
16:15 - 17:00 Panel: Future of Machine Learning for Solvers/Provers Panelists: Nguyen Dang, Sean Holden, Mikoláš Janota, Martin Suda, Stefan Szeider
Moderator: Vijay Ganesh

Talk Details

Machine Learning Guidance for an Automatic Theorem Prover

Speaker: Martin Suda

In automatic theorem provers (ATPs) based on saturation—the predominant paradigm—the most promising applications of machine learning (ML) target the so-called clause selection heuristic, which helps decide which lemma to consider next for inference. The most promising systems in this category invariably learn from past successes: the lemmas that ultimately appeared in discovered proofs. In this talk, I will outline how such systems work, including those based on supervised learning and those inspired by reinforcement learning. I will also share general insights from my experience developing a neurally guided clause selection heuristic in the ATP Vampire.

Speaker Bio:
Martin Suda is a senior researcher at the CIIRC institute of the Czech Technical University in Prague and the head of the Automated Reasoning Group there. His primary research interest is automated theorem proving and how it can be boosted through the techniques of machine learning. He is one of the main developers of the award-winning automatic theorem prover Vampire.

Automated Streamliner Selection via Algorithm Configuration and Selection

Speaker: Nguyen Dang

Many algorithms have parameters that can significantly impact their performance. Automated algorithm configuration is a family of general-purpose techniques aimed at finding the best parameter setting of an algorithm on a given problem. Automated algorithm selection, a closely related area, focuses on selecting the best algorithm among a given set of algorithms for a specific instance of a problem. Owing to their general-purpose nature, these techniques are broadly applicable across a wide range of domains. In this talk, I will present our work on combining algorithm configuration and algorithm selection to automate the generation and selection of streamliner constraints. Streamliners are heuristic constraints added to a constraint model to reduce the search space, resulting in solving speedup. However, since streamliners do not preserve correctness, they must be selected carefully to avoid pruning valid solutions. Our empirical results indicate the effectiveness of our approach where substantial solving speed up is achieved in various cases.

Speaker Bio:
Nguyen Dang is a Lecturer (Assistant Professor) at the University of St Andrews. She received a PhD degree from KU Leuven in 2018 and was awarded a Leverhulme Early Career fellowship in 2020. Her research interests include automated algorithm configuration, automated algorithm selection, as well as their applications in both constraint programming and black-box optimisation. Her work in automated algorithm configuration has received best paper awards at GECCO’2017 and GECCO’2022, and nominations for best paper award at FOGA’2023 and GECCO’2025.

Machine Learning for Quantifiers

Speaker: Mikoláš Janota

Reasoning with quantifiers is inherently difficult. Adding quantifiers to a language typically significantly increases its computational complexity or leads to undecidability. Yet, quantifiers enable us to express properties over large or infinite domains. In this talk we will look at ML techniques that attempt to help automated reasoning with quantifiers. We will focus mainly on Quantified Boolean Formulas and Satisfiability Modulo Theories, where quantifiers are tackled by instantiation-based techniques.

Speaker Bio:
Mikoláš Janota is a researcher at the Czech Institute of Informatics, Robotics and Cybernetics (CIIRC) at the Czech Technical University in Prague, where he leads the Formal Methods group. His work focuses on automated reasoning, formal verification, and logic-based problem solving, including SAT, MaxSAT, QBF, and SMT. He has contributed to tools like Z3 and developed award-winning solvers such as RAReQS and QESTO. With a background spanning Charles University, University College Dublin, and postdoctoral roles in Portugal and the UK, he currently explores the integration of machine learning with logic solvers under an ERC CZ grant.

Applying Machine Learning to Improve SAT Solvers: Some Highlights

Speaker: Sean Holden

Modern state-of-the-art SAT-solvers rely on numerous sophisticated data-structures and algorithms to achieve their performance. Several of the algorithms involved can be considered machine learning algorithms, and in recent years a great deal of effort has been focused on using machine learning directly to improve the performance of SAT solvers. This talk will present a selection of highlights from research in this area.

Speaker Bio:
Sean Holden is University Associate Professor of Computer Science at the University of Cambridge, and Director of Studies in Computing at Trinity College, Cambridge. He has published in various areas of theoretical and applied machine learning, including computational learning theory, Bayesian inference and applications in drug design, sports science, wearable computing and organelle proteomics. He currently works on machine learning for automated theorem proving, and is responsible for the Connect++ connection prover.

ML-guided search for Mixed Integer Linear Programming

Speaker: Bistra Dilkina

Many real-world applications that require combinatorial optimization involve solving repeatedly similar instances of the same problem over different data (objective and constraint coefficients). Hence, they provide the opportunity to learn to search more effectively by leveraging historical instances. We design approaches to effectively augment established state-of-the-art Mixed Integer Linear programming (MILP) solvers with ML-guided components to significantly improve performance. In particular, many important (heuristic) tasks in MILP solving involve choosing subsets of variables, and we demonstrate that Contrastive Loss is particularly well-suited for training in this setting by learning from both positive and negative examples of candidate sets. We will further discuss the potential of learning across multiple domains when historical examples are limited, and the impressive power of multi-task learning across different ML-guided solving techniques for any given problem domain, setting a key stepping stone to a foundation model for ML-guided MILP solving.

Speaker Bio: Dr. Bistra Dilkina is an associate professor of computer science at the University of Southern California (USC), co-director of the USC Center of AI in Society, and leads the USC Site for the NSF AI Institute for Advances in Optimization (AI4OPT). Her research and teaching are centered around the integration of machine learning and discrete optimization, with a strong focus on AI applications in computational sustainability and social impact, such as wildlife conservation and disaster resilience. Her research has contributed significant advances to machine-learning-guided combinatorial solving, including mathematical programming and AI planning, as well as to decision-focused learning, where combinatorial reasoning is integrated into machine-learning pipelines. 


Scope

Topics of interest include, but are not limited to, the use of various machine learning (ML) techniques in:


Organizers

Vijay Ganesh
Vijay Ganesh
Professor
Georgia Tech
Stefan Szeider
Stefan Szeider
Professor
TU Wien
John Zhengyang Lu
John Zhengyang Lu
PhD Student
UWaterloo
Piyush Jha
Piyush Jha
PhD Student
Georgia Tech