Matej Jusup

PhD Student

Highlights

Co-developed, at Google DeepMind, the first LLM that plays chess at the world champion level with human-comparable planning efficiency; it is hosted as the Gemini Chess Gem.
Designed a scalable probabilistic decision-making model for safe real-time multi-agent fleet control.
Demonstrated coordination of 10,000+ autonomous vehicles with sub-second planning latency.
PhD with 5 years of industry experience, including a leadership position.
Silver medalist at the Croatian junior (under 20 years) chess championship.
Ranked in the 99.999th percentile among over 100 million registered users on www.chess.com.

About Me

I obtained my PhD at ETH Zurich, working on safe and scalable multi-agent reinforcement learning, and was an Associated Researcher at the ETH AI Center, supervised by Prof. Francesco Corman and Prof. Andreas Krause.

During my PhD, I co-developed the Gemini Chess Gem, the first language model to reach Grandmaster-level performance in Chess with human-comparable planning efficiency. It integrates search-based planning techniques to enhance multi-step reasoning in games like Chess, Chess960, Connect Four, and Hex. This work was part of my time as a Student Researcher on the Gemini Post-Training Team at Google DeepMind, hosted by Eric Malmi and Aliaksei Severyn.

Before starting my PhD, I worked as a Machine Learning Researcher at Morgan Stanley and as a Senior Machine Learning Researcher, leading a team of four, at a tech startup, Cantab Predictive Intelligence.

Interests

Artificial Intelligence
Reinforcement Learning
Large Language Models (LLMs)
Planning and Reasoning (with LLMs)
Inference Time Methods
Sequential Decision Making
Multi-Agent Systems
Probabilistic Learning
Safe Learning
Data-Driven Algorithms
AI in Board Games
Mean-Field Control

Education

PhD in Artificial Intelligence, ETH Zurich
MSc in Mathematical Statistics, University of Zagreb
Visiting Student, University of Bielefeld
BSc in Mathematics, University of Zagreb

CroAI – ML Pub Meetup

Zagreb, Croatia

Jun 3, 2025

Invited talk on Superhuman Planning with Large Language Models hosted by CroAI.

ETH Zurich AI Center Associated Researchers Meetup

Zurich, Switzerland

May 22, 2025

Invited talk on Mastering Board Games With Language Models hosted by ETH AI Center.

ZurichNLP Meetup

Zurich, Switzerland

Feb 20, 2025

Invited talk on Mastering Board Games With Language Models hosted by ZurichAI.

Google DeepMind Booth at NeurIPS 2024

Vancouver, Canada

Dec 11, 2024

A talk on Mastering Chess With Language Models.

The 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 2024)

Auckland, New Zealand

May 10, 2024

A conference talk on Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning.

Experience

Student Researcher

Google DeepMind – Gemini Post-Training Team

Zurich, Switzerland April 2024 – September 2024
- Key Contribution: The first LLM that plays chess at the world champion level using a human-comparable search budget (Gemini Chess Gem).
- Hosts: Eric Malmi and Aliaksei Severyn
- Publication: Co-first author of a spotlight paper at ICML 2025: https://arxiv.org/abs/2412.12119
- Planning with LLMs: Enhanced LLMs with search-based planning techniques to improve multi-step reasoning.
- Asynchronous MCTS: Introduced dynamic virtual counts to balance exploration–exploitation with few simulations.
- Prompt Engineering: Assisted in designing board-game prompts and test-time internal search linearization.
- Technology Stack: Python, Transformer Pre-Training, Supervised Fine-Tuning, Tree-Search Methods

Senior Machine Learning Researcher

Cantab Predictive Intelligence (tech startup)

Zagreb & Cambridge March 2019 – July 2020
- Key Contribution: Led a team of four researchers across several projects running in parallel.
- Behavioral Credit Scoring: Gradient-boosting model for default risk, achieving a market-leading Gini of 75%.
- AI-Driven Marketing: Boosted heart drug sales by 10% via a data-driven, A/B-tested campaign for a pharma client.
- Personalized Newsletter: Built a hybrid recommender (content-based + collaborative); 1.5% CTR in PoC.
- Delivery Delay Estimation: Predicted COVID-era mall delays using ARIMA and supervised learning.
- Technology Stack: Python, PyTorch, PySpark, Databricks, Statsmodels, AWS/Azure, Sklearn, Numpy, Pandas, Git

Machine Learning Researcher

Morgan Stanley

Budapest, Hungary October 2017 – February 2019
- Key Contribution: Built scalable models for risk, liquidity, and trade execution in financial systems.
- Systemic Risk Model: Built a parallel hill-climbing heuristic that solves the problem in 3 minutes, with an average gap of 5% from the optimum.
- Cash Traceability System: Developed a real-time uncollateralized debt tracker from daily data feeds.
- E-Trading Limits Calibration: Tuned model to block high-risk trades via statistical analysis of client behavior.
- Listed Derivatives Liquidity: Developed a PoC liquidation model driven by intraday futures data.
- Technology Stack: Python, CPLEX, OR-Tools, Q/kdb+, PyQ, SQL, Pandas

Software Engineer

Morgan Stanley

New York, London & Budapest August 2016 – September 2017
- Annual Grad Program: Participated in a 15-week program for 50 globally selected students.
- Margin Calculator Microservice: Implemented and unit-tested features for NYSE and HGK stock exchanges.
- Technology Stack: Java, C++, Spring Beans, JUnit

Junior Teaching Assistant

University of Zagreb

Zagreb, Croatia October 2013 – March 2014
- Euclidean Spaces: Delivered problem-solving lectures after achieving the top score in a class of 70.

Education

PhD in Artificial Intelligence

ETH Zurich

Zurich, Switzerland September 2020 – October 2025
- Key Contribution: Operating a fleet of tens of thousands of agents in real time while satisfying safety constraints.
- Thesis: Safe and Scalable Ride-Sourcing Vehicle Rebalancing: A Constrained Mean-Field RL Approach (defense slides)
- Supervisors: Prof. Francesco Corman and Prof. Andreas Krause
- Research Area: Reinforcement Learning, Multi-Agent Systems, Sequential Decision Making, Data-Driven Algorithms

MSc in Mathematical Statistics

University of Zagreb

Zagreb, Croatia October 2013 – February 2017
- Thesis: Network Optimization in Railway Transport Planning
- Supervisor: Prof. Marko Vrdoljak
- Distinction: Graduated with honors.

Visiting Student

University of Bielefeld

Bielefeld, Germany September 2015 – July 2016
- Research Visit: Two semesters funded by Erasmus+ during which I wrote my MSc thesis.
- Host: Prof. Andreas Dress

BSc in Mathematics

University of Zagreb

Zagreb, Croatia October 2010 – July 2013

Skills

Research

RL

LLMs

MCTS

Safe RL

MFRL

MFC

BO

Programming

Python

C++

SQL

Java

C

Bash & CLI

Q/kdb+

Cloud & Packages

Git

PyTorch

AWS & Databricks

Numpy

Sklearn

Pandas

PySpark

Languages

English: 90%
Croatian: 100%
German: 10%

Challenge

Find the solution: White to move and win.

I am happy to hear your solution if you can solve it, even with the assistance of an engine! I am also open to discussing why many modern engines fail to solve it.

Computer Chess:

I co-developed the Chess Champ Gem for Gemini, which enhances language models with search-based planning techniques to improve multi-step reasoning in board games such as Chess, Chess960, Connect Four, and Hex. It achieves Grandmaster-level performance in Chess while searching a number of moves per decision comparable to that of human players.

Notable Achievements:

Silver medalist at the individual Croatian junior (under 20 years) championship in 2011.
Played in Croatian, German, Hungarian and Swiss leagues.
A personal best Elo rating of 2585 on www.chess.com ranks me among the top 3,000 players on the platform, out of over 100 million registered users (99.999th percentile).
My official Elo rating of 2250 places me among the top 3% of registered chess players worldwide.
