Ryan Pégoud

Ryan Pégoud

MSc Student in Computational Statistics and Machine Learning

University College London

Biography

I’m currently studying in the Computational Statistics and Machine Learning MSc program at UCL. I’m looking forward to pursue a PhD in Reinforcement Learning (RL) and am particularly interested in using methods from Open-Endedness to solve alignment from an RL perspective. For more details, feel free to take a look at my research proposal.

Previously, I was a lecturer at EPF Engineering School in charge of the Natural Language Processing module, while working on independent research in Reinforcement Learning.

I completed an MEng in Data Engineering at EPF Engineering School during which I worked on anomaly detection in time series at BMW and multilingual text classification at CEWE.

Interests
  • Reinforcement Learning
  • Open-endedness
  • Curriculum Learning
  • Alignment / AI Safety
Education
  • MSc in Computational Statistics and Machine Learning, 2024 - 2025

    University College London

  • MEng in Data Engineering, 2018 - 2023

    EPF Engineering School

Recent Publications

(2024). Syllabus: Portable Curricula for Reinforcement Learning Agents. ArXiv preprint.

PDF Code ArXiv

(2024). Better gradient steps for deep on-policy reinforcement learning. ICML, Aligning Reinforcement Learning Experimentalists and Theorists workshop.

PDF Cite

Experience

 
 
 
 
 
EPF Engineering School
Part-time Lecturer in Natural Language Processing
EPF Engineering School
September 2023 – Present Montpellier, France

Course: Natural Language Processing (NLP)

  • Responsibilites: Module Leader, Module Lecturer, Exam-Setting for a class of 25 MSc students

Curriculum (15h):

  • Introduction to NLP and preprocessing pipelines
  • Vectorization methods and machine Learning for NLP
  • Word Embeddings and Text Similarity
  • Sequence-to-Sequence Deep Learning models
 
 
 
 
 
BMW Group
Data Scientist Intern
BMW Group
February 2023 – August 2023 Munich, Germany

Master thesis title: “Time Series Based Anomaly Detection For Fleet Connectivity”

Grade: 20/20 (100%)

  • Development and evaluation of statistical and ML-based methods to detect vehicle connectivity anomalies in the networked ConnectedDrive fleet.
  • Development of ETL pipelines in the context of Big Data to provide data for analysis.
  • Design and development of dashboards for visualization and stakeholder-adequate reports.
  • Presentation of the evaluated methods and results in project and management rounds.
 
 
 
 
 
CEWE Stiftung & Co. KGaA
Data Scientist (Remote Student Job)
CEWE Stiftung & Co. KGaA
June 2022 – July 2022 Fabrègues, France

Building an Aspect-Based Sentiment Analysis pipeline

  • Designed and implemented an NL¸ pipeline combining topic extraction and sentiment analysis of customer reviews
  • Applied the pipeline to analyze and gain insights from customer feedback in French and German.
 
 
 
 
 
CEWE Stiftung & Co. KGaA
Data Scientist Intern
CEWE Stiftung & Co. KGaA
July 2021 – January 2022 Oldenburg, Germany

Bachelor Thesis Title: “Multilingual Text Classification using Transformers”

Grade: 17.67/20 (88.4%)

  • Fine-tuned an XLM-RoBERTa (Goyal, 2020) model using Transfer Learning techniques on 110.000 documents
  • Implemented Temperature Scaling (Guo, 2017) to produce calibrated probability outputs, enabling uncertainty estimation
  • Achieved an F1-score of 93% across 14 classes in 3 languages (German, English and French)
  • Deployed an active learning web application to facilitate collaborative data labeling and efficient model training

Projects

*
Stoix
Stoix provides simplified code for quickly iterating on ideas in single-agent reinforcement learning with useful implementations of popular single-agent RL algorithms in JAX allowing for easy parallelisation across devices with JAX’s pmap.
Stoix
Syllabus
Syllabus is an API for designing curricula for reinforcement learning agents, as well as a framework for synchronizing those curricula across environments running in multiple processes.
Syllabus
Neat-JAX
JAX implementation of NeuroEvolution of Augmenting Topologies (Neat). (in progress)
Neat-JAX

Contact

  • ryan [dot] pegoud [dot] 24 [at] ucl [dot] ac [dot] uk