About me

I am a Staff Research Scientist at DeepMind. I currently work on training LLMs.

Before joining DeepMind as a Research Engineer in 2016, I was a Software Engineer at Google Zurich working on NLP using neural networks.

Education

I obtained my PhD at UCL, supervised by Prof. Marc Deisenroth. I passed my viva without corrections in May 2023. My examination committee was formed by Dr. Ferenc Huszár and Dr. Patrick Rebeschini. You can find my PhD thesis here.

I finished my 4 year Computing MEng degree from Imperial College London in 2014, with first honours and a prize of excellence for an outstanding overall performance.

Professional Interests

Generative models, reinforcement learning, natural language processing, scalable and safe machine learning.

I am currently focused on understanding optimisation in deep learning.

Papers

Talks

Open source projects

  • Optax: JAX optimization library. Github
  • Monte Carlo Gradient estimation in machine learning source code. Github
  • Deep Compressed Sensing source code. Github
  • High performance GPU implementation of deep belief networks to assess their performance on facial emotion recognition from images. Github
  • Hopfield networks and RBM implementation in Haskell. Github

Other ML community contributions