SamComber

Sam Comber SamComber

AI @ Deliveroo

19 followers · 12 following

Here, there, everywhere
London
https://scholar.google.com/citations?user=KYmFMxsAAAAJ&hl=en

Achievements

x2 x3

Stars

PrimeIntellect-ai / prime-rl

Agentic RL Training at Scale

Python 1,352 284 Updated May 8, 2026

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 57,716 9,900 Updated Nov 12, 2025

ChenmienTan / RL2

Python 1,282 133 Updated Feb 28, 2026

SiliangZeng / Multi-Turn-RL-Agent

Python 124 10 Updated Jun 11, 2025

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 936 121 Updated Mar 23, 2024

thebjorn / pydeps

Python Module Dependency graphs

Python 2,080 134 Updated May 5, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 92,915 10,520 Updated May 6, 2026

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 4,085 543 Updated May 8, 2026

Danau5tin / calculator_agent_rl

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a 62% absolute increase in evaluation accuracy.

Python 71 7 Updated May 5, 2025

huggingface / deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,866 789 Updated Apr 17, 2026

mpatacchiola / dissecting-reinforcement-learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 624 179 Updated May 2, 2023

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,650 223 Updated Apr 14, 2026

bytarnish / AGILE

Python 166 11 Updated Jan 21, 2025

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,612 411 Updated Nov 12, 2025

hamelsmu / llama-inference

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

adenhaus / f1-data-viz

An interactive dashboard to display Formula 1 data and statistics

Python 13 1 Updated Aug 3, 2021

guyfe / Tweetsumm

A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations

Python 32 13 Updated Dec 21, 2023

pex-tool / pex

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,208 311 Updated May 5, 2026

erain / bazel-python-example

Python 24 5 Updated Dec 13, 2022

pbloem / former

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,097 173 Updated Mar 20, 2025

Khaliladib11 / Transformer-from-scratch

I will build Transformer from scratch

Python 90 13 Updated Jul 21, 2025

NannyML / nannyml

nannyml: post-deployment data science in python

Python 2,141 183 Updated Jul 12, 2025

online-ml / river

🌊 Online machine learning in Python

Python 5,812 624 Updated May 8, 2026

WillKoehrsen / hyperparameter-optimization

Implementation of Bayesian Hyperparameter Optimization of Machine Learning Algorithms

Jupyter Notebook 640 319 Updated Apr 29, 2023

mprpic / git-spell-check

Spell checking pre-commit Git hook.

Shell 90 16 Updated Oct 5, 2019

uber / causalml

Uplift modeling and causal inference with machine learning algorithms

Python 5,829 856 Updated Apr 25, 2026

aleksandramiesiac / UpliftModelling_Iml_team4

Jupyter Notebook 3 2 Updated Jun 5, 2020

awslabs / datawig

Imputation of missing values in tables.

492 70 Updated Jan 14, 2026

awslabs / python-deequ

Python API for Deequ

Jupyter Notebook 820 152 Updated May 7, 2026

hyperopt / hyperopt

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,576 1,074 Updated Mar 16, 2026
