SamComber

Organizations

@doordash @creditornot @deliveroo @GDSL-UL

Agentic RL Training at Scale

Python 1,352 284 Updated May 8, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 57,716 9,900 Updated Nov 12, 2025
Python 1,282 133 Updated Feb 28, 2026

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 936 121 Updated Mar 23, 2024

Python Module Dependency graphs

Python 2,080 134 Updated May 5, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 92,915 10,520 Updated May 6, 2026

Our library for RL environments + evals

Python 4,085 543 Updated May 8, 2026

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.

Python 71 7 Updated May 5, 2025

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,866 789 Updated Apr 17, 2026

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 624 179 Updated May 2, 2023

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,650 223 Updated Apr 14, 2026
Python 166 11 Updated Jan 21, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,612 411 Updated Nov 12, 2025

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

An interactive dashboard to display Formula 1 data and statistics

Python 13 1 Updated Aug 3, 2021

A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations

Python 32 13 Updated Dec 21, 2023

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,208 311 Updated May 5, 2026
Python 24 5 Updated Dec 13, 2022

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,097 173 Updated Mar 20, 2025

I will build Transformer from scratch

Python 90 13 Updated Jul 21, 2025

nannyml: post-deployment data science in python

Python 2,141 183 Updated Jul 12, 2025

🌊 Online machine learning in Python

Python 5,812 624 Updated May 8, 2026

Implementation of Bayesian Hyperparameter Optimization of Machine Learning Algorithms

Jupyter Notebook 640 319 Updated Apr 29, 2023

Spell checking pre-commit Git hook.

Shell 90 16 Updated Oct 5, 2019

Uplift modeling and causal inference with machine learning algorithms

Python 5,829 856 Updated Apr 25, 2026

Imputation of missing values in tables.

492 70 Updated Jan 14, 2026

Python API for Deequ

Jupyter Notebook 820 152 Updated May 7, 2026

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,576 1,074 Updated Mar 16, 2026