oceank

Jianhai Su oceank

11 followers · 23 following

University of South Carolina
Columbia, SC
https://oceank.github.io/

Stars

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest model, MiroThinker-1.7, achieves 74.0 and 75.3 on BrowseComp and BrowseComp Zh, respectively.

Python 8,199 622 Updated Apr 25, 2026

kesai-labs / lead

[CVPR'26] LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving

Python 167 15 Updated Apr 29, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,752 78 Updated Jan 20, 2026

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,278 72 Updated Mar 9, 2025

dunnolab / awesome-in-context-rl

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning

297 14 Updated Sep 8, 2025

EvoAgentX / Awesome-Self-Evolving-Agents

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

2,121 151 Updated Oct 11, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 79,375 16,548 Updated May 8, 2026

ai-nikolai / StateAct

[REALM25 @ ACL25] - "StateAct" Official Paper Repo (SOTA LLM Agent)

Python 18 Updated Feb 27, 2026

microsoft / LLF-Bench

A benchmark for evaluating learning agents based on just language feedback

Python 98 18 Updated Mar 26, 2026

yanxue7 / RL-LLM-Prior

Python 25 4 Updated Jun 11, 2025

NVIDIA / open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

C 16,976 1,678 Updated May 1, 2026

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,769 264 Updated Nov 12, 2025

google-deepmind / mujoco_playground

An open-source library for GPU-accelerated robot learning and sim-to-real transfer.

Python 1,924 313 Updated May 8, 2026

ryanxhr / IVR

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Python 46 6 Updated Jul 27, 2023

Facebear-ljx / PROTO

Python 18 2 Updated May 25, 2023

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,618 433 Updated Dec 7, 2025

Open-Deep-ML / DML-OpenProblem

Python 641 197 Updated Nov 10, 2025

khangich / machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

12,512 2,012 Updated Aug 31, 2023

youngyangyang04 / leetcode-master

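《代码随想录》 (Code Thoughts) LeetCode practice guide: a recommended order for working through 200 classic problems, detailed illustrated explanations totaling 600,000 characters, video walkthroughs of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀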

Shell 61,330 12,328 Updated Apr 30, 2026

perixtar / 2026-Tech-OA-by-FastPrep

List of Tech Company OAs. Save your time from finding them all over the internet.

2,838 200 Updated May 7, 2026

ml-jku / hopfield-layers

Hopfield Networks is All You Need

Python 1,928 227 Updated Apr 23, 2023

Farama-Foundation / Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,812 343 Updated Jan 20, 2026

awarelab / continual_world

Python 121 20 Updated Jan 9, 2024

AGI-Labs / continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Python 134 15 Updated Jul 6, 2023

lebrice / Sequoia

The Research Tree - A playground for research at the intersection of Continual, Reinforcement, and Self-Supervised Learning.

Python 200 16 Updated May 30, 2023

sfujim / BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Python 663 145 Updated Apr 6, 2021

google-deepmind / concordia

A library for generative social simulation

Python 1,413 317 Updated May 6, 2026

WeihaoTan / TWOSOME

Implementation of TWOSOME

Python 82 10 Updated Jan 11, 2025

nakamotoo / Cal-QL

Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)

Python 121 8 Updated Jul 31, 2024

truefoundry / cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 4,409 387 Updated Mar 13, 2026
