Skip to content
View oceank's full-sized avatar

Highlights

  • Pro

Organizations

@softsys4ai

Block or report oceank

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7, achieves 74.0 and 75.3 on the BrowseComp and BrowseComp Zh, respectively.

Python 8,199 622 Updated Apr 25, 2026

[CVPR'26] LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving

Python 167 15 Updated Apr 29, 2026

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,278 72 Updated Mar 9, 2025

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

297 14 Updated Sep 8, 2025

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

2,121 151 Updated Oct 11, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 79,375 16,548 Updated May 8, 2026

[REALM25 @ ACL25] - "StateAct" Official Paper Repo (SOTA LLM Agent)

Python 18 Updated Feb 27, 2026

A benchmark for evaluating learning agents based on just language feedback

Python 98 18 Updated Mar 26, 2026
Python 25 4 Updated Jun 11, 2025

NVIDIA Linux open GPU kernel module source

C 16,976 1,678 Updated May 1, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,769 264 Updated Nov 12, 2025

An open-source library for GPU-accelerated robot learning and sim-to-real transfer.

Python 1,924 313 Updated May 8, 2026

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Python 46 6 Updated Jul 27, 2023
Python 18 2 Updated May 25, 2023

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,618 433 Updated Dec 7, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

12,512 2,012 Updated Aug 31, 2023

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 61,330 12,328 Updated Apr 30, 2026

List of Tech Company OAs. Save your time from finding them all over the internet.

2,838 200 Updated May 7, 2026

Hopfield Networks is All You Need

Python 1,928 227 Updated Apr 23, 2023

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,812 343 Updated Jan 20, 2026
Python 121 20 Updated Jan 9, 2024

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Python 134 15 Updated Jul 6, 2023

The Research Tree - A playground for research at the intersection of Continual, Reinforcement, and Self-Supervised Learning.

Python 200 16 Updated May 30, 2023

Author's PyTorch implementation of BCQ for continuous and discrete actions

Python 663 145 Updated Apr 6, 2021

A library for generative social simulation

Python 1,413 317 Updated May 6, 2026

Implementation of TWOSOME

Python 82 10 Updated Jan 11, 2025

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)

Python 121 8 Updated Jul 31, 2024

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Python 4,409 387 Updated Mar 13, 2026
Next