Skip to content
View wooksu's full-sized avatar

Organizations

@nota-github

Block or report wooksu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 71,089 8,689 Updated May 8, 2026

Official implementation for "PIO-FVLM: Rethinking Training-Free Visual Token Reduction for VLM Acceleration from an Inference-Objective Perspective"

Python 108 8 Updated Apr 22, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 56,793 4,624 Updated May 9, 2026

A simple yet powerful agent framework that delivers with open-source models

Python 4,547 468 Updated Mar 21, 2026

ERGO (Efficient Reasoning & Guided Observation) is a large vision-language model trained with reinforcement learning on efficiency objectives. [ICLR'26]

Python 19 1 Updated Feb 25, 2026

[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Python 89 2 Updated Jan 26, 2026
Python 42 1 Updated Jul 14, 2025

Nano vLLM

Python 13,328 2,059 Updated Apr 26, 2026

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 99 5 Updated Sep 20, 2025

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 187 11 Updated Jan 16, 2026
Python 1,208 77 Updated Nov 20, 2025

Open-source unified multimodal model

Python 5,901 523 Updated May 4, 2026

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,409 62 Updated Apr 19, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,926 373 Updated Apr 6, 2026

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,274 157 Updated Apr 13, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,952 379 Updated Mar 12, 2026

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,238 106 Updated Oct 29, 2025

Witness the aha moment of VLM with less than $3.

Python 4,054 285 Updated May 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 79,476 16,584 Updated May 9, 2026

A paper list of some recent works about Token Compress for Vit and VLM

893 42 Updated Apr 14, 2026

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

Python 316 20 Updated Jul 6, 2024

A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]

Python 60 6 Updated Mar 8, 2024

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 91 13 Updated Sep 13, 2024

The official NetsPresso Python package.

Jupyter Notebook 48 1 Updated Nov 20, 2025

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 74 12 Updated Sep 29, 2025

Repository for 2023 AI City Challenge (Track1: Multi-Camera People Tracking)

Python 38 6 Updated Oct 7, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,413 33,148 Updated May 9, 2026

Polynomial Learning Rate Decay Scheduler for PyTorch

Python 65 13 Updated Dec 25, 2021

Learning Rate Warmup in PyTorch

Python 415 23 Updated Jun 19, 2025

An easy to use PyTorch to TensorRT converter

Python 4,867 702 Updated Aug 17, 2024
Next