Skip to content
View kadu-v's full-sized avatar

Highlights

  • Pro

Organizations

@prg-titech @nishi-7

Block or report kadu-v

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 925 48 Updated May 9, 2026

slotd is a lightweight, Rust-based job scheduler inspired by Slurm, designed for single-node, single-user workloads.

Rust 38 Updated Apr 14, 2026

Low-latency AI engine for mobile devices & wearables

C 4,716 371 Updated May 7, 2026

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 5,100 602 Updated May 9, 2026

A minimal PyTorch re-implementation of Qwen 3.5

Python 418 34 Updated Mar 5, 2026

A GPU-oriented coverage-guided fuzzer for userland CUDA applications

Python 24 Updated Mar 11, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 176,160 27,255 Updated May 9, 2026

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

3,320 212 Updated Apr 22, 2026

dLLM: Simple Diffusion Language Modeling

Python 2,476 256 Updated Apr 15, 2026

Educational WIP

Python 70 8 Updated Feb 16, 2026
Python 25 7 Updated Feb 18, 2026

ANADAPTIVEEDGE-GUIDEDDUAL-NETWORKFRAMEWORKFORFASTQRCODE MOTIONDEBLURRING

Python 4 Updated Oct 15, 2025

A character-level language diffusion model trained on Tiny Shakespeare

Python 904 87 Updated Jan 16, 2026

🐹 Deep clean and optimize your Mac.

Shell 50,542 1,575 Updated May 7, 2026

WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.

Python 644 45 Updated Mar 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,147 627 Updated Mar 13, 2026

MobileFineTuner: Native C++ framework for fine-tuning LLMs directly on mobile devices. Features: LoRA/Full-FT, ZeRO-inspired parameter sharding, energy-aware throttling, custom autograd engine. Kee…

C++ 11 5 Updated May 2, 2026

a repo to understand llama.cpp

C++ 9 1 Updated Nov 8, 2025

An experimental, cross-platform CPU tensor library.

Rust 7 Updated Mar 1, 2026
2 Updated Oct 14, 2024

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,746 182 Updated Mar 24, 2026

Visualize machine learning models with Netron in VSCode

JavaScript 19 2 Updated Apr 22, 2026

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Rust 3,075 107 Updated May 7, 2026

Predictions of the four corners of documents.

Python 83 9 Updated Jan 13, 2026

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 2,004 143 Updated Mar 17, 2026

A tool for deep image processing's dataset augmentation

Python 17 1 Updated May 8, 2024

We all edit.

Rust 14,156 674 Updated May 8, 2026

The official PyTorch implementation of SEMv3.

Python 52 6 Updated May 26, 2024
Python 70 6 Updated Jun 26, 2024

A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models such as YOLO, FastVLM, and more.

Rust 401 44 Updated May 1, 2026
Next