LLM KV cache compression made easy

Python 1,067 138 Updated May 5, 2026

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 2,955 255 Updated May 8, 2026

Self-referential self-improving agents that can optimize for any computable task

Python 2,450 312 Updated Apr 26, 2026

"OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving" -- Community: https://open-space.cloud/

Python 6,072 750 Updated Apr 16, 2026

Make your OpenClaw agent self-improving

TypeScript 10 Updated Mar 31, 2026

This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.

314 13 Updated Dec 5, 2025

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

2,121 151 Updated Oct 11, 2025

🚀 The fast, Pythonic way to build MCP servers and clients.

Python 25,063 1,993 Updated May 7, 2026

Open-source unified multimodal model

Python 5,894 523 Updated May 4, 2026

Various templates I use in Obsidian (Dataview, Templater, QuickAdd)

JavaScript 548 49 Updated Jul 14, 2024

DFlash: Block Diffusion for Flash Speculative Decoding

Python 3,616 256 Updated May 6, 2026

[ICLR'26] Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling

Python 97 1 Updated Jan 29, 2026

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 391 49 Updated Apr 22, 2025

WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.

Python 644 45 Updated Mar 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,139 622 Updated Mar 13, 2026

Persist and reuse KV Cache to speed up your LLM.

Python 276 73 Updated May 8, 2026

Automatically fetch diffusion NLP papers from arXiv. More information on the papers can be found in another repository, "Diffusion-LM-Papers".

Python 294 16 Updated May 6, 2026

The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Chen Chen, Lei Chen, Xianzhi Yu, Wulong Liu, Jianye HAO, Mingx…

Python 28 5 Updated Jul 15, 2025

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 4,104 661 Updated May 8, 2026

Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference

Python 249 17 Updated Feb 3, 2026

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 959 119 Updated May 6, 2026
C++ 363 40 Updated Jan 28, 2026

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 517 80 Updated Aug 1, 2024

An autonomous agent that conducts deep research on any data using any LLM providers

Python 26,935 3,614 Updated Apr 16, 2026

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 39,137 3,751 Updated Jul 9, 2025

Naive Bayes-based Context Extension

Python 328 22 Updated Dec 9, 2024

The agent engineering platform. Available in TypeScript!

Python 136,112 22,496 Updated May 8, 2026

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

Python 548 75 Updated Jan 31, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,529 3,439 Updated May 7, 2026

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,938 2,192 Updated Jul 29, 2024