Skip to content
View xdotli's full-sized avatar

Sponsoring

@dohooo
@AmyTao

Highlights

  • Pro

Organizations

@benchflow-ai

Block or report xdotli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PaperBanana: Automating Academic Illustration For AI Scientists

Python 6,240 458 Updated Apr 23, 2026

Reverse engineer clean behavioral specs from any codebase

105 4 Updated Apr 24, 2026

Iterative development methodology plugin for Claude Code — extracts requirements, defines walking skeleton, loops through audited sprints autonomously

Python 65 5 Updated Apr 24, 2026

Rust CLI that converts EPUBs to YAML-headed markdown with byte/line offsets per chapter, for token-efficient agent reading

Rust 49 2 Updated May 2, 2026

An experimental agentic tool

TypeScript 39 7 Updated Apr 30, 2026

Self-hosted bridge that lets you interact with AI coding agents (Claude Code, Codex, etc.) from messaging platforms (Telegram, Discord, etc.) via the Agent Client Protocol (ACP).

TypeScript 345 39 Updated May 6, 2026

Agent harness to make your slop code well-engineered and beautiful.

Python 2,822 195 Updated Apr 6, 2026

Safest and fastest Python library for secp256k1 elliptic curve operations

Python 179 60 Updated Dec 1, 2025

Cloud-synced dashboards for OpenCode and Claude Code. Track sessions, search with semantic lookup, export eval datasets.

TypeScript 356 40 Updated Feb 23, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 51,549 6,785 Updated May 8, 2026

TeachOpenCADD: a teaching platform for computer-aided drug design (CADD) using open source packages and data

Jupyter Notebook 997 227 Updated May 8, 2026

Repository for the NodeMedic-FINE tool (NDSS'25).

TypeScript 11 6 Updated Nov 24, 2025

RF Genesis: Zero-Shot Generalization of mmWave Sensing through Simulation-Based Data Synthesis and Generative Diffusion Models (SenSys'23)

Python 99 9 Updated Aug 3, 2025

Open-source Environment toolkit of claw-like agents, support task/harness generation and evaluation

Python 40 5 Updated May 7, 2026

stripe-mock is a mock HTTP server that responds like the real Stripe API. It can be used instead of Stripe's testmode to make test suites integrating with Stripe faster and less brittle.

Go 1,613 129 Updated May 7, 2026

Skills for the Gemini API, SDK and model/agent interactions

3,434 321 Updated May 8, 2026

Tool for making it easy to work with lots of AI agents

TypeScript 47 12 Updated May 8, 2026

Repository of Stargazer: A Scalable Model-Fitting Benchmark Environment for AI Agents under Astrophysical Constraints

Python 7 Updated Apr 25, 2026
Python 60 10 Updated Nov 3, 2025

Foundation for an open strong-agent platform: controllers, operators, skills, A2A, runtime, and graph execution.

Python 4 Updated Apr 23, 2026

Unite the knowledge of the world's top experts across every domain — to accelerate AI-driven scientific discovery.

JavaScript 34 5 Updated May 4, 2026

Official repository for the General Robust Image Task (GRIT) Benchmark

Jupyter Notebook 55 7 Updated Mar 29, 2023

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 529 46 Updated May 8, 2026
Jupyter Notebook 279 31 Updated Jan 5, 2026

Open-source implementation of AlphaEvolve

Python 6,193 991 Updated Mar 18, 2026

A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.

C++ 194 32 Updated May 1, 2026

kubernetes, reimplemented in Rust

Rust 401 29 Updated May 6, 2026

Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.

Shell 23,659 2,390 Updated May 1, 2026

Gas Town - multi-agent workspace manager

Go 15,031 1,375 Updated May 8, 2026

Ready-to-use harnesses for OpenReward environments

Python 8 2 Updated Apr 21, 2026
Next