🎯
Focusing

Organizations

@kubernetes @open-policy-agent @virtual-kubelet @kubernetes-sigs @coreweave


A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,640 391 Updated May 9, 2026

Claude Code skill that removes signs of AI-generated writing from text

17,953 1,714 Updated Apr 1, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 51,739 6,817 Updated May 9, 2026

llm-d benchmark scripts and tooling

Python 59 71 Updated May 9, 2026

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 284 39 Updated May 9, 2026

A cloud-agnostic Kubernetes node autoscaler that dynamically scales infrastructure across Azure and emerging neoclouds like Nebius—managed from a single control plane.

Go 8 1 Updated Apr 24, 2026

Rally your AI squad to GitHub issues and PRs via git worktrees

JavaScript 33 2 Updated Apr 23, 2026
Shell 2 1 Updated Feb 14, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 94,110 8,176 Updated May 8, 2026

Inspektor Gadget is a set of tools and framework for data collection and system inspection on Kubernetes clusters and Linux hosts using eBPF

C 2,808 341 Updated May 7, 2026

✈️ Kubernetes-native platform for deploying and managing AI inference across multiple providers

TypeScript 78 25 Updated May 9, 2026

Discover ingress-nginx usage and auto-generate Gateway API migration plans before ingress-nginx reaches end-of-life (March 2026).

Go 16 Updated Nov 26, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

Python 9,953 1,060 Updated May 9, 2026

The best ChatGPT that $100 can buy.

Python 53,157 7,122 Updated May 5, 2026

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,402 252 Updated Feb 8, 2026

A sample pack of GitHub Agentic Workflows!

Makefile 681 100 Updated May 9, 2026
2 Updated Sep 26, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,154 464 Updated May 9, 2026

Wassette: A security-oriented runtime that runs WebAssembly Components via MCP

Rust 883 61 Updated Apr 23, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,086 2,073 Updated Mar 27, 2026

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 2,653 444 Updated May 8, 2026

LLM inference in C/C++

C++ 109,198 17,995 Updated May 9, 2026

Home of the out-of-tree KAITO plugin for Headlamp Kubernetes UI

TypeScript 7 2 Updated Aug 8, 2025

The Security Toolkit for LLM Interactions

Python 2,934 388 Updated Dec 15, 2025

Set of tools to assess and improve LLM security.

Python 4,172 731 Updated May 9, 2026

A comprehensive social media management tool designed to help you create, format, and post content across multiple platforms including LinkedIn, Twitter/X, Bluesky, and Mastodon. Features advanced …

TypeScript 91 14 Updated Jan 15, 2026

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Go 443 77 Updated May 9, 2026

CRIU based GPU workload migration in Kubernetes

Go 23 6 Updated Apr 22, 2025

OPA Gatekeeper provider for GitHub Artifact Attestations

Go 22 8 Updated May 5, 2026
Shell 5 Updated May 9, 2026