ritazh
🎯 Focusing

Organizations

@kubernetes @open-policy-agent @virtual-kubelet @kubernetes-sigs @coreweave

This repository contains examples and best practices for AI workloads on Azure

Shell 30 14 Updated May 6, 2026

Main reference implementation for NLWeb, implemented in Python.

Python 6,205 693 Updated May 11, 2026

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,293 1,681 Updated Nov 19, 2025

Model Context Protocol (MCP) server for Kubernetes and OpenShift

Go 1,567 338 Updated May 11, 2026

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

TypeScript 28,171 4,553 Updated May 12, 2026

Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord

Go 2,739 552 Updated May 11, 2026

The fullstack MCP framework to develop MCP Apps for ChatGPT / Claude & MCP Servers for AI Agents.

TypeScript 9,929 1,276 Updated May 12, 2026

⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.

Shell 23 17 Updated May 9, 2026

GenAI inference performance benchmarking tool

Python 184 89 Updated May 11, 2026

Security scanner for AI agents, MCP servers and agent skills.

Python 2,383 217 Updated May 11, 2026

A comprehensive security checklist for MCP-based AI tools. Built by SlowMist to safeguard LLM plugin ecosystems.

827 71 Updated Apr 28, 2025

An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.

Go 1,343 173 Updated May 11, 2026

📦️ A fast, secure MCP server that extends its capabilities through WebAssembly plugins.

Rust 871 66 Updated May 11, 2026

Published in CNCF Landscape: An MCP server for Kubernetes.

Python 880 168 Updated Apr 8, 2026

hyperlight-wasm is a Rust library crate that enables Wasm modules and components to run inside a lightweight virtual-machine-backed sandbox. It is built on top of Hyperlight.

Rust 706 36 Updated May 11, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,774 1,104 Updated May 12, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 27,664 5,849 Updated May 12, 2026

Kubernetes RBAC authorizing HTTP proxy for a single upstream.

Go 673 266 Updated Apr 27, 2026

Use the GitHub Actions cache to back the Go build

Go 3 Updated Apr 5, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,796 579 Updated May 11, 2026

Health checks for Azure N- and H-series VMs.

Shell 57 37 Updated May 6, 2026

Gateway API Inference Extension

Go 667 289 Updated May 10, 2026

Hyperlight is a lightweight Virtual Machine Manager (VMM) designed to be embedded within applications. It enables safe execution of untrusted code within micro virtual machines with very low latenc…

Rust 4,248 169 Updated May 11, 2026

Agentic AI framework for enterprise workflow automation.

Python 1,556 101 Updated Apr 18, 2025

This Kubernetes fork is intended to provide long term support for Kubernetes releases, but is not an official release of the Kubernetes project. For more information, please see https://github.com/…

Go 18 8 Updated Mar 31, 2026

Basic Streamlit application for testing and displaying multi-GPU LLM timings

Python 10 2 Updated Mar 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 79,704 16,673 Updated May 12, 2026

Containerized Python-based framework for running and visualizing benchmark workloads on any Kubernetes/OpenShift platform and runtime kind (pods, KubeVirt virtual machines), simply and safely

Python 34 24 Updated May 11, 2026

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 720 148 Updated May 9, 2026

📦 Produce secure packages and containers with declarative configurations

Go 301 52 Updated May 11, 2026