-
Toss Bank
- Seoul, South Korea
- https://huggingface.co/upskyy
- https://upskyy.github.io
- in/sangchunha
Highlights
Stars
Community maintained hardware plugin for vLLM on Apple Silicon
The agent that grows with you
Hundreds of models & providers. One command to find what runs on your hardware.
Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training support.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
⚡ A fast Git hook manager written in Rust, designed as a drop-in alternative to pre-commit, reimagined.
AI agents running research on single-GPU nanochat training automatically
Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.
A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
A framework for efficient model inference with omni-modality models
Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
직접 만든 모바일 청첩장 (w/ AI Chatbot) · Mobile wedding invitation with AI chatbot
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
Breakthrough Method for Agile Ai Driven Development
Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
The absolute trainer to light up AI agents.
ValueCell is a community-driven, multi-agent platform for financial applications.
Training library for Megatron-based models with bidirectional Hugging Face conversion capability