- Rimo, LLC.
- Tokyo
- http://www.linkedin.com/in/awakia
Stars
All parts of Claude Code's system prompt, 24 builtin tool descriptions, sub agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash cmd, secur…
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Fast, accurate & comprehensive text measurement & layout
Real-time face swap and one-click video deepfake with only a single image
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
On-device Speech AI for Apple Silicon
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
A natural language interface for computers
Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
A guidance language for controlling large language models.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Software that can perform photorealistic style transfer without the need of any post-processing steps.
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Automated auditing, performance metrics, and best practices for the web.
Python packaging and dependency management made easy
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.