Stars
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Dexbotic: Open-Source Vision-Language-Action Toolbox
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
An Open-Source Library for Robust Object Manipulation via Uncertainty-aware Task-specific Intuitive Physics
[RSS 2026] Causal video-action world model for generalist robot control
A community collection of OpenClaw use cases for making life easier.
RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. 🎉🎉🎉
Memory for 24/7 proactive agents like OpenClaw.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
🐈 nanobot: The Ultra-Lightweight Personal AI Agent
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seam…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A lightweight point-based visualization tool used for inspecting Gaussian data, designing camera motion, and exporting setups for external Gaussian renderers.
A comprehensive list of resources on physical cognition in video generation, including papers, code, and related websites.
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
Official repo for GraspGen: A Diffusion-based Framework for 6-DOF Grasping
[CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
COLMAP - Structure-from-Motion and Multi-View Stereo
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
💪 [TPAMI 2025] PyTorch implementation of 'HAC++: Towards 100X Compression of 3D Gaussian Splatting'
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…