natowi

I may be slow to respond.

159 followers · 11 following

Starred repositories

okdalto / ComfyUI-PersonaLive

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 113 11 Updated Jan 25, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,489 313 Updated Jan 5, 2026

Francis-Rings / FlashPortrait

[CVPR2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference…

Python 474 36 Updated Feb 21, 2026

Tongyi-MAI / Z-Image

Python 11,203 758 Updated Feb 9, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 46,835 5,200 Updated May 6, 2026

resemble-ai / chatterbox

SoTA open-source TTS

Python 24,619 3,277 Updated May 1, 2026

VaclavNezerka / Cloud2BIM

Codes for automatic point-cloud-to-BIM conversion

Python 98 25 Updated Apr 7, 2026

sisakat / e57inspector

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 19 2 Updated Apr 3, 2026

Dijji / XstReader

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 666 128 Updated Sep 11, 2023

PozzettiAndrea / ComfyUI-SAM3DBody

ComfyUI wrapper for sam-3d-body

Python 303 29 Updated May 8, 2026

yulunwu0108 / NeuSurf

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

nari-labs / dia2

TTS model capable of streaming conversational audio in realtime.

Python 1,120 96 Updated Nov 29, 2025

FORARTfe / HyMPS

HyMPS will be a platform-independent software suite for advanced audio/video content production.

307 18 Updated Apr 16, 2026

ThinkInAIXYZ / deepchat

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 5,781 661 Updated May 8, 2026

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 12,107 1,373 Updated May 8, 2026

cjpais / Handy

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 21,326 1,762 Updated Apr 30, 2026

ashesbloom / LocalLens

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

CSS 115 7 Updated Apr 30, 2026

alam00000 / bentopdf

The Privacy First PDF Toolkit

JavaScript 13,143 1,066 Updated May 8, 2026

Ircama / epson_print_conf

Epson Printer Configuration tool and waste ink counter resetter

Python 548 98 Updated Dec 30, 2025

JanuszBedkowski / mandeye_controller

C++ 100 12 Updated May 9, 2026

VAST-AI-Research / MIDI-3D

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 912 70 Updated Jun 12, 2025

Open3DVLab / GigaGS

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

149 3 Updated Sep 11, 2024

theialab / 3dgs-flats

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 62 Updated Nov 26, 2025

QianyiWu / GSRec

🔄 [ECCV'24] PyTorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 121 5 Updated Jan 19, 2026

doubletwisted / ComfyUI-Deadline-Plugin

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 29 3 Updated Jan 14, 2026

Billionmail / BillionMail

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 14,665 1,533 Updated Apr 20, 2026

zhangganlin / vista-slam

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 245 13 Updated Apr 5, 2026

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Lists (7)

CameraCalibration

🔮 Future ideas

Meshroom

ML

NERF

RTI

sfm

Starred repositories

ocr-android

keypoint-detection

speech-enhancement

speech-synthesis

voice-synthesis

feature-detection

camera-model

noise-cancellation

structured-light

texture-mapping