Skip to content
View natowi's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Organizations

@alicevision

Block or report natowi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 113 11 Updated Jan 25, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,489 313 Updated Jan 5, 2026

[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference…

Python 474 36 Updated Feb 21, 2026
Python 11,203 758 Updated Feb 9, 2026

Open-Source Frontier Voice AI

Python 46,835 5,200 Updated May 6, 2026

SoTA open-source TTS

Python 24,619 3,277 Updated May 1, 2026

Codes for automatic point-cloud-to-BIM conversion

Python 98 25 Updated Apr 7, 2026

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 19 2 Updated Apr 3, 2026

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 666 128 Updated Sep 11, 2023

ComfyUI wrapper for sam-3d-body

Python 303 29 Updated May 8, 2026

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

TTS model capable of streaming conversational audio in realtime.

Python 1,120 96 Updated Nov 29, 2025

HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.

307 18 Updated Apr 16, 2026

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 5,781 661 Updated May 8, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 12,107 1,373 Updated May 8, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 21,326 1,762 Updated Apr 30, 2026

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

CSS 115 7 Updated Apr 30, 2026

The Privacy First PDF Toolkit

JavaScript 13,143 1,066 Updated May 8, 2026

Epson Printer Configuration tool and waste ink counter resetter

Python 548 98 Updated Dec 30, 2025

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 912 70 Updated Jun 12, 2025

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

149 3 Updated Sep 11, 2024

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 62 Updated Nov 26, 2025

🔄 [ECCV‘24] Pytorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 121 5 Updated Jan 19, 2026

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 29 3 Updated Jan 14, 2026

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 14,665 1,533 Updated Apr 20, 2026

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 245 13 Updated Apr 5, 2026

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 1,095 416 Updated Jan 23, 2026

A collection of MCP servers.

86,506 9,975 Updated May 2, 2026

Model Context Protocol (MCP) that allows LLMs to use QGIS Desktop

Python 939 151 Updated Oct 1, 2025
Next