Skip to content
View Gigi-G's full-sized avatar
:octocat:
Working
:octocat:
Working

Highlights

  • Pro

Organizations

@UNICT-DMI @fpv-iplab @triglie @I-Golem

Block or report Gigi-G

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,768 322 Updated May 8, 2026

Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**

Python 244 23 Updated Apr 11, 2026

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 453 31 Updated Feb 17, 2026

Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)

Python 31 1 Updated Jan 18, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 21,762 2,255 Updated Apr 4, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

1,617 52 Updated May 8, 2026

[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning

Python 83 5 Updated Dec 6, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,950 356 Updated Jan 4, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,257 2,621 Updated Mar 3, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 9,443 1,422 Updated May 6, 2026

Using advances in generative modeling to learn reward functions from unlabeled videos.

Jupyter Notebook 142 15 Updated Feb 12, 2024

A Large-scale Video Action Dataset

Python 461 13 Updated Jan 16, 2026
Python 28 1 Updated Jul 18, 2025

Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024

Python 58 Updated Aug 19, 2025
Python 41 3 Updated Jun 14, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 103,441 13,541 Updated May 8, 2026

An extension of the PyTorch library containing various tools for performing deep learning in hyperbolic space.

Python 179 13 Updated Jan 7, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,318 364 Updated Dec 4, 2025

UCI chess engine

C++ 36 6 Updated Nov 18, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,055 1,450 Updated Mar 3, 2026

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs [ECCV, 2024]

Python 8 1 Updated Jul 19, 2024

Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Python 11 1 Updated Apr 26, 2026

Code for the Molmo Vision-Language Model

Python 904 95 Updated Dec 12, 2024

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 8,580 527 Updated May 7, 2026

Differentiable Dynamic Programming

Python 72 19 Updated Sep 15, 2020

Implementation of Autoregressive Diffusion in Pytorch

Python 437 13 Updated Dec 4, 2025

[BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".

Python 35 9 Updated Feb 22, 2025

Official Pytorch Implementation of GraphiT

Python 110 13 Updated Jul 6, 2021
Next