Skip to content
View Cjkkkk's full-sized avatar
🏠
coding...
🏠
coding...

Block or report Cjkkkk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,039 134 Updated Apr 28, 2026

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 432 63 Updated Jan 5, 2026

pprof is a tool for visualization and analysis of profiling data

Go 9,168 657 Updated May 7, 2026

Puzzles for learning Triton

Jupyter Notebook 2,427 229 Updated Apr 1, 2026

JAX-Toolbox

Python 406 72 Updated May 8, 2026
Python 355 31 Updated Apr 13, 2026

A simple, performant and scalable Jax LLM!

Python 2,270 513 Updated May 8, 2026

Development repository for the Triton language and compiler

MLIR 19,126 2,837 Updated May 8, 2026
C++ 9 1 Updated Oct 31, 2022

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 550 72 Updated May 8, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,602 2,031 Updated Apr 22, 2026

A VM That is Dynamic and Fast

C 1,657 61 Updated Jun 8, 2025

A machine learning framework project motivated by CMU-10414

Python 1 Updated Dec 16, 2022

a language for fast, portable data-parallel computation

C++ 1 Updated Nov 10, 2025

Container plugin for Slurm Workload Manager

C 434 41 Updated Apr 29, 2026

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 934 128 Updated Apr 29, 2026

Collection of Summer 2026 tech internships!

Python 44,488 3,177 Updated May 8, 2026

Various translations of OSTEP can be found here. Help the cause and contribute!

3,061 513 Updated Jan 20, 2025

MIT 6.824 (Distributed Systems) labs in Go

Go 234 63 Updated Feb 22, 2021

A library for replicating your python class between multiple servers, based on raft protocol

Python 751 118 Updated Mar 17, 2026

HIPIFY: Convert CUDA to Portable C++ Code

C++ 695 106 Updated May 8, 2026

A GPU benchmark suite for assessing on-chip GPU memory bandwidth

C++ 110 28 Updated Aug 12, 2017

AI education materials for Chinese students, teachers and IT professionals.

HTML 14,064 2,941 Updated May 16, 2024

portion, a Python library providing data structure and operations for intervals.

Python 521 38 Updated Jan 28, 2026

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,669 177 Updated Jan 21, 2026

The reference implementation of the Linux FUSE (Filesystem in Userspace) interface

C 6,030 1,267 Updated May 5, 2026

TensorFlow code and pre-trained models for BERT

Python 40,007 9,718 Updated Jul 23, 2024

This is the top-level repository for the Accel-Sim framework.

Python 597 214 Updated Mar 24, 2026

A polyhedral compiler for expressing fast and portable data parallel algorithms

C++ 957 137 Updated Nov 20, 2024

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…

Python 4,771 1,202 Updated Jul 15, 2024
Next