Stars
A linter for Claude Code projects. Validates CLAUDE.md files, skills, settings, hooks, MCP servers, and plugins.
A guidance language for controlling large language models.
Undetected web-scraping & seamless HTML parsing in Python!
Run PyTorch LLMs locally on servers, desktop and mobile
Tiny status page generated by a Python script
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Power CLI and Workflow manager for LLMs (core package)
The OpenPoliceData (OPD) Python library is the most comprehensive centralized public access point for incident-level police data in the United States. OPD provides easy access to 550+ incident-leve…
✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML and CSV, into interactive graphs.
Code relating to scraping public police data.
Meadowrun makes it easy to run your code on the cloud
Blazing fast framework for fine-tuning similarity learning models
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026.
TypeScript wrapper for the Hugging Face Inference API.
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Generate prime numbers from pictures!
A collection of notebooks for Natural Language Processing from NLP Town
A small package to fuzzy-match Chinese words
A real-time tech course finder, created using Elasticsearch, Python, React+Redux, Docker, and Kubernetes.
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Platform for designing and evaluating Graph Neural Networks (GNN)
labourR: Methods, Classes and Data for Labour Market Analysis
Model parallel transformers in JAX and Haiku
A Dataset of Python Challenges for AI Research