Stars
Databricks Toolkit for Coding Agents provided by Field Engineering
🇧🇷 Brazilian OSSU-like Community built on the same principles of openness, inclusivity, and accessibility!
Continuous Unix commit history from 1970 until today
Apache Beam is a unified programming model for Batch and Streaming data processing.
A set of exercises for deliberate Git Practice
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
Secure, cross-platform Git credential storage with authentication to GitHub, Azure Repos, and other popular Git hosting services.
Curso de Sistemas Operacionais DCC/UFMG
Simple cross-platform colored terminal text in Python
⚡ A Fast, Extensible Progress Bar for Python and CLI
The leading native Python SSHv2 protocol library.
Bonus materials, exercises, and example projects for our Python tutorials
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
DuckDB is an analytical in-process SQL database management system
A mimic of the VirtualEnvWrapper project but with Powershell
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.