- Austin, TX
-
04:21
(UTC -05:00)
Stars
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
A Data Streaming Library for Efficient Neural Network Training
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…
XML data source for Spark SQL and DataFrames
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Code to accompany Advanced Analytics with Spark from O'Reilly Media
ZXing ("Zebra Crossing") barcode scanning library for Java, Android