Skip to content
View srowen's full-sized avatar
🤠
🤠

Organizations

@apache @OryxProject

Block or report srowen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 249 10 Updated Apr 17, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 3,041 262 Updated May 6, 2026

Curate better data for LLMs

Python 1,070 105 Updated Mar 19, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,503 191 Updated Feb 2, 2026

The Stockfish testing framework

Python 337 153 Updated May 7, 2026

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,792 1,145 Updated Jun 30, 2023

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.

Python 679 255 Updated May 1, 2026

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 25,864 5,712 Updated May 11, 2026

XML data source for Spark SQL and DataFrames

Scala 513 224 Updated Aug 11, 2024

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Java 1,785 401 Updated Aug 16, 2021

Code to accompany Advanced Analytics with Spark from O'Reilly Media

Scala 1,520 1,018 Updated Sep 25, 2024

ZXing ("Zebra Crossing") barcode scanning library for Java, Android

Java 33,935 9,433 Updated May 4, 2026