Skip to content
View tatwan's full-sized avatar

Highlights

  • Pro

Block or report tatwan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
8 stars written in Scala
Clear filter

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,573 2,001 Updated Feb 7, 2026

A machine learning package built for humans.

Scala 4,799 563 Updated Nov 6, 2025

Spark: The Definitive Guide's Code Repository

Scala 3,092 2,890 Updated Aug 26, 2020

MLeap: Deploy ML Pipelines to Production

Scala 1,532 316 Updated Jan 12, 2026

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala 1,373 785 Updated Jan 28, 2025

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

Scala 1,045 315 Updated Jul 12, 2025

Data Lineage Tracking And Visualization Solution

Scala 653 159 Updated Feb 7, 2026

Automated data quality suggestions and analysis with Deequ on AWS Glue

Scala 91 23 Updated Dec 29, 2022