A human with a passion for open-ed, large datasets, and beautiful visualisations.
Stars
The best place to learn data engineering. Built and maintained by the data engineering community.
Article: Is There a Better Way to Rank Business Schools?
Apache Spark - A unified analytics engine for large-scale data processing
Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies
A guide for anyone interested in joining the 18F team
IPython and Jupyter in-depth Tutorial, first presented at PyCon 2012



