Skip to content
View rafaelgensen's full-sized avatar
๐Ÿ’ป
Working from home
๐Ÿ’ป
Working from home

Block or report rafaelgensen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
rafaelgensen/README.md

Rafael โ€” Data Engineer

Hi, I'm Rafael, a Data Engineer passionate about transforming raw data into reliable, scalable, and insightful systems.
I enjoy designing data platforms, automating workflows, and enabling analytics teams to move faster with trustworthy data.

(lost my previous account)


๐Ÿงฉ Core Technologies

Python SQL Spark Terraform Snowflake Databricks AWS GCP Docker Airflow dbt Kafka Delta Lake Kubernetes Tableau


โš™๏ธ What I Focus On

  • Delivering cloud-native and cost-efficient data solutions
  • Building robust data pipelines and ETL/ELT frameworks
  • Designing data models optimized for analytics and ML workloads
  • Implementing infrastructure as code and CI/CD for data
  • Delivering cloud-native and cost-efficient data solutions

๐Ÿ“Š GitHub Overview

Top Languages


๐ŸŒ Connect

LinkedIn

Pinned Loading

  1. crypto-lakehouse-pipeline crypto-lakehouse-pipeline Public

    Crypto batch data pipeline on AWS using Medallion architecture (Spark, Glue, Step Functions, EventBridge).

    HCL

  2. airbnb-berlin-analytics airbnb-berlin-analytics Public

    Analyzing Airbnb Berlin data with Snowflake, dbt, Preset and SQL. End-to-end ELT for pricing and host insights.

    Python

  3. realtime-car-sales-pipeline realtime-car-sales-pipeline Public

    Real-time event streaming pipeline to process car purchase and cancellation events on GCP.

    HCL

  4. SBDL SBDL Public

    Distributed data processing pipeline with PySpark and Kafka for scalable ETL.

    Python