Projects

A collection of data engineering and cloud architecture projects I've built throughout my career.

📊
Featured

Real-Time Data Pipeline

Built a real-time streaming pipeline using Kafka, Spark Streaming, and AWS services to process 1M+ events per day.

KafkaSparkAWS KinesisPython
📊
Featured

Scalable ETL Framework

Developed a reusable ETL framework with Airflow and AWS Glue, reducing development time by 60%.

AirflowAWS GluePythondbt
📊
Featured

Data Lakehouse on AWS

Architected a data lakehouse solution using S3, Glue, and Athena for cost-effective analytics.

S3GlueAthenaTerraform
📊

ML Feature Pipeline

Built an automated ML feature engineering pipeline using AWS SageMaker and Step Functions.

SageMakerStep FunctionsPython