Projects
A collection of data engineering and cloud architecture projects I've built throughout my career.
📊
Real-Time Data Pipeline
Built a real-time streaming pipeline using Kafka, Spark Streaming, and AWS services to process 1M+ events per day.
Kafka · Spark · AWS Kinesis · Python
📊
Scalable ETL Framework
Developed a reusable ETL framework with Airflow and AWS Glue, reducing development time by 60%.
Airflow · AWS Glue · Python · dbt
📊
Data Lakehouse on AWS
Architected a data lakehouse solution using S3, Glue, and Athena for cost-effective analytics.
S3 · Glue · Athena · Terraform
📊
ML Feature Pipeline
Built an automated ML feature engineering pipeline using Amazon SageMaker and AWS Step Functions.
SageMaker · Step Functions · Python