Hello, I'm
Data Engineer
Get To Know More
Hi, I’m Rafid! I’m a Computer Science student at The City College of New York graduating in May 2026, and I’m especially interested in data. My primary focus is data engineering because I enjoy building the pipelines, infrastructure, and systems that make data usable in the first place. I like the challenge of turning messy, unstructured information into something clean, reliable, and ready for analysis.
Alongside that, I’m interested in data science work because I enjoy uncovering insights and understanding how data can drive better decisions. I also explore software engineering, since building reliable tools and applications ties everything together and lets those insights actually make an impact.
Outside of tech, I’m really into travel and photography, so I’m usually planning my next trip or taking photos of wherever I end up. I stay active through sports too. And food? Yeah… I’m a full-on foodie 😅. I love trying different cuisines and dragging friends along for the adventure.
Browse My
Explore My
Jul 2025 – Present
May 2024 – Aug 2024
Jan 2022 – Aug 2022
Browse My Recent
Built a medallion-style ETL pipeline on Databricks and AWS for 120M+ FAA flight records to support analytics and ML.
PySpark · Databricks · Delta Lake · AWS
Automated an Airflow + Docker ELT pipeline to refresh 10,000+ marketplace records with validation and quarantine.
Airflow · Docker · Snowflake · dbt
Built ML-ready datasets from 2.19M NYC collisions with feature engineering to support severity prediction.
Python · pandas · scikit-learn
Analyzed 107,974 ridership rows and built dashboards comparing MetroCard and OMNY usage.
Python · BI
Get in Touch