Carlos León

Projects

Public projects that reflect how I approach analytics engineering, data modeling, cloud workflows, and stakeholder-facing outputs.

Chicago Taxi Trips vs Weather Pipeline

End-to-end pipeline combining Chicago taxi trip data and weather signals to produce analytics-ready models and dashboards.

Weather API
Taxi Data
Cloud Functions
BigQuery
dbt
Looker Studio
Google CloudBigQuerydbtTerraformCloud FunctionsLooker StudioGitHub Actions
Chicago Taxi Trips vs Weather Pipeline architecture

Problem / Context

I wanted a reproducible way to analyze how weather conditions affect taxi trip activity, using a real pipeline rather than an isolated notebook.

What I built

An end-to-end analytics workflow on Google Cloud, including ingestion, infrastructure, transformations, orchestration, testing, and BI output.

Architecture

  • Terraform provisions the GCP resources and baseline infrastructure.
  • Cloud Functions and scheduled jobs handle ingestion and refresh workflows.
  • BigQuery stores raw and modeled data, with dbt managing transformations.
  • Looker Studio consumes the final models for stakeholder-friendly reporting.

Outcomes

  • Connected ingestion, storage, modeling, and reporting in one reproducible system.
  • Kept the project close to real production patterns with orchestration and CI/CD.
  • Created a portfolio piece that shows both analytics engineering and cloud workflow thinking.

What I'd improve next

Add stronger monitoring and freshness checks around ingestion reliability and scheduled pipeline runs.

NBA Data Pipeline

ELT pipeline for NBA data using Snowflake and dbt, modeled with bronze, silver, and gold layers for BI consumption.

Raw Data
Bronze
Silver
Gold
Power BI
dbtSnowflakeSQLPower BIELT
NBA Data Pipeline architecture

Problem / Context

I built this project to show a clean analytics engineering workflow using medallion modeling and business-facing outputs rather than raw-source analysis.

What I built

A Snowflake and dbt pipeline that transforms raw NBA data into structured analytical models for game results, player performance, and team analysis.

Architecture

  • Raw NBA data lands in Snowflake and is organized into layered models.
  • dbt handles transformations, testing, and model structure across bronze, silver, and gold layers.
  • Gold-layer outputs are shaped for BI consumption and reporting in Power BI.

Outcomes

  • Demonstrated a strong grasp of medallion architecture and analytics-ready modeling.
  • Showed clear separation between raw ingestion, transformation logic, and business-facing outputs.
  • Created a project that maps well to real analytics engineering work.

What I'd improve next

Extend the project with more reusable metrics definitions and richer documentation for downstream consumers.