Jonathan Dang

Data Science & Computer Science Student

Strategic, results-focused CS student with a Data Science minor, seeking to apply analytical skills in data-focused roles. Aiming to leverage technical expertise to solve real-world problems and support data-informed decisions.

About Me

I'm a Computer Science student at California State Polytechnic University, Pomona, with a minor in Data Science and a 3.94 GPA. I'm passionate about using data and technology to solve complex problems and surface meaningful insights.

I currently work as an Enforcement Data Operations Intern at the California Air Resources Board, where I've developed machine learning models and automated data pipelines to improve environmental monitoring and compliance.

Professional Experience

Feb 2025 - Present

Enforcement Data Operations Intern

California Air Resources Board - Riverside, CA

  • Designed and implemented a real-time MIL detection system using an XGBoost classifier, increasing precision in identifying high-emitting heavy-duty vehicles from 18% to 36%
  • Developed and automated dbt models in SQL to clean, transform, and integrate emissions data across multiple sources, enhancing data reliability and workflow efficiency
  • Presented findings to section managers and authored a technical report, receiving positive feedback and positioning the project for future adoption

Feb 2025 - May 2025

Boeing Data Analytics Consultant

MISSA Data Analytics Team - Pomona, CA

  • Led analysis of 2024 commercial flight data to quantify total passenger volume on Boeing aircraft and benchmark market performance against competitors
  • Consolidated and enriched flight records by integrating aircraft reference data, standardizing key fields, and eliminating invalid entries to ensure data accuracy
  • Developed an interactive Tableau dashboard with dynamic filters by manufacturer, model, month, and state, enabling clear visualization of regional and seasonal demand trends

Jun 2024 - Aug 2024

Undergraduate Researcher

California State Polytechnic University, Pomona - Pomona, CA

  • Conducted research on machine learning approaches to detect AI-generated content in student essays, contributing to academic integrity initiatives
  • Developed Python scripts to process and structure over 30,000 essay files, enabling efficient model training
  • Implemented a classification framework that achieved an F1 score of 0.89, demonstrating strong model performance and practical application in educational settings

Projects

OneStop Electronics E-Commerce Analysis

Python, Pandas, PostgreSQL, Tableau

May 2025 - Jun 2025

  • Designed and implemented a robust end-to-end data pipeline using Python and PostgreSQL to automate the ingestion, transformation, and validation of over 100,000 e-commerce transactions
  • Enhanced data accuracy by 15% through comprehensive sales data cleanup and validation
  • Developed an interactive Tableau dashboard to visualize key performance indicators

Bank Loan Default Risk Model

Python, Pandas, Scikit-learn, Matplotlib

Nov 2024 - Dec 2024

  • Led a team of four in developing a logistic regression model to predict loan default risk, achieving an F1 score of 0.71
  • Enhanced data quality by preprocessing 900k+ rows of financial data
  • Conducted a cost-benefit analysis that demonstrated an average profit of $4,200 per loan

Technical Skills

Languages

Python, SQL, Java, HTML/CSS

Database Systems

MS SQL, PostgreSQL, dbt (Data Build Tool), MS Excel

Cloud Technologies & CI/CD

Git, Snowflake

Data Visualization & Reporting

Tableau, Pandas, Matplotlib, Scikit-learn

Analytics

Exploratory Data Analysis (EDA), Dashboard Design, KPI Reporting

Get In Touch

I'm always interested in new opportunities and collaborations. Feel free to reach out!