Skip to content
View yullieyang's full-sized avatar

Block or report yullieyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yullieyang/README.md

Hi, I'm Yullie πŸ‘‹

Quantitative analyst building reproducible financial / economic research workflows, stress-testing tools, model-QA systems, and reviewable documentation. I work at the intersection of economics, data, and engineering β€” turning macro and financial data into versioned, reviewable artifacts. Interested in responsible use of AI coding tools to make analytical workflows more systematic and reviewable.

  • πŸ’Ό Quantitative Analyst, CoStar Group β€” Boston, MA (2025 – Present)
  • πŸ§ͺ Data Scientist / Consultant, Guidehouse β€” McLean, VA (2024 – 2025)
  • πŸ› Graduate Intern (Summer 2023), Board of Governors of the Federal Reserve System β€” Division of International Finance
  • πŸŽ“ M.S. Business Analytics, University of Maryland, College Park (Dec 2023)
  • 🌏 B.A. Global Political Economy, Waseda University, Tokyo (2022)
  • πŸ“ Boston, MA

What I'm focused on

Production-style research-support code: pulling macro / trade / commodity series from authoritative sources (FRED), harmonizing across frequencies, computing derived measures, producing short-horizon forecasts with documented uncertainty, and packaging deterministic QA and review workflows so a reviewer can audit every step from Git. AI coding tools may support scaffolding, documentation review, and consistency checks, but the workflow logic, assumptions, validation criteria, and final outputs remain human-reviewed.

Featured work

Repo What it is
r-macro-trade-commodity-forecast Reproducible R pipeline: 13 FRED macro / trade / commodity series β†’ quarterly panel with implicit trade deflators and terms of trade β†’ 8-quarter auto.arima forecasts for net exports, real GDP, and WTI β†’ distributed-lag FX pass-through regression on U.S. trade prices. CI + Quarto Pages dashboard.
cre_stress_test Portfolio-demo stress-testing workflow on 100% public data (FRED + Google Mobility + Boston Zoning). Python package + R/auto.arima companion + SQLAlchemy persistence + Streamlit dashboard. pytest + CI. Personal project; does not reflect any employer's internal data or models.
llm-research-workflow-assistant Prompt templates, sample outputs, and a human-in-the-loop checklist for using AI coding tools responsibly in recurring research workflows (data QA, code review, brief review, documentation drafting).
model-output-qa-dashboard Streamlit dashboard for quarterly model-release QA: compares prior vs. current model-output extracts, runs fixed checks (missing values, duplicate keys, schema drift, out-of-range, scenario coverage), computes row-level deltas, and exports a Markdown review report with a human-reviewer checklist. Synthetic data.
yullieyang.github.io Personal portfolio site (GitHub Pages) β€” bio, background, and featured projects.

Toolbox

  • Languages: R (primary), Python, SQL
  • Data & infra: FRED, SQLite / PostgreSQL / SQLAlchemy, AWS, Azure, Tableau, Power BI
  • Engineering: Git, GitHub Actions, R Markdown / Quarto, Streamlit, make-driven pipelines, pytest, testthat
  • Modeling: ARIMA / auto.arima, distributed-lag regression, classification under class imbalance, SHAP explainability
  • AI / LLM workflows: Claude API, Claude Code for pair-programming and documentation, prompt-template design, human-in-the-loop review processes

How I use AI tools

I treat AI coding tools as support for scaffolding, refactoring, and documentation review β€” not as substitutes for analyst judgment. Analytical logic, assumptions, validation checks, and final outputs are independently verified. Every repo that uses an LLM in its workflow includes a CLAUDE.md defining how the model should behave, a clear human-review step before outputs are shared, and explicit limitations of what the AI-assisted output represents.

Contact

πŸ“¬ yullieyang@gmail.com Β· 🌐 yullieyang.github.io Β· πŸ’Ό LinkedIn

Pinned Loading

  1. r-macro-trade-commodity-forecast r-macro-trade-commodity-forecast Public

    Reproducible R workflow: FRED macro/trade/commodity panel, auto.arima forecasts for net exports, real GDP, and WTI, FX pass-through regression.

    R

  2. llm-research-workflow-assistant llm-research-workflow-assistant Public

    Responsible AI workflow prototype for research QA, documentation, and human-in-the-loop review.

    Python

  3. cre_stress_test cre_stress_test Public

    Production-style CRE credit-risk modeling pipeline β€” Python package + R/auto.arima companion + SQLAlchemy persistence + Streamlit dashboard. FRED, Google Mobility, Boston Zoning. pytest + CI.

    HTML