Skip to content
View Mbehbahani's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@ApexAILabs

Block or report Mbehbahani

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Mbehbahani/README.md

Mohammad Behbahani

🎓 Senior AI Engineer
📍 Netherlands
🔹 Production LLM Systems · RAG & Agents · Decision Intelligence

I design and ship production LLM systems — multi-agent architectures, retrieval-augmented generation pipelines, and evaluation infrastructure — on top of a foundation in operations research, optimization, and probabilistic modeling.

My work focuses on:

  • Multi-agent LLM systems and orchestration
  • Enterprise RAG with grounded, citation-aware answers
  • LLM evaluation and AI observability
  • Production AI on AWS using Bedrock, Lambda, OpenSearch, FastAPI, and MLflow
  • Decision intelligence where mathematical rigor supports real-world AI products

I’m especially interested in building AI systems that are deployable, measurable, and trustworthy — not just demos.


Current Projects 🚀

  • 🏛️ Tax Authority Enterprise RAG — enterprise RAG architecture and evaluation framework for regulated tax-domain QA, with secure OpenSearch retrieval, Bedrock-based grading, citation verification, and observability.
  • 🤖 JobPilot — AI-powered job orchestration platform with Kanban workflows, personalized job search, automation, and production-oriented full-stack architecture. Live at jobpilot.oploy.eu.
  • 🤖 job-agent-backend — FastAPI backend for AI-powered job search, CV matching, tool-calling workflows, and MLflow-traced evaluation on AWS Lambda.
  • 📈 mlflow-tracking-server — self-hosted MLflow setup for experiments, traces, prompt optimization, and evaluation in AI systems.
  • 🧠 job-analytics-frontend — Next.js frontend for job analytics, AI chat, search workflows, and CV matching.
  • 📊 job-market-pipeline — Python pipeline for scraping, filtering, normalizing, deduplicating, and exporting job-market data.

Research & Applied Work

  • 🌐 Oploy Platform — applied decision-support platform translating optimization models into practical tools for routing, scheduling, and network design.
  • 🔬 2sHMMREM — research code for a two-stage Hidden Markov / Relational Event Modeling framework.
  • 🛠️ RepairShop-milp — MILP-based implementation for repair shop optimization.
  • ⚙️ ANN-SMB-SemiExpSimOpt — simulation-optimization and neural-network-based research code for process systems engineering.

Focus Areas

  • Enterprise RAG & Agentic Systems — RBAC-safe retrieval, grounding, citation verification, orchestration, and control loops
  • LLM Evaluation & AI Observability — evaluation pipelines, MLflow tracing, telemetry, and prompt iteration
  • Production AI on Cloud — FastAPI, Docker, AWS, deployable APIs, and model-agnostic system design
  • Applied AI Products — intelligent job-search workflows, CV matching, analytics dashboards, and AI-assisted tools
  • Decision Intelligence & Operations Research — optimization, simulation, and probabilistic modeling as the mathematical layer beneath the AI systems

Tech and Tools 🧰

LLMs, RAG and Agents

Development, Infrastructure and Cloud

Data Science, Optimization and Analytics


What I Build

  • Enterprise RAG systems — grounded retrieval, citation verification, and evaluation-gated quality
  • Multi-agent LLM applications — orchestration, tool-calling, memory, and reliable workflows
  • Evaluation pipelines and AI observability — quality metrics, tracing, prompt iteration, and runtime visibility
  • Production AI services on AWS — Bedrock, Lambda, OpenSearch, FastAPI, MLflow, and CI/CD-ready systems
  • AI-assisted products — job search, CV matching, analytics dashboards, and decision-support tools
  • Decision-intelligence systems — routing, scheduling, network design, and simulation-backed applications

Why this combination

The combination of operations research, probabilistic modeling, and production AI engineering is rare. Many AI engineers lack the mathematical depth for decision systems; many quantitative specialists do not build deployable products. I work where those two worlds meet, designing AI systems that are grounded, evaluated, and production-ready.


Selected Certifications


Connect 📫

Pinned Loading

  1. JobPilot JobPilot Public

    AI-powered job orchestration platform with Kanban workflows, LLM agents, and production-oriented system architecture.

    Svelte

  2. job-market-pipeline job-market-pipeline Public

    Open-source Python pipeline for scraping, normalizing, and analyzing job market data for operations research and data science roles.

    Python

  3. RepairShop-milp RepairShop-milp Public

    Official code for the research paper

    MATLAB 1

  4. mlflow-tracking-server mlflow-tracking-server Public

    Self-hosted MLflow tracking server for experiments, traces, and prompt optimization.

    Python 1 1

  5. wagtail/news-template wagtail/news-template Public

    A Wagtail template for a news site

    Python 173 73

  6. 2sHMMREM 2sHMMREM Public

    Two-stage Hidden Markov Model and relational event modeling project for sequential and event-driven data analysis.

    R 2