Skip to content
View jameskoero's full-sized avatar
💭
Open to opportunities worldwide
💭
Open to opportunities worldwide

Block or report jameskoero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jameskoero/README.md

Hi 👋, I'm James Koero

Junior ML Engineer | Physics & Math Background | Building AI for Africa 🌍

[

LinkedIn

](https://linkedin.com/in/jameskoero) [

Twitter

](https://twitter.com/jmsOnyango) [

Email

](mailto:jmskoero@gmail.com) [

Hire Me

](https://linkedin.com/in/jameskoero)

Profile Views


🧑‍💻 About Me

Self-taught ML Engineer from Kisumu, Kenya 🇰🇪 — building production-ready ML systems that solve real African problems, from flood prediction to salary transparency.

With a strong foundation in Physics and Mathematics from Moi University and hands-on geophysical research at KenGen's Olkaria Geothermal Project, I bring scientific rigour to every ML system I build — coded entirely on Android (Termux + PyramIDE).

  • 🎓 B.Sc. Physics (Major) + Mathematics (Minor) — Moi University, 2012
  • 🏭 Industrial Attachment — KenGen Olkaria Geothermal Project, 2011 (MT and TEM methods)
  • 🔬 Research — Eburru Geothermal Prospect geophysical study — Prof. Mghendi Mwamburi
  • 📍 Location — Kisumu, Kenya (near Lake Victoria)
  • 🕒 Timezone — EAT (UTC+3)
  • 🌐 Open To — Remote ML Engineer / Data Science roles — US · EU · Global

🚀 What I Do

  • 🧠 Build end-to-end ML systems — raw data to feature engineering to model training to live API
  • 📐 Apply Physics and Mathematics background to feature engineering and model evaluation
  • 🌍 Solve real African problems — flood risk, salary transparency, credit scoring
  • 🚢 Ship production-grade code — FastAPI · Docker · GitHub Actions CI/CD
  • 📊 Explain every prediction with SHAP — because unexplainable AI is not good enough

🛠️ Tech Stack

Languages and Data

Python

PostgreSQL

NumPy

Pandas

Machine Learning and AI

Scikit-learn

XGBoost

GradientBoosting

SHAP

Visualisation

Matplotlib

Web and Deployment

FastAPI

Flask

React

Docker

Render

Vercel

DevOps and Tools

GitHub Actions

Git

Jupyter

Google Colab


🏆 Live Projects

🌊 Nyando Flood AI — Production ML for 50,000+ Residents

End-to-end flood prediction system for Nyando Basin, Kenya

[

Live API

](https://nyando-flood-api.onrender.com/docs) [

CI

](https://github.com/jameskoero/nyando-flood-ai) [

Repo

](https://github.com/jameskoero/nyando-flood-ai)

Metric Value
Model GradientBoostingClassifier
AUC 0.9717
F1 Score 0.9022
CV Score 0.9727 +/- 0.004
Training Data 2,308 real GEE satellite points
Deployment FastAPI · Docker · Render
CI/CD GitHub Actions — 41 tests passing
Funding Target UNDP · USAID · Google.org · Green Climate Fund

💰 AfriSalaries — Africa Salary Transparency Platform

Real-time salary prediction API across 8 African countries

[

Live API

](https://afrisalaries.onrender.com/docs) [

Frontend

](https://afrisalaries.vercel.app) [

Repo

](https://github.com/jameskoero/afrisalaries)

Item Detail
Model XGBoost + TF-IDF (5,000 features) + SHAP
Coverage KE · NG · ZA · GH · ET · TZ · UG · RW
Data Real scraped data — BrighterMonday · Fuzu · Target: 3,000+ rows
Database Neon PostgreSQL (async)
Cache Upstash Redis REST
Stack FastAPI · React 18 · Vite · Tailwind CSS · Recharts

🏦 Loan Default Risk Assessment — Basel III Compliant

Credit risk ML system with regulatory-grade metrics

[

Repo

](https://github.com/jmskoero/loan-risk-assessment)

Metric Value
Model GradientBoostingClassifier + SHAP
Gini Coefficient 0.74 (Basel III minimum: 0.35 — pass)
Framework EL = PD x LGD x EAD
IFRS 9 Staging Stage 1 PD less than 2% · Stage 2 2-10% · Stage 3 above 10%
Threshold 0.35 saves 23% cost vs default 0.50

🚢 Titanic Survival Prediction — Interpretable ML Pipeline

Senior-grade, leak-free sklearn pipeline with full explainability

[

Repo

](https://github.com/jameskoero/titanic-survival-prediction)

Metric Value
Hold-out Accuracy 78.77%
CV Accuracy 82.4%
ROC-AUC 0.8456
Key Insight Sex coefficient +2.61 = 13.5x survival odds
Pipeline StratifiedKFold · ColumnTransformer · GridSearchCV F2 · Bootstrap 95% CIs

⛪ CMDMS — Church Digital Management System

Full-stack PWA managing 200+ members live

[

Live

](https://cmdms.onrender.com) [

Repo

](https://github.com/jameskoero/cmdms-web)

  • Features: 5-role RBAC · MRH-XXXXXX Member IDs · KES Finance · Attendance · Events
  • Stack: Flask 3.0 · React 18 PWA · PostgreSQL · Render · Vercel · GitHub Actions CI/CD

🌱 Africa ML Roadmap — Planned, Not Yet Built

Priority Project Domain Data Source
High Crop Disease Detection Computer Vision PlantVillage + field data
High Malaria Outbreak Prediction Public Health WHO, DHIS2, climate
High Flood Risk Mapping v2 Geospatial ML CHIRPS, USGS DEM
Planned M-Pesa Fraud Detection FinTech Transaction patterns
Planned Credit Scoring Unbanked Finance Alternative data
Planned Solar Potential Mapping Energy NASA POWER API
Planned Matatu Route Optimisation Transport OSM, GTFS
Planned Lake Victoria Water Quality Environment Satellite + IoT

📊 GitHub Stats

James's GitHub Stats

Top Languages

GitHub Streak


📜 Certifications

Certificate Issuer Date ID
Machine Learning using Python Programming Hub / Google Developers Launchpad Oct 2025 bae4cf502b3dfe5
Python Basics Programiz Sep 2025 08ddece2-fd4c-40eb-88d9-8f6b142466b0

🎓 Education and Background

B.Sc. Physics (Major) + Mathematics (Minor) — Moi University, Kenya | 2008-2012

  • Classical Mechanics · Statistical Physics · Linear Algebra · Calculus · Numerical Methods
  • Research: Eburru Geothermal Prospect using MT and TEM methods — Prof. Mghendi Mwamburi

Industrial Attachment — KenGen Olkaria Geothermal Project | 2011

  • Large-scale geophysical survey data collection and processing in the field
  • Applied MT and TEM subsurface imaging — first exposure to scientific data pipelines

💭 Engineering Philosophy

"I build production ML systems — not just notebooks. My Physics and Mathematics background means I think carefully about what a model is actually measuring before I trust its output. Every project I ship has a live URL, SHAP explainability, and a real problem it solves — because that is what actually helps people and what gets you hired."


📬 Let's Connect

Available for remote ML Engineer and Data Science roles — worldwide.

[

LinkedIn

](https://linkedin.com/in/jameskoero) [

Email

](mailto:jmskoero@gmail.com) [

CMDMS

](https://cmdms.onrender.com) [

Nyando

](https://nyando-flood-api.onrender.com/docs) [

AfriSalaries

](https://afrisalaries.onrender.com/docs)

📍 Kisumu, Kenya 🇰🇪 | 🕒 EAT (UTC+3) | 🌐 Remote-first


"The best time to build deep ML expertise was 5 years ago. The second best time is today."

— James Koero

Pinned Loading

  1. afrisalaries afrisalaries Public

    ML model predicting hidden tech salaries across Africa from job descriptions | XGBoost, FastAPI, Docker, SHAP, React

    Python

  2. cmdms- cmdms- Public

    Digital management system for Ministry of Repentance and Holiness — Carwash Main Altar, Kisumu

    Python

  3. loan-risk-assessment loan-risk-assessment Public

    Advanced Loan Default Risk Assessment" Loan default risk ML system · GradientBoosting · SHAP · Python

    Jupyter Notebook

  4. nyando-flood-ai nyando-flood-ai Public

    AI-powered flood risk prediction for Nyando Basin, Kenya. XGBoost + SHAP + FastAPI + React. AUC-ROC 0.94. 100% open data — Kenya DPA 2019 compliant.

    Python

  5. titanic-survival-prediction titanic-survival-prediction Public

    Senior-grade Logistic Regression on Titanic · Leak-free sklearn Pipeline + ColumnTransformer · SHAP waterfalls for Andrews/Dean/Brown · 13 evaluation charts · Bootstrap 95% CIs · F2-tuned threshold…

    Python