Work & Education

A deep dive into my technical projects, open-source contributions, and academic journey.

GitHub Projects

Price Optimization Model

End-to-end pricing pipeline: demand modeling, profit-maximizing recommendations, backtests.

PythonMLPlotly

SHA Disbursement Dashboard

Automates extraction from SHA PDFs and visualizes disbursements across facilities/time.

PythonDashCamelot

Neonatal Outcomes Dashboard

Sample analytics dashboard aligned to neonatal monitoring metrics.

DashPostgresSQL

Disaster Response Pipeline

Multilabel text classifier + needs detection; ETL orchestration and ingestion pipeline.

FastAPIDagsterNLP

Data Maturity & Governance Tool

Assessment utilities to score maturity, surface gaps and drive roadmaps.

GovernanceScoringCDMP

Bike Sharing Analysis

Explores drivers of demand in Washington, D.C. with temporal patterns.

EDATime SeriesARIMA

Bank Marketing Analysis

Campaign outcomes analysis; baselines for uplift/propensity modeling.

ClassificationFeature Eng

Sales Forecast

Classic demand forecasting with ARIMA/Prophet; seasonality and trend components.

ProphetForecasting

Redact PIIs

Named-entity based PII redaction workflow; quick compliance helper.

spaCyNERRegex

Where the Devs Are

Geospatial exploration of developer distributions from Stack Overflow survey data.

GeoPandasQGISMarimo

Education Timeline

Ongoing

MSc in Data Management

International University of Applied Sciences — Germany

Advanced study of governance operating models, data architecture, quality frameworks, and metadata/lineage management.

  • Produced strategy documents and governance playbooks mapped to DAMA-DMBOK2 functions.
  • Applied GDPR and Kenya DPA principles to data lifecycle design and AI use-case risk reviews.
  • Explored research methods, survey design, and field data collection techniques.
Data GovernanceData StrategyMetadata
2024–2025

CDMP Certification

DAMA International

Certified in data management practices aligned with DAMA-DMBOK2.

  • Developed frameworks for data governance, stewardship, and compliance.
  • Applied methods for data quality, metadata, and master/reference data management.
  • Covered topics on data architecture, integration, and warehousing design.
GovernanceStewardshipData Quality
2021

Data Science Nanodegree

Udacity

Hands-on projects across supervised and unsupervised learning, model evaluation, and deployment.

  • Applied data wrangling, feature engineering, and experimentation techniques.
  • Built and deployed machine learning pipelines with attention to scalability.
  • Capstone: Disaster Response Pipeline — multilabel NLP classification.
PythonMLNLP
2021

Data Analyst Nanodegree

Udacity

SQL, statistical inference, exploratory data analysis (EDA), and visualization.

  • Applied statistical methods to evaluate hypotheses and uncover trends.
  • Projects included OpenStreetMap data wrangling and weather trends analysis.
SQLEDAStatistics
2013 – 2019

BSc in Information Technology

Multimedia University of Kenya (MMU)

Database systems, networking, software engineering, and IT governance.

  • Applied SDLC best practices in individual and group projects.
  • Final project: developed a real-time bidding application using Django/SQLite.
Software EngineeringDjangoIT Governance