Work & Education

A deep dive into my technical projects, open-source contributions, and academic journey.

GitHub Projects

Price Optimization Model

End-to-end pricing pipeline: demand modeling, profit-maximizing recommendations, backtests.

PythonMLPlotly

SHA Disbursement Dashboard

Automates extraction from SHA PDFs and visualizes disbursements across facilities/time.

PythonDashCamelot

Neonatal Outcomes Dashboard

Sample analytics dashboard aligned to neonatal monitoring metrics.

DashPostgresSQL

Disaster Response Pipeline

Multilabel text classifier + needs detection; ETL orchestration and ingestion pipeline.

FastAPIDagsterNLP

Data Maturity & Governance Tool

Assessment utilities to score maturity, surface gaps and drive roadmaps.

GovernanceScoringCDMP

Bike Sharing Analysis

Explores drivers of demand in Washington, D.C. with temporal patterns.

EDATime SeriesARIMA

Bank Marketing Analysis

Campaign outcomes analysis; baselines for uplift/propensity modeling.

ClassificationFeature Eng

Sales Forecast

Classic demand forecasting with ARIMA/Prophet; seasonality and trend components.

ProphetForecasting

Redact PIIs

Named-entity based PII redaction workflow; quick compliance helper.

spaCyNERRegex

Where the Devs Are

Geospatial exploration of developer distributions from Stack Overflow survey data.

GeoPandasQGISMarimo

Education Timeline

  • MSc in Data Management

    International University of Applied Sciences — Germany

    Advanced study of governance operating models, data architecture, and quality frameworks.

    Advanced study of governance operating models, data architecture, quality frameworks, and metadata/lineage management.

    • Produced strategy documents and governance playbooks mapped to DAMA-DMBOK2 functions.
    • Applied GDPR and Kenya DPA principles to data lifecycle design and AI use-case risk reviews.
    • Explored research methods, survey design, and field data collection techniques.
    Data GovernanceData StrategyMetadata

    Now

  • CDMP Certification

    DAMA International

    Certified in data management practices aligned with DAMA-DMBOK2.

    Certified in data management practices aligned with DAMA-DMBOK2.

    • Developed frameworks for data governance, stewardship, and compliance.
    • Applied methods for data quality, metadata, and master/reference data management.
    • Covered topics on data architecture, integration, and warehousing design.
    GovernanceStewardshipData Quality

    2025

  • Data Science Nanodegree

    Udacity

    Hands-on projects across supervised and unsupervised learning.

    Hands-on projects across supervised and unsupervised learning, model evaluation, and deployment.

    • Applied data wrangling, feature engineering, and experimentation techniques.
    • Built and deployed machine learning pipelines with attention to scalability.
    • Capstone: Disaster Response Pipeline — multilabel NLP classification.
    PythonMLNLP

    2021

  • Data Analyst Nanodegree

    Udacity

    SQL, statistical inference, EDA, and visualization.

    SQL, statistical inference, exploratory data analysis (EDA), and visualization.

    • Applied statistical methods to evaluate hypotheses and uncover trends.
    • Projects included OpenStreetMap data wrangling and weather trends analysis.
    SQLEDAStatistics

    2021

  • BSc in Information Technology

    Multimedia University of Kenya (MMU)

    Database systems, networking, software engineering, and IT governance.

    Database systems, networking, software engineering, and IT governance.

    • Applied SDLC best practices in individual and group projects.
    • Final project: developed a real-time bidding application using Django/SQLite.
    Software EngineeringDjangoIT Governance

    2019