Summary
Overview
Work history
Education
Skills
Websites
Project samples
Timeline
Generic

Ezra Kipchirchir Bett

Bristol,Bristol

Summary

Data Scientist skilled in Python, NumPy, Pandas, with a strong foundation in machine learning frameworks such as Keras, Pytorch, Scikit-Learn and TensorFlow. Expertise in SQL and data visualization tools like Matplotlib, Seaborn, Plotly, and Tableau, complemented by experience in regression, classification, and clustering techniques. Proficient in using Jupyter Notebooks, Git, Docker, and AWS for development and deployment, with a focus on big data processing using Apache Spark and PySpark. Proven ability to manage projects effectively while maintaining attention to detail and fostering collaboration among team members.

Overview

1
1
year of professional experience
4
4
years of post-secondary education

Work history

Data Science cohort

Moringa school
Nairobi, Nairobi
01.2024 - 07.2024
  • Developed expertise in Python and SQL by building solutions that streamlined data manipulation and improved analysis efficiency by 30%.
  • Applied data preprocessing techniques, such as cleaning and transformation, to prepare raw datasets and uncover meaningful patterns.
  • Implemented machine learning algorithms using Scikit-learn and TensorFlow to solve classification and regression problems with measurable accuracy of over 80%.
  • Delivered real-world data science projects that addressed complex challenges and produced actionable insights for stakeholders.
  • Leveraged deep learning models for image recognition tasks, strengthening product quality assurance and reducing defect detection errors.
  • Employed statistical modelling techniques for hypothesis testing, enabling validation of key business assumptions and data-driven decision-making.
  • Executed SQL queries to extract and aggregate data from relational and non-relation databases, supporting the creation of critical business reports.
  • Optimised Python code to accelerate data processing workflows, reducing analysis runtimes and improving overall productivity.
  • Engineered scalable ETL pipelines that automated data integration, increasing system reliability and reducing manual workload.
  • Applied advanced time-series analysis methods to improve forecasting accuracy, enhancing the planning and resource allocation process.
  • Strengthened technical proficiency through hands-on work in diverse projects and enhanced interpersonal skills by collaborating effectively with peers.

Intern

Kenya Power and Lightning Company (KPLC)
Thika town, Kiambu
05.2023 - 08.2023
  • Developed algorithms that improved pattern recognition accuracy in complex datasets, enabling the identification of hidden trends and anomalies.
  • Supported project planning by tracking deliverables and ensuring milestones were consistently met, which maintained an on-time delivery rate of 95%.
  • Contributed to strategic planning discussions during team meetings by providing data-driven insights that influenced project decisions.
  • Leveraged SQL to extract and manipulate data from databases of over 5 million records, improving query efficiency by 25%.
  • Streamlined data processing through Python scripts, reducing analysis time by 30% and enabling faster turnaround of insights.
  • Applied machine learning techniques to increase predictive accuracy from 76% to 91%, leading to more reliable data interpretations and forecasts.
  • Assisted senior scientists in exploratory data analysis that uncovered critical patterns and guided subsequent stages of research.
  • Collaborated with the IT team to troubleshoot and optimize system performance, reducing downtime by 20% and enhancing overall reliability.
  • Conducted statistical analysis that clarified data trends, reduced reporting errors by 15%, and improved the quality of business insights.
  • Designed and deployed user-friendly dashboards to present complex data visually, which increased stakeholder adoption and usage by 40%.
  • Incorporated Big Data tools such as Hadoop and Spark to efficiently process datasets exceeding 1TB, reducing computation time by 30%.
  • Developed predictive models that strengthened business decision-making, improving forecasting reliability by 18% and supporting more accurate planning.
  • Analyzed and processed complex datasets using advanced analytical tools, delivering actionable insights that contributed to cost savings of more than £25,000 annually.

Education

Master's - Data Science

University of the West of England (UWE)
Bristol, England
09.2024 - 09.2025

Certificate - Data Science

Moringa School
Nairobi, Kenya
09.2023 - 03.2024

Bachelor's of Science - Information Technology

Mount Kenya University
Nairobi, Kenya
09.2020 - 09.2023

Skills

  • Python
  • NumPy
  • Pandas
  • Scikit-Learn
  • Keras
  • TensorFlow
  • SQL
  • Matplotlib
  • Seaborn
  • Plotly
  • Tableau
  • Regression
  • Classification
  • Clustering
  • Ensemble Methods
  • Neural networks
  • Jupyter Notebooks
  • Colab
  • VScode
  • Git
  • Docker
  • Streamlit
  • AWS
  • Hypothesis testing
  • A/B testing
  • Data wrangling and analysis
  • Apache Spark
  • PySpark
  • MySQL
  • MongoDB
  • Project management
  • Collaboration abilities
  • Attention to Detail
  • Data reporting
  • Communication skills
  • Calm under pressure
  • Problem-solving
  • Team building

Project samples

Car parking detection using computer vision, https://github.com/dev-ezzy/Car-parking-space-detection-using-computer-vision.git, This project uses computer vision to detect and monitor parking space availability in real time from CCTV footage. By applying object detection and image processing techniques, the system can identify vehicles, map them to predefined parking slots, and determine whether each slot is occupied or vacant.

Traffic-weather analysis, https://github.com/dev-ezzy/TrafficWeather-analysis.git, This repository is a collaborative project where we analyze the impact of weather conditions on traffic patterns using data science techniques. Our goal is to uncover how factors like rain, temperature, wind, and humidity influence traffic flow, congestion, and accident rates.

Recommender system, https://github.com/dev-ezzy/Movie-Recommendation-System-.git, Machine learning project aimed towards making a model to solve the problem of recommending movies to consumers enjoying the film industry products. 

Pump status prediction model, https://github.com/dev-ezzy/Classification-model.git, An intermediate intense machine learning approach to predict binary and ternary classification problems using the most efficient algorithms.

House price prediction model, https://github.com/dev-ezzy/House_price-prediction-model.git, Utilize historical real estate data for training, apply regression techniques to establish correlations, and create a predictive model for accurate price predictions in the housing market. The project aims to assist homebuyers and sellers by providing data-driven insights into property values.

Timeline

Master's - Data Science

University of the West of England (UWE)
09.2024 - 09.2025

Data Science cohort

Moringa school
01.2024 - 07.2024

Certificate - Data Science

Moringa School
09.2023 - 03.2024

Intern

Kenya Power and Lightning Company (KPLC)
05.2023 - 08.2023

Bachelor's of Science - Information Technology

Mount Kenya University
09.2020 - 09.2023
Ezra Kipchirchir Bett