Summary
Overview
Work history
Education
Skills
Websites
Projects
Certification
Timeline
Generic

SURABHI S

Summary

Data science master's student with over six years of experience in data-focused roles, currently serving as a machine learning engineer since April 2024. Expertise in developing and deploying machine learning and predictive models, including large language models and CI/CD pipelines. Proficient in Microsoft Fabric modern MLOps practices and end-to-end model lifecycle management. Strong background in data collection, processing, and analysis to drive data-driven decision-making.

Overview

9
9
years of professional experience
4
4
years of post-secondary education
1
1
Certification

Work history

Data Analyst

Mitie
2025.12 - Current
  • Streamlined data processing using automation techniques.
  • Created interactive dashboards with Power BI to assist in data interpretation and presentation.
  • Leveraged OpenAI - GPT models to provide summarised outputs for QHSE dashboard.
  • Utilised SQL programming tasks to manage large databases efficiently.
  • Improved the existing large semantic data model for QHSE reporting.

Jr. Machine Learning Engineer

Mitie
2024.04 - 2025.11
  • Assisted in executing machine learning workflows for data transformation and model deployment.
  • Supported the development of summarisation pipelines using GPT models and OpenAI API for improved data processing.
  • Facilitated the design of predictive models with XGBoost, focusing on probability calibration and performance evaluation.

Delivery Analyst

Kyndryl (IBM Ind Pvt Ltd)
2021.01 - 2022.01
  • Assisted in presenting technical insights and model outputs to non-technical stakeholders in clear actionable formats.
  • Created reports using Excel and Power BI.

Reporting Analyst

IBM India Pvt. Ltd
2020.01 - 2021.01
  • Compiled and analysed statistical data to generate reports and dashboards on performance to highlight defects.
  • Supported team leads by managing data reporting and analysis tasks collaboratively.

Senior Associate

IBM India Pvt. Ltd
2017.01 - 2020.01

· Proactive escalations and ownership of client concerns.

· 1 year of experience as a Desktop Support Engineer in IT Platform.

· Secondary function includes providing support to level 1 agents

Education

M.Sc - Data Science

University of the West of England
Bristol, UK
2023.01 - 2024.01

B.Sc - Software Systems

Sri Krishna Arts & Science College
Coimbatore
2014.01 - 2017.01

Skills

  • Python, PySpark, R, Scikit-learn, Pandas, NumPy
  • SQL, SparkSQL
  • Power BI, Tableau
  • Microsoft Fabric
  • Azure DevOps, CI/CD Pipelines
  • LLM APIs (OpenAI, Azure OpenAI)
  • Data preprocessing, model tuning, evaluation
  • Agile Methodology, Six Sigma
  • Office 365, Active Directory
  • ITIL Concepts
  • Agile Methodology
  • Service Delivery
  • Communication skills
  • Capability to work in fast paced environment
  • Problem solving ability
  • Team Leadership & Mentorship
  • Client Engagement

Projects

  • Mitie Projects: Led LLM-driven categorisation projects at Mitie, processing large datasets using async pipelines to optimise throughput and run tasks in efficient batches. Integrated GPT models via the OpenAI API in Microsoft Fabric, implementing robust pipelines with code review, monitoring, and quality controls.
  • Designed reusable, no-code LLM summarisation templates that allow non-technical users to generate consistent, high-quality summaries without needing coding expertise. Standardised inputs, prompts, and workflow steps to ensure ease of use and reliability across teams.
  • Developed Salesforce predictive models using XGBoost, focusing on feature engineering, model optimisation, and business KPI alignment. Improved model performance through hyperparameter tuning with Optuna, resulting in more accurate and reliable predictions.
  • Masters: I have hands-on experience in data analysis and machine learning through a significant project where I analyzed open accident data (2010-2020) from gov.uk. In this project, I not only performed data cleaning, exploratory data analysis, and feature engineering but also effectively addressed the high imbalance in the data. Handling this data imbalance was crucial for achieving accurate results in the machine learning model. The model, which predicted the criticality of accidents based on factors such as vehicle type and geographical location.

Certification

  • Microsoft Certified: DP-600 – Implementing Analytics Solutions Using Microsoft Fabric (2025)
  • Data Science Foundations - Level 1 – Issued by IBM
  • Deep Learning Essentials – Issued by IBM
  • IBM Agile Explorer

Timeline

Data Analyst

Mitie
2025.12 - Current

Jr. Machine Learning Engineer

Mitie
2024.04 - 2025.11

M.Sc - Data Science

University of the West of England
2023.01 - 2024.01

Delivery Analyst

Kyndryl (IBM Ind Pvt Ltd)
2021.01 - 2022.01

Reporting Analyst

IBM India Pvt. Ltd
2020.01 - 2021.01

Senior Associate

IBM India Pvt. Ltd
2017.01 - 2020.01

B.Sc - Software Systems

Sri Krishna Arts & Science College
2014.01 - 2017.01
SURABHI S