Alikhan Menseitov

WINCHESTER

Summary

Data Scientist with 2 years of industry experience using Python to extract insights from big data and visualising the results in Tableau. Proficient in setting up machine learning and ETL pipelines in production. Confident working both independently and collaboratively, with excellent communication skills.

Overview

3 years of professional experience

Work History

AI and Machine Learning Developer

KOIOS Master Data
07.2023 - Current
  • Implementing a combination of vector databases (Pinecone, Weaviate and PostgreSQL with pgvector) and Large Language Models (GPT-3.5 Turbo and open-source Hugging Face models) to automate the creation of an industrial concept dictionary using semantic search.
  • Preparing training data for fine-tuning Llama 2.
  • Created pipelines for onboarding manufacturers’ data and orchestrated them with Apache Airflow.
  • Visualising data and creating dashboards in Tableau, and presenting them to the board of directors to support key business decisions.
  • Managing a PostgreSQL database on an Azure server.

Data Analyst

Office For Students
09.2022 - 07.2023
  • Improved legacy SAS code and converted it to Python.
  • Identified a process that was causing a bottleneck and optimised it in Python by splitting the workload and processing it on 3 virtual machines simultaneously, which sped up the process by 30%.
  • Collaborated on setting up a Data Warehouse in Azure Databricks to streamline ETL pipelines.
  • Performed statistical analysis of universities’ financial returns in Python and visualised the results in Tableau.

Analyst Assistant

Office For Students
07.2021 - 07.2022
  • Improved a process by automating it with a combination of Python scripts, Task Scheduler and SAS code, which increased usability and reduced case lifetime by 60%.
  • Collaborated with colleagues on improving legacy code using Azure DevOps.
  • Loaded and pre-processed large datasets using a combination of SQL Server and SAS.
  • Performed statistical analysis for regular sector-wide publications using SAS.
  • Populated and managed databases in SQL Server, including writing complex queries to create views.
  • Implemented Azure Databricks cloud solutions and participated in developing a new structure of subscriptions and containers within it. Also gained experience in data loading, mounting and partitioning.
  • Developed presentation skills by presenting a newly developed solution to a directorate of over 70 people and presenting reports to the director of the department twice a month.
  • Maintained a Tableau dashboard used for monitoring a process and later replaced it with an enhanced Power BI dashboard.

Education

Bachelor of Science - Data Science And Analytics

University of Portsmouth
Portsmouth
06.2023

Foundation - Data Analytics

University of Plymouth
Plymouth
04.2019

Skills

  • Programming Languages: Python, SQL, R, SAS, C#
  • Automation Tools: Python, Shell Scripting, Bash, AutoSys
  • Visualisation Tools: Tableau, Power BI
  • Additional: Azure Databricks, Excel, Rust, Apache Airflow, Data Warehousing, Git, Azure DevOps, MATLAB
