
Surya Gavini

London

Summary

Dynamic and results-driven data engineering professional with over 6 years of industry experience and a Master's degree in Data and Decision Analytics. Proficient in designing and optimizing data pipelines, data modeling, and transformation using Python, PySpark, Databricks, Snowflake, Snowpark, Streamlit, and Azure technologies. Skilled in deploying machine learning models and developing AI-driven solutions, with hands-on expertise in ETL tools, Unix shell scripting, SQL, and visualization platforms like Dash Plotly, Power BI, and Tableau. Demonstrates strong analytical, problem-solving, and leadership abilities, with a proven track record of delivering impactful solutions across Healthcare, Finance, Retail, and Geoscience domains. Seeking a challenging role in a fast-paced environment where my technical expertise and innovative mindset can drive organizational success and personal growth.

Overview

8 years of professional experience
1 Certification

Work History

Data Engineer

SDG groups UK
06.2022 - Current
  • CGG: Developed interactive dashboards using Python and Dash Plotly to visualize complex datasets, enabling data-driven decision-making for stakeholders.
  • Santen: Performed data engineering tasks using PySpark, Python, and Databricks to process and transform large-scale datasets, ensuring efficient data pipelines.
  • Yell: Designed and deployed a Streamlit application in a Snowflake environment using Python, delivering user-friendly data insights to business users.
  • Methods: Applied hands-on expertise in deploying machine learning models into Snowflake, streamlining model integration and performance monitoring.
  • Castore: Engineered a data warehouse using Matillion and Snowflake, optimizing data storage and retrieval for business intelligence applications.
  • Internal Project: Spearheaded the development of a chatbot proof of concept using Python, LangChain, and the Groq LLM, enabling query responses based on specific organization data to enhance data accessibility.
  • Demonstrated strong proficiency in supervised and unsupervised machine learning techniques, applying them to solve real-world business problems.

Data Engineering

Data Artisans Limited
08.2021 - 04.2022
  • Developed Python-based data processing pipelines using Pandas and SQL for Healthcare, Insurance, and Retail clients, ensuring efficient ETL workflows.
  • Supported multiple applications with Unix shell scripting, SQL, and ETL tools like Ab Initio Suite, delivering solutions for diverse business users.
  • Optimized ETL processes using Matillion and SQL, enhancing data integration for Banking and Geoscience projects with minimal latency.
  • Resolved technical bottlenecks through analytical problem solving, using Visual Studio Code and PuTTY to prevent issue escalation for clients.
  • Managed projects within Agile frameworks using Jira, coordinating with product owners and scrum masters to deliver solutions on time.
  • Facilitated change management for code promotion post-user acceptance testing, ensuring system integrity with Git for version control.
  • Mentored junior team members on Python and ETL tools, promoting collaboration and adaptability in high-pressure environments.
  • Applied domain expertise in Finance and Healthcare, using critical thinking to deliver tailored data solutions for client-specific needs.

Trainee Data Engineer

Commerz Bank
01.2017 - 08.2020
  • Engineered data pipelines using Azure Data Factory (ADF) and Databricks, automating real-time transaction processing for Banking operations.
  • Migrated on-premises SQL Server data to Azure SQL Database with SSIS, achieving zero data loss and modernizing legacy pipelines.
  • Developed financial dashboards in Power BI and Tableau, implementing Row-Level Security (RLS) for secure transaction monitoring and risk analytics.
  • Implemented automated data quality checks using SQL and Azure Synapse Analytics, ensuring compliance with financial regulations.
  • Collaborated with stakeholders using Microsoft Office and Azure DevOps, gathering requirements to align projects with Banking standards.
  • Utilized Python and Pandas for ad-hoc data analysis, supporting financial reporting and quantitative analysis for risk management.
  • Contributed to Agile project management, tracking progress with Jira and delivering solutions under strict regulatory deadlines.
  • Optimized ADF workflows, reducing processing time for customer data and improving operational efficiency.

Education

MSc - Project Management

BPP University
London, ENG
02.2022

Skills

  • Python
  • LangChain
  • Groq LLM
  • Power BI
  • Tableau
  • Microsoft Excel
  • Matplotlib
  • Seaborn
  • Dash Plotly
  • Snowflake
  • Azure SQL Database
  • Azure Data Lake
  • Azure Lakehouse
  • SQL
  • Matillion
  • SSIS
  • Microsoft Azure
  • Azure Data Factory
  • Azure Synapse Analytics
  • Azure Databricks
  • Azure Data Lake Storage
  • Logic Apps
  • Azure DevOps
  • AWS
  • Ab Initio Suite
  • Data Warehousing
  • Data Modeling
  • Data Analysis
  • PySpark
  • Spark SQL
  • Snowpark
  • Streamlit
  • REST API
  • Git
  • Visual Studio Code
  • Pandas
  • NumPy
  • scikit-learn
  • Supervised and Unsupervised Algorithms
  • Statistics
  • Quantitative Analysis
  • Jira
  • PuTTY
  • Microsoft Office
  • Strategic Planning
  • Analytical Problem Solving
  • Critical Thinking
  • Process Improvement
  • Innovation
  • Collaboration

Certification

  • Azure Data Engineering
  • Databricks Data Engineering Associate
  • Python Certification
