
Sowmya Batta

Milton Keynes

Summary

Results-driven Data Engineer with a proven track record at Hawk Sense, specializing in cloud migration and data integration. Expert in GCP and BigQuery; achieved a 95% reduction in manual QA effort through automated validation frameworks. Strong collaborator with a keen focus on data quality and business intelligence initiatives. Recent Data Analytics graduate with hands-on experience gained through academic projects and internships. Demonstrates strong teamwork, problem-solving, and time-management skills. Ready to return to work and make meaningful contributions with commitment and drive.

Overview

6 years of professional experience

Work history

Career Break
London, United Kingdom
11.2023 - Current
  • Dedicated time to full-time childcare following the birth of my child in November 2023 and to supporting family after a bereavement.
  • This period also allowed for personal and skills development through e-learning platforms.

Data Engineer

Hawk Sense
Hyderabad, India
07.2019 - 06.2022
  • Spearheaded the migration from an on-prem Hadoop ecosystem to GCP: rewrote Hive queries for BigQuery, moved data to GCS, and shifted Apache Spark processing to Dataproc. This involved in-depth analysis of existing data structures and gathering requirements for new cloud-based data integration initiatives.
  • Designed and developed automated frameworks using Python scripts to validate data pre- and post-processing, achieving a 95% reduction in manual QA effort and ensuring 99% data accuracy. This demonstrates a keen eye for data quality and its impact on business outcomes.
  • Designed and implemented a Medallion Architecture (Bronze, Silver, Gold layers) to streamline data ingestion, transformation, and consumption. This involved understanding diverse business needs for data layers to enable scalable and reliable data pipelines.
  • Implemented automated monitoring, alerting, and failure-handling mechanisms across all stages of the data pipeline to ensure reliability and minimise downtime, crucial for maintaining data integrity and availability for business users.
  • Collaborated effectively with the BI team to model data for reporting and dashboarding in Looker and Data Studio, directly supporting business intelligence initiatives and understanding user consumption patterns.

Education

Master of Science - Data Analytics

BPP University
London, United Kingdom

Bachelor of Technology - Computer Science and Engineering

St Ann's Engineer College
Chirala, India

Skills

  • Cloud Platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP)
  • Data Warehousing & Business Intelligence: BigQuery, Looker, Data Studio, Power BI
  • Data Engineering Tools: PySpark, Hive, Hadoop, Cloud Storage, Dataflow, Snowflake
  • Programming & Scripting: Python, SQL
  • Workflow Orchestration: Cloud Composer, Cron, Airflow
  • Version Control & CI/CD: Git, GitHub, Jenkins
  • Frameworks & Other Tools: Jupyter, Pandas, Docker, Terraform
  • Monitoring Tools: Stackdriver, Cloud Logging

ACADEMIC PROJECTS

Cloud Data Warehouse Modelling - 06/2023 - 09/2023

  • Developed frameworks in Python to extract data from open weather APIs and transform data using Pandas, NumPy, etc., demonstrating data sourcing and transformation capabilities essential for data integration.
  • Developed various PySpark jobs for data extraction and transformation and deployed them on GCP.
  • Created BigQuery tables and developed various automated scripts to load data into these tables, highlighting data modelling and loading processes for analytical use.
  • Created Looker dashboards connected to BigQuery to demonstrate various aspects of data, showcasing skills in data visualization and presenting insights to stakeholders.


Spotify Music Data Analysis - 02/2019 - 05/2019

  • Successfully completed a project utilising Python and SQL to analyse and visualise Spotify music data available via APIs, demonstrating strong analytical and problem-solving skills in a data context.
  • Created complex SQL queries to extract valuable insights, including trend analysis of song popularity, distribution of song features, and recommendation systems based on user preferences. This highlights an ability to extract actionable insights for business understanding.
  • Utilised tools such as Power BI to create interactive dashboards for visualisation purposes, and developed an automated framework in Python to convert different data formats to standard JSON format for processing. This experience is relevant to data delivery and reporting for business users.

RIGHT TO WORK - VISA STATUS

I have the right to work without any restrictions. I am currently on a Tier 2 Dependent Visa, so no sponsorship is required.
