Summary
Overview
Work history
Skills
Websites
Certification
Timeline
Generic

Siddhartha Muvva

Milton Keynes,Buckinghamshire

Summary

Experienced Azure Data Engineer/ Big Data Developer with 5+ years of experience in designing, implementing, and maintaining data solutions on the Azure platform. Proficient in leveraging Azure services such as Azure Data Factory, Azure Databricks, Azure Synapse Analytics, and Azure Cosmos DB to develop scalable and efficient data pipelines. Skilled in data modeling, ETL/ELT processes, data warehousing, and performance tuning.

Overview

7
7
years of professional experience
1
1
Certification

Work history

Data Engineer

EPAM Systems
Hyderabad
05.2022 - Current
  • Walgreens is a healthcare organization, focused on improving health outcomes and reducing aggregate costs
  • Implemented ADF pipeline and Databricks Notebooks to read data from multiple sources apply transformations and business logic and insert into delta lake tables
  • Optimized long-run jobs to reduce the cluster run time by 20-30 %
  • Implemented Report Files Encryption and Decryption using PGP encryption
  • Analyzed different aspects of data, fixed the issues, and improved the performance of data fetch jobs from 45 to 20 minutes
  • Orchestrated the End-to-end Data ingestion process using Azure Data Factory
  • Conducted performance tuning and optimization to enhance data processing efficiency
  • Orchestrating the Spark Structured Streaming jobs in Azure Databricks with dependencies
  • Identified and addressed ADF bottlenecks, orchestrated ETL workflows, and reduced latency
  • Achieved a 15% improvement in overall data processing efficiency.

Data Engineer

Capgemini
Hyderabad
03.2021 - 04.2022
  • Unilever PLC is an FMCG Company focused on product sales and revenue
  • Implemented the Data Quality Framework on the source files and sending email notifications, it increased the Data Quality by 70%
  • Implementing all kinds of transformation with the help of Spark Scala and Spark SQL in Azure data bricks notebooks
  • I have dealt with multiple sources such as databases, Flat files, Parquet files, and ADLS and loaded the data in the form of parquet files into ADLS.

Data Engineer

Tata Communications and Transformation Services
Hyderabad
12.2019 - 02.2021
  • Airtel is a Telecom organization focused on Customer Experience
  • Automated ETL processes, making it easier to wrangle data and reducing time by as much as 40%
  • Increased the efficiency of the data fetching by approximately 30% using query optimization and indexing
  • Extract Transform and Load data from Source Systems to Azure Data Storage services using Azure Data Factory.

Big Data Developer

Vodafone Idea
Hyderabad
01.2018 - 12.2019
  • Vodafone is a telecom company focused on Revenue and products
  • Developed Sqoop jobs to pull the data from MYSQL - RDBMS and load it to HDFS
  • Worked on Performance tuning on different applications in Hive QL and Spark SQL & dataframe
  • Perform root-cause analysis of any issues post-implementation and work on solutions related to issue fixing.

Skills

  • Git Hub
  • IntelliJ IDEA
  • Azure Devops
  • Jira
  • Confluence
  • Python
  • SQL
  • PySpark
  • HiveQL
  • Spark Scala
  • Spark SQL
  • Azure Data Factory
  • Azure Databricks
  • ADLS
  • Azure EventHub
  • Azure Cosmos DB
  • Delta lake
  • Hadoop
  • Hive
  • Sqoop
  • HDFS
  • Spark

Certification

  • Databricks Certified Data Engineer Associate, Databricks
  • Azure Data Engineer Associate: DP-203, Microsoft Azure

Timeline

Data Engineer

EPAM Systems
05.2022 - Current

Data Engineer

Capgemini
03.2021 - 04.2022

Data Engineer

Tata Communications and Transformation Services
12.2019 - 02.2021

Big Data Developer

Vodafone Idea
01.2018 - 12.2019
  • Databricks Certified Data Engineer Associate, Databricks
  • Azure Data Engineer Associate: DP-203, Microsoft Azure
Siddhartha Muvva