Data Engineer With Specialization In Insurance, Healthcare And Automobile Domains
Summary
Software professional with 5 years of experience in data-driven engineering and analysis. Well versed in data modelling, ETL processes, database management, and data integration.
Proficient in curating, manipulating, refining, and integrating heterogeneous data sets using Python, SQL, and Spark.
Knowledge of Azure Databricks for designing, implementing, and optimizing scalable data processing workflows, ensuring efficient and reliable data pipelines in Azure Data Factory.
Expertise in data warehousing concepts, including relational and non-relational databases.
Knowledge of cloud-based data storage in Azure Data Lake Gen2.
Strong data analysis skills, extracting actionable insights from complex datasets using statistical methods, SQL queries, and data visualization tools like Power BI.
Transformed raw data into analytics-ready data sets using dbt (data build tool) by defining and executing SQL-based transformations.
Basic knowledge of implementing CI/CD pipelines to automate build, test, and deployment processes, reducing manual errors and accelerating software delivery.
In-depth knowledge of Agile methodologies, contributing to successful project execution in Agile environments.
Strong experience in the healthcare domain, with 2.2 years of working experience.
Strong experience in the insurance domain, with 2 years of working experience.
Good expertise in the automobile domain, with 9 months of working experience.
Overview
5
years of professional experience
Work History
Data Engineer
Capgemini Technology Services
05.2021 - 01.2022
Experience in designing, implementing, and building large data pipelines that clean, transform, and aggregate data from various sources.
Work closely with data science and business intelligence teams to develop data models and enable more effective strategic, tactical, and operational insights and decision-making to drive significant business impact.
Experience in developing Spark applications using Spark SQL and PySpark in Databricks for data extraction, transformation, and aggregation from multiple data sources, to uncover insights into customer usage patterns.
Experience in writing Spark UDFs in PySpark to handle specific business requirements, and in improving performance through code optimization and tuning.
Expertise in integrating, transforming, and consolidating data from various structured and unstructured data systems into structures that are suitable for building analytics solutions in Azure Data Platform.
Helping stakeholders understand the data through exploration, and building and maintaining secure and compliant data processing pipelines using Azure tools (Azure Data Factory, Azure SQL, Azure Functions, and Azure Storage/Data Lake) and Delta Lake.
Client: Stellantis Europe
Data Engineer
Cognizant Technology Solutions
04.2017 - 05.2021
Responsible for gathering business requirements and designing data products.
Responsible for designing and implementing multiple ETL data pipelines using Python/Spark in Azure Databricks, and on traditional VMs using regular Python and SQL.
Utilize Apache Spark with Python, along with SQL queries, views, and stored procedures, to perform data transformations, including cleaning, filtering, aggregating, and joining datasets.
Designing and building data models in SQL and Python, using the snowflake schema.
Responsible for curating raw tables into meaningful data that meets business requirements for various use cases and KPIs.
Implement data validation and quality checks to identify and rectify inconsistencies or anomalies in the data.
Responsible for performance tuning, optimizing Spark jobs and SQL queries for faster execution and better resource utilization.
Expose the transformed data to Azure Synapse and use the data mart for data analysis.
Testing data pipelines and data transformations so that they are delivered free of bugs.
Experienced in Azure DevOps for use case and task management.
Major Clients: Geisinger Health Plan, Health Alliance Plan Michigan, State Farm.
Design, Development and Implementation at Capgemini Technology Services India Limited