Results-driven Data Engineer with 8+ years of expertise building enterprise-scale data solutions for financial services organizations. Specialized in designing robust ETL pipelines using Ab Initio, AWS/Azure cloud platforms, and modern visualization frameworks (Tableau, Angular). Proven track record leading technical teams, optimizing data workflows, and implementing risk controls for regulatory compliance. Expert in Spark, Python, SQL, Snowflake, and Databricks with demonstrated ability to reduce costs while improving performance.
Overview
10 years of professional experience
Work history
Data Engineer
JPMorgan Chase & Co.
Glasgow
2023.06 - 2026.02
Migrated an existing process from on-premises infrastructure to AWS Cloud.
Troubleshot Delta Live Tables jobs.
Implemented a PoC for an Azure Databricks-based data lake.
Led a team of 2 junior data engineers, setting clear goals and mentoring them on technical challenges and career growth.
Designed ETL processes (PySpark, Databricks Workflows).
Created CI/CD processes for schema migrations, workflows, cluster pools, etc.
Developed and maintained data lakes and analytical platforms using Databricks on AWS and Azure, ensuring scalability, data security, and automation of infrastructure as code (IaC).
Implemented ETL pipelines using PySpark and Databricks.
Implemented a data pipeline for processing CSV files using S3, Lambda, Glue, QuickSight, Snowflake, EventBridge, and SNS.
Orchestrated the migration of legacy data warehouses to Databricks Lakehouse, resulting in a 60% reduction in infrastructure costs and a 3x improvement in query performance for business intelligence applications.
Designed and developed code, scripts, and data pipelines that leverage structured and unstructured data integrated from multiple sources.
Established and enforced data quality standards, security measures, and governance policies, identifying and mitigating risks across data lifecycle management.
Conducted comprehensive analysis of data from multiple internal and external sources to solve complex business problems and drive data-driven decision-making.
Developed responsive data monitoring dashboards using the Angular framework, integrating real-time pipeline metrics and data quality indicators for stakeholder visibility.
Built Ab Initio graphs for complex data integration workflows, processing 500GB+ of daily transaction data from multiple banking systems into a centralized data lake.
Created interactive Tableau dashboards for executive leadership, visualizing ETL pipeline performance, data lineage, and processing SLAs, reducing manual reporting effort by 40%.
Software Engineer
Tata Consultancy Services
Glasgow
2019.08 - 2023.05
Developed Spark jobs to convert CSV and text files into Parquet files using the Dataset API, then provided a single view of the classes by giving them a hierarchical structure using Scala implicits, case classes, etc.
Gathered and analyzed data from multiple channels.
Worked on CI/CD pipelines by configuring Jenkins jobs, GitLab, Maven, and Kubernetes.
Built and improved the file ingestion framework using Spark/Scala to process and transform data from multiple sources and file formats, and built bespoke solutions to manage the data.
Ingested batch-based customer details using Sqoop for both regulatory and non-regulatory processes.
Created Hive tables, loaded them with data, and wrote Hive queries.
Implemented partitioning, dynamic partitions, and buckets in Hive.
Provided support during design, implementation, testing, deployment and
Developed, tested and deployed Terraform scripts to create and manage infrastructure on AWS.
Developed, tested and deployed Terraform scripts to migrate data from DB2 to an S3 bucket.
Implemented and supported data ingestion from multiple sources, enriching the data using Spark/Scala and ingesting it into the data lake.
Contributed to the architecture, design, development, and data modelling for onboarding new datasets to AWS.
Designed and implemented Ab Initio ETL workflows for batch processing customer transaction data, achieving 99.9% data accuracy for regulatory reporting.
Developed Tableau visualizations for business stakeholders to monitor data pipeline health, ingestion volumes, and processing times across 15+ data sources.
Created Angular-based internal tools for data engineers to track job status, monitor cluster utilization, and trigger manual data refreshes.
DevOps Engineer
LiveVox Solutions Pvt Ltd
Bangalore
2015.12 - 2019.07
Successfully deployed 28 applications across pre-production and production environments with zero downtime.
Worked on a Spark PoC and developed code to process structured data using Spark RDDs and Spark SQL.
Built sample ETL processes in Spark to transform semi-structured data into structured datasets.
Automated the deployment of an application into test and pre-prod environments using shell scripting.
Built a Jenkins CI/CD pipeline for builds and deployments.
LiveVox is a true omnichannel platform that offers customers a fully integrated suite of communication channels as well as reporting capabilities.
Education
Bachelor of Technology (B.Tech) - Computer Science