
Aspiring Data Engineer with 2.5 years of experience supporting and monitoring enterprise ETL pipelines in production environments. Through hands-on involvement in incident management, root-cause analysis, data validation, and pipeline monitoring, I have developed a strong understanding of how large-scale data systems operate and support business-critical processes. Experienced in ensuring data reliability, maintaining SLA compliance, and troubleshooting data processing issues across cross-functional teams.
Completed an MSc in Big Data Technologies, where I strengthened my practical experience through hands-on technical projects involving data cleaning, transformation, processing, validation, and analysis of complex datasets. In addition to my academic experience, I have gained hands-on experience designing and implementing ETL pipelines using Azure Data Factory, Azure Databricks, PySpark, ADLS Gen2, Azure SQL, and Azure DevOps CI/CD pipelines through technical projects and self-directed learning. Combining production ETL support experience with hands-on data engineering projects has given me a strong understanding of the end-to-end data lifecycle.
Monitored Informatica & Azure pipelines
•Ensured timely completion of scheduled jobs and compliance with SLA targets through continuous monitoring, proactive issue resolution, and timely escalation.
Investigated pipeline failures
•Performed root-cause analysis and coordinated with development teams to resolve ETL failures and prevent disruption to business operations.
Performed data analysis and validation using SQL
• Analyzed and validated data using SQL, tracing data flows and investigating source to target mappings to support incident resolution and ETL pipeline troubleshooting.
Performed post-release ETL validation
•Conducted post-release validation of ETL pipelines to confirm successful releases and ensure successful execution of daily ETL jobs.
Supported critical marketing applications
• Managed and resolved production incidents for critical marketing applications, minimizing service disruptions and ensuring continuous business operations.
Managed production changes
• Coordinated OLM approvals, sent advance notifications to business stakeholders for critical marketing application changes, and conducted post-change validation to ensure successful deployments and minimize business disruption.
Managed ticket backlogs
• Generated daily backlog and SLA reports, helping teams prioritize unresolved incidents, meet SLA targets, and reduce the risk of SLA breaches.
Programming:
Python
PySpark
SQL
Azure:
Azure Data Factory
Azure Databricks
ADLS Gen2
Azure SQL Database
Azure Key Vault
Data Engineering:
Version Control & DevOps:
Git
Azure DevOps Pipelines
CI/CD
Machine Learning:
Synthetic Data Generation (GANs)
Differential Privacy