

Senior Data Engineer with 7+ years of experience developing scalable and robust data engineering solutions on Microsoft Azure. Skilled in PySpark, Azure Databricks, Azure Data Factory (ADF), ADLS, Python, and SQL, with proven experience in building modern data pipelines, optimizing distributed data workloads, and supporting enterprise analytics initiatives. Worked across manufacturing, finance, retail, HR, supply chain, and e-commerce domains, delivering high-quality, business-focused data solutions with emphasis on scalability, reliability, automation, and data quality.
Project : Panda Restaurant Group (U.S.)
Domain: Retail | Supply Chain | Commercial Analytics
Technology Stack: Azure Synapse, Azure Databricks, PySpark, ADLS, Azure Logic Apps, Python, Claude Sonnet API, Parallel Processing, Databricks AI Functions, Databricks Genie
Responsibilities & Achievements:
1)Designed and developed a scalable data engineering solution to process store-level operational and commercial data from Azure Synapse using Azure Databricks and PySpark.
2)Built parallelized data processing workflows to efficiently handle large-scale store data and improve execution performance across multiple business units.
3)Integrated Claude Sonnet AI APIs with Databricks to generate automated store-level business insights by analyzing Actuals, Plan, and Previous Year YTD performance across key KPIs including revenue growth, operational efficiency, and cost trends.
4)Developed automated pipelines to orchestrate data extraction, AI inference, validation, and structured storage of generated insights in Azure Data Lake Storage (ADLS) for downstream reporting and analytics consumption.
5)Implemented automated data quality and validation checks using Databricks AI capabilities to compare AI-generated insights with source data and ensure consistency, accuracy, and reliability.
Defined validation thresholds and quality control mechanisms to ensure only verified insights were shared with business stakeholders.
6)Built end-to-end workflow automation using Azure Logic Apps to distribute validated store insights and recommendations to business teams through automated email notifications.
7)Collaborated with cross-functional stakeholders to support data-driven decision-making through scalable analytics and AI-enabled reporting solutions.
Project: Ardagh – Unified Manufacturing Data Integration (Europe)
Domain: Packaging (Metal & Glass)
Project: Coca-Cola Hellenic Bottling Company (CCHBC)
Domain: Beverage Manufacturing
Project: V360 – Supply Chain Visibility
Domain: Logistics & SCM
Project: SIRVIS – Seller & Operational Dashboard
Domain: E-commerce & Retail
Pyspark
Python
SQL
ADF
ADLS
Azure DevOps
ETL development
Big data processing
Spark framework
Microsoft Fabric
Databricks Genie
Logic Apps
CICD
Azure Cloud
Fractal Certified Data Engineer