Cloud-native data platforms

Dhanesh Sarpale

I design scalable data pipelines and cloud data platforms as a Senior Data Engineer

Senior Data Engineer with 7+ years of experience building scalable, reliable data platforms across banking, healthcare, and global technology organisations including Bank of Ireland, Amgen, and Meta. Specialized in Azure, AWS, Databricks, Spark, PySpark, Python, ETL pipelines, and business intelligence.

Illustrated portrait placeholder for Dhanesh Sarpale
Azure + PySpark
FastAPI Data Services
0 Years Experience
0 Projects
0 Certifications
0 Companies Worked
What I Do

Data engineering that behaves well under pressure.

I design, build, and operate data platforms where clean architecture, production quality, and stakeholder usefulness meet. My work spans ingestion, transformation, APIs, analytics, governance, and release reliability.

Skills

Engineering toolkit

Programming

Python SQL PySpark FastAPI

Cloud

Azure AWS Databricks Cloudera

Big Data

Hadoop Hive HiveQL Spark

Storage

ADLS Blob Storage S3 Redshift

Processing

Databricks ADF PySpark

Databases

Oracle Teradata Presto Delta Lake

DevOps

Git Docker Azure DevOps Jenkins

Visualization

Power BI Tableau

Testing

Pytest Zephyr

Selected Work

Featured data products

All projects
BA

Banking Risk Data Lake

A governed banking data lake for risk analytics with curated zones, lineage, controls, and high-trust reporting tables.

AzureADLSADFPySpark
GitHub
HE

Healthcare ETL Framework

Reusable ingestion and transformation framework for healthcare data products with validation, monitoring, and restartable jobs.

PySparkDatabricksSQLDelta Lake
GitHub
GL

Global Tech Analytics Pipelines

Distributed analytics pipelines supporting operational and strategic decision-making in a large-scale technology environment.

SparkHiveHiveQLPython
GitHub

Let's Connect

Open for conversations about senior data engineering, cloud and on-prem platforms, and analytics engineering.

Start a Conversation