Lead Data Engineer
Supan S is a Lead Data Engineer with 10+ years of hands-on experience in backend data engineering and machine learning. He specializes in designing scalable ETL pipelines, integrating data from diverse sources, and delivering high-quality, reliable data solutions using tools like Airflow, Snowflake, and Databricks. With a strong foundation in cloud-based architectures and a collaborative mindset, he works closely with DevOps, QA, data scientists, and frontend teams to drive data excellence across projects.
Scala
6 years
Akka
2 years
Developed a cloud-native data pipeline architecture for a banking client, enabling automated ingestion, transformation, and reporting of quarterly datasets. Leveraging AWS Glue and S3, the platform delivers structured, partitioned data ready for business analytics via PowerBI. Infrastructure as Code (IaC) principles were applied using CloudFormation to ensure reliable deployments. The solution optimized reporting workflows and significantly reduced manual intervention in data preparation.
Built a robust data observability and governance platform supporting 25+ databases and connectors, enabling comprehensive metadata extraction, quality checks, and query-driven insights. The system features in-built query support, 20+ data quality metrics, and automated ingestion pipelines. It facilitates seamless integration of on-prem and cloud data sources into a unified metadata layer, enhancing data trust and visibility across the enterprise.
Developed a native Snowflake application to streamline data quality and profiling processes using Streamlit and Snowpark. The app delivers automated data structure analysis, customizable quality rules, and integration with leading data validation libraries. Designed to embed directly within the Snowflake environment, it supports real-time quality monitoring with seamless user experience for analysts and engineers.
Architected an end-to-end data pipeline on Alibaba Cloud to process and analyze daily telecom data for business intelligence and personalized recommendations. Leveraged scalable cloud services for ingestion, transformation, and visualization. Integrated a machine learning–powered recommendation engine to enhance user engagement by suggesting optimal plans based on historical data patterns.
Developed a marketing analytics platform on Google Cloud to unify campaign data, automate ETL workflows, and deliver actionable insights through advanced dashboards. The system supports pricing optimization and customer segmentation strategies by providing enriched, analysis-ready data to business and data science teams.
Led the redesign and migration of three legacy databases into a unified, scalable architecture on AWS Cloud. The solution improved data quality, enforced referential integrity, and enabled future-proof reporting and analytics with optimized schema design and automated pipelines.
L.D College of Engineering (NAAC accredited A+)
Bachelor of Engineering focused on Computer Engineering
HashiCorp Certified Terraform Associate
Team of the Quarter
2024
Employee of the Quarter
2025

Our data infrastructure needed a serious upgrade, and Supan delivered exactly that. Thanks to his expertise, our processing speeds improved dramatically, and the data quality is noticeably better across all teams.

When we faced challenges scaling our data systems, Supan stepped in with practical, effective solutions. He understood our business goals and designed architecture that supports growth without adding unnecessary complexity.

Supan automated many of our previously manual data workflows, saving hours of work every week and drastically reducing errors. His approach was thoughtful and aligned perfectly with our needs.

Migrating legacy systems to the cloud felt daunting until Supan took charge. He managed the entire transition smoothly, ensuring zero downtime and no disruption to our daily operations.