Supan S - Lead Data Engineer

Supan S

location Ahmedabad, Gujarat
BOOK A 30 MIN CALL

Supan S

Lead Data Engineer

Supan S is a Lead Data Engineer with 10+ years of hands-on experience in backend data engineering and machine learning. He specializes in designing scalable ETL pipelines, integrating data from diverse sources, and delivering high-quality, reliable data solutions using tools like Airflow, Snowflake, and Databricks. With a strong foundation in cloud-based architectures and a collaborative mindset, he works closely with DevOps, QA, data scientists, and frontend teams to drive data excellence across projects.

  • With Bacancy Since: January 2022
  • No. of Projects: 15+ Projects
  • Language: English – Fluent | Hindi - Fluent | Gujarati - Fluent

Main Expertise

  • PythonPython 10 years
  • SnowflakeSnowflake 8 years
  • Azure Data FactoryAzure Data Factory 10 years
  • PySparkPySpark 10 years
  • DatabricksDatabricks 10 years
  • AWS Data EngineeringAWS Data Engineering 6 years
  • Google Cloud Data EngineeringGoogle Cloud Data Engineering 3 years
  • Microsoft AzureMicrosoft Azure 3 years
  • Big Data EngineeringBig Data Engineering 3 years
  • ETL PipelinesETL Pipelines 4 years
  • PostgreSQLPostgreSQL 2 years
  • Data LakehouseData Lakehouse 3 years
  • Data Warehouse DesignData Warehouse Design 3 years
  • TerraformTerraform 3 years
  • DockerDocker 4 years
  • Power BIPower BI 2 years
  • Apache SparkApache Spark 3 years

Other Skills

  • Apache AirflowApache Airflow 8 years
  • MLML 7 years
  • ScalaScala 6 years
  • Apache KafkaApache Kafka 2 years
  • Apache HiveApache Hive 2 years
  • HadoopHadoop 2 years
  • ELK StackELK Stack 2 years
  • StreamlitStreamlit 2 years
  • LookerLooker 1 years
  • ERPNextERPNext 2 years
  • Apache SupersetApache Superset 3 years
  • Alibaba CloudAlibaba Cloud 1 years
  • SQLSQL 3 years
  • JavaJava 1 years
  • KerasKeras 1 years
  • TypeScriptTypeScript 1 years
  • SupabaseSupabase 2 years
  • AkkaAkka 2 years
  • GitHubGitHub 5 years
  • LinuxLinux 5 years
  • WindowsWindows 5 years
  • MacOSMacOS 5 years
  • JiraJira 4 years
  • CI/CDCI/CD 3 years
  • FrappeFrappe 2 years

AI & Automation

  • CursorCursor 2 years
  • ChatGPTChatGPT 2 years
  • GeminiGemini 1 years
  • GitHub CopilotGitHub Copilot 2 years

Major Projects

AWS Glue Catalog Solution – Scalable Banking Data Architecture

Developed a cloud-native data pipeline architecture for a banking client, enabling automated ingestion, transformation, and reporting of quarterly datasets. Leveraging AWS Glue and S3, the platform delivers structured, partitioned data ready for business analytics via PowerBI. Infrastructure as Code (IaC) principles were applied using CloudFormation to ensure reliable deployments. The solution optimized reporting workflows and significantly reduced manual intervention in data preparation.

USA

Cloud & Data Engineering:

  • AWS Athena
  • AWS CloudFormation
  • AWS Glue
  • AWS Glue Catalog
  • AWS S3
  • Python

Business Intelligence:

  • PowerBI

Cloud & DevOps:

  • AWS CloudFormation
  • CloudFormation Templates
  • Scheduled Pipelines

Data Catalog & Governance Platform – Full-Stack Observability & Metadata Integration

Built a robust data observability and governance platform supporting 25+ databases and connectors, enabling comprehensive metadata extraction, quality checks, and query-driven insights. The system features in-built query support, 20+ data quality metrics, and automated ingestion pipelines. It facilitates seamless integration of on-prem and cloud data sources into a unified metadata layer, enhancing data trust and visibility across the enterprise.

Japan

Backend & Data Engineering:

  • Airflow
  • Apache Atlas
  • Java
  • MySQL
  • Python
  • REST APIs
  • SQL

DevOps & Infrastructure:

  • AWS CodeBuild
  • Docker
  • Terraform

Platform & Data Tools:

  • Airbyte
  • AWS Glue
  • Databricks
  • Lake Formation
  • Matillion
  • MSSQL
  • PostgreSQL
  • Snowflake

Snowflake Data Quality Native App – Streamlit-Powered Quality Governance Tool

Developed a native Snowflake application to streamline data quality and profiling processes using Streamlit and Snowpark. The app delivers automated data structure analysis, customizable quality rules, and integration with leading data validation libraries. Designed to embed directly within the Snowflake environment, it supports real-time quality monitoring with seamless user experience for analysts and engineers.

USA

Data Engineering & Platform:

  • Python
  • Snow-SQL
  • Snowflake
  • Snowpark
  • Streamlit

Data Quality & Profiling:

  • Great Expectations
  • Soda Core

Alibaba Cloud Telecom Data Solution – Scalable Analytics & Recommendation System

Architected an end-to-end data pipeline on Alibaba Cloud to process and analyze daily telecom data for business intelligence and personalized recommendations. Leveraged scalable cloud services for ingestion, transformation, and visualization. Integrated a machine learning–powered recommendation engine to enhance user engagement by suggesting optimal plans based on historical data patterns.

China

Analytics & BI:

  • PowerBI

AI/ML & Personalization:

  • Custom ML Algorithms
  • Historical User Behavior Modeling
  • Platform for AI

Google Cloud Marketing Data Analysis – Scalable Customer Intelligence Platform

Developed a marketing analytics platform on Google Cloud to unify campaign data, automate ETL workflows, and deliver actionable insights through advanced dashboards. The system supports pricing optimization and customer segmentation strategies by providing enriched, analysis-ready data to business and data science teams.

UK

Cloud & Data Engineering:

  • BigQuery
  • Google Cloud Storage
  • Matplotlib
  • NumPy
  • Pandas
  • Python

Analytics & AI/ML:

  • Vertex AI

AWS Cloud Data Migration & Database Redesign – Unified Data Platform Modernization

Led the redesign and migration of three legacy databases into a unified, scalable architecture on AWS Cloud. The solution improved data quality, enforced referential integrity, and enabled future-proof reporting and analytics with optimized schema design and automated pipelines.

USA

Cloud & Migration Stack:

  • AWS DMS
  • AWS Glue
  • AWS RDS
  • AWS S3
  • Python

Database & Transformation:

  • Data Cleansing Scripts
  • Data Profiling
  • Schema Normalization
  • SQL

Domain Worked In

  • Data engineeringData engineering
  • Data analyticsData analytics
  • Data observabilityData observability
  • Data QualityData Quality
  • Data CatalogData Catalog
  • TelecomTelecom
  • BankingBanking
  • MarketingMarketing
  • Business IntelligenceBusiness Intelligence
  • Cloud ComputingCloud Computing
  • Enterprise SoftwareEnterprise Software

Community Contributions

  • Organized 1st ever Serverless Community Day in Ahmedabad
  • Member of the Docker Community Ahmedabad, and organizing various community meetups
  • Member of AWS Community Ahmedabad, giving sessions on various AWS services of data engineering

Education

L.D College of Engineering (NAAC accredited A+)

Bachelor of Engineering focused on Computer Engineering

Achievements

HashiCorp Certified Terraform Associate

Team of the Quarter

2024

Employee of the Quarter

2025

Client Testimonials

Amanda P

Amanda P

Our data infrastructure needed a serious upgrade, and Supan delivered exactly that. Thanks to his expertise, our processing speeds improved dramatically, and the data quality is noticeably better across all teams.

Brian K

Brian K

When we faced challenges scaling our data systems, Supan stepped in with practical, effective solutions. He understood our business goals and designed architecture that supports growth without adding unnecessary complexity.

Megan L

Megan L

Supan automated many of our previously manual data workflows, saving hours of work every week and drastically reducing errors. His approach was thoughtful and aligned perfectly with our needs.

Ethan R

Ethan R

Migrating legacy systems to the cloud felt daunting until Supan took charge. He managed the entire transition smoothly, ensuring zero downtime and no disruption to our daily operations.

Hire Our Proficient Developers As Per Your Needs

Rajiv M - Lead AI/ML Expert

Rajiv M

Lead AI/ML Expert GET STARTED NOW
Nayan G - Lead AI/ML Expert

Nayan G

Lead AI/ML Expert GET STARTED NOW
Sagar M - Lead Data Engineer

Sagar M

Lead Data Engineer GET STARTED NOW

How Can We Help?