Sagar M - Lead Data Engineer

Sagar M

location Ahmedabad, Gujarat
BOOK A 30 MIN CALL

Sagar M

Lead Data Engineer

Sagar M is a Lead Data Engineer with 10+ years of experience designing scalable, cloud-native data pipelines using AWS, Airflow, Snowflake, and Databricks. He excels at building CI/CD-driven, production-ready systems with Docker and Terraform. With strong expertise in Python, SQL, and API integration, Sagar delivers clean, reliable, and future-ready data solutions that balance engineering precision with business impact.

  • With Bacancy Since: January 2024
  • No. of Projects: 10+ Projects
  • Language: English – Fluent | Hindi - Fluent | Gujarati - Fluent

Main Expertise

  • PythonPython 5 years
  • SQLSQL 5 years
  • AWSAWS 5 years
  • Microsoft AzureMicrosoft Azure 4 years
  • SnowflakeSnowflake 3 years
  • Apache AirflowApache Airflow 4 years
  • TerraformTerraform 3 years
  • TrinoTrino 3 years
  • PySparkPySpark 3 years
  • FastAPIFastAPI 2 years
  • DockerDocker 4 years
  • DatabricksDatabricks 2 years
  • Apache HiveApache Hive 2 years
  • Apache KafkaApache Kafka 3 years
  • Apache SparkApache Spark 3 years
  • RedshiftRedshift 2 years
  • SupabaseSupabase 1 years
  • TableauTableau 5 years
  • HadoopHadoop 5 years
  • StreamlitStreamlit 4 years

Other Skills

  • Apache SupersetApache Superset 3 years
  • MySQLMySQL 5 years
  • PostgreSQLPostgreSQL 5 years
  • Power BIPower BI 5 years
  • MLML 5 years
  • Aurora Aurora  2 years
  • React.jsReact.js 5 years
  • TypeScriptTypeScript 5 years
  • Lake FormationLake Formation 5 years
  • ElasticsearchElasticsearch 2 years
  • ScalaScala 2 years
  • GitLab CI/CDGitLab CI/CD 4 years
  • DBeaverDBeaver 3 years
  • GitHub ActionsGitHub Actions 4 years
  • Jupyter NotebookJupyter Notebook 4 years
  • ERPNextERPNext 2 years
  • FrappeFrappe 2 years
  • VS CodeVS Code 4 years
  • PostmanPostman 4 years
  • Agile WorkflowsAgile Workflows 4 years

AI & Automation

  • CursorCursor 2 years
  • GeminiGemini 3 years
  • PerplexityPerplexity 2 years
  • ChatGPTChatGPT 3 years

Major Projects

Scalable Financial Data Pipeline with Real-Time Reporting and Visualization

Built a fully automated, serverless ETL pipeline using AWS services (Lambda, S3, Glue, Athena) to process large-scale financial usage data across multiple sources. Integrated Amazon Redshift for optimized querying and Tableau dashboards for real-time business reporting. Set up CI/CD with GitLab and managed infrastructure using Terraform, ensuring scalable, low-latency performance with clean, production-ready code and high data reliability.

USA

Cloud & Data Services:

  • Amazon S3
  • Athena
  • AWS Lambda
  • Glue

Infrastructure & Automation:

  • GitLab CI/CD
  • Terraform

Programming & Query Languages:

  • Python
  • SQL

Databases & Storage:

  • S3 Bucket

Data Visualization:

  • Tableau Dashboards

AI:

  • ETL Pipelines
  • Glue Jobs
  • Lambda Functions

Centralized Data Observability Platform for Metadata Governance and Reliability

Designed and implemented core modules for a unified data observability platform, enabling metadata profiling, secure querying, and pipeline governance across 25+ data connectors. Built Airflow-based ingestion pipelines, integrated systems like Snowflake and Databricks, and developed Trino-FastAPI-powered query management. Streamlined deployment with Docker and AWS CI/CD, enforced licensing with Keygen, and maintained modular code while collaborating via GitHub and Jira for efficient product delivery.

USA

Cloud:

  • AWS

Core Stack:

  • Airflow
  • FastAPI
  • Python
  • SQL

Database:

  • MySQL
  • Snowflake

DevOps & Tools:

  • CI/CD
  • Docker

Snowflake Native Application for Real-Time Data Profiling and Quality Monitoring

Developed a Snowflake-native application from the ground up to perform real-time data profiling and rule-based quality checks within the Snowflake ecosystem. Leveraged Streamlit for UI and Snowflake for computation and storage. Designed a structured architecture for storing profiling results, rules, and execution logs. Enabled both manual and scheduled validations, integrated policy enforcement workflows, and built user-facing features for monitoring, customization, and seamless in-app governance.

USA

Platform:

  • Snowflake

Core Stack:

  • Python
  • SQL
  • Streamlit

Role-Based Health Analytics Dashboard for Injury & Recovery Insights

Designed and built a role-based, multi-client Tableau dashboard to track workplace injuries, treatment progress, and recovery bottlenecks. Integrated drill-down body map visualizations and dynamic time-based filters for detailed trend analysis. Developed ranking sheets highlighting top recovery barriers and optimizing user experience through stakeholder collaboration. Implemented secure access control and environment segmentation, ensuring scalable and compliant deployment.

Australia

Frontend:

  • HTML5 & CSS3

Backend:

  • Python
  • ServiceNow
  • SQL

Domain Worked In

  • Data engineeringData engineering
  • Data analyticsData analytics
  • Financial servicesFinancial services
  • Data observabilityData observability
  • Data QualityData Quality
  • HealthcareHealthcare
  • Business IntelligenceBusiness Intelligence
  • Cloud AutomationCloud Automation
  • Enterprise SoftwareEnterprise Software

Community Contributions

  • Active contributor to open-source projects in the AI/ML ecosystem

Education

Government Engineering College, Gandhinagar

Bachelor of Engineering focused on Instrumentation & Control

Client Testimonials

Ryan M

Ryan M

The health analytics dashboard Sagar delivered completely transformed how we track recovery outcomes. His data visualizations were intuitive, and the layered access controls gave our stakeholders confidence in the system’s integrity.

Emily C

Emily C

We needed someone to bridge the gap between advanced modeling and business outcomes. Sagar did exactly that. His forecasting work and automation solutions helped our team operate faster and smarter.

Jason C

Jason C

Sagar’s approach to solving our data quality challenges was both innovative and methodical. The custom-built profiling solution he developed within Snowflake now powers our core analytics pipeline.

Pamela N

Pamela N

We appreciated how quickly Sagar understood our domain and pain points. The dashboard he built for tracking injuries and recovery patterns is now a central part of how our leadership makes strategic decisions.

Hire Our Proficient Developers As Per Your Needs

Rajiv M - Lead AI/ML Expert

Rajiv M

Lead AI/ML Expert GET STARTED NOW
Nayan G - Lead AI/ML Expert

Nayan G

Lead AI/ML Expert GET STARTED NOW
Supan S - Lead Data Engineer

Supan S

Lead Data Engineer GET STARTED NOW

How Can We Help?