Simran C - Senior Data Engineer

Simran C

location Ahmedabad, Gujarat
BOOK A 30 MIN CALL

Simran C

Senior Data Engineer

Simran is a Senior Data Engineer with 8+ years of experience building scalable, production-ready AI systems. She specializes in NLP, Computer Vision, and agent-based LLM workflows using tools like Python, TensorFlow, PyTorch, Hugging Face, and OpenAI. Simran has delivered impactful AI solutions across healthcare, finance, and document automation, translating complex business needs into intelligent, domain-specific applications with measurable value.

  • With Bacancy Since: January 2022
  • No. of Projects: 8+ Projects
  • Language: English – Fluent | Hindi - Fluent | Gujarati - Fluent

Main Expertise

  • DS PipelinesDS Pipelines 8 years
  • PythonPython 8 years
  • PySparkPySpark 7 years
  • SnowflakeSnowflake 6 years
  • Apache AirflowApache Airflow 6 years
  • DatabricksDatabricks 5 years
  • Apache AtlasApache Atlas 5 years
  • Azure SynapseAzure Synapse 5 years
  • Apache SparkApache Spark 3 years
  • Scikit-learnScikit-learn 5 years
  • PostgreSQLPostgreSQL 2 years
  • AWSAWS 5 years

Other Skills

  • NLPNLP 6 years
  • JuliaJulia 6 years
  • Neural NetworkNeural Network 5 years
  • TensorFlowTensorFlow 5 years
  • KerasKeras 3 years
  • PyTorchPyTorch 4 years
  • Apache SupersetApache Superset 3 years
  • Hugging FaceHugging Face 3 years
  • TransformersTransformers 3 years
  • LangChainLangChain 2 years
  • OpenAI GPTOpenAI GPT 3 years
  • Computer VisionComputer Vision 3 years
  • OCROCR 3 years
  • CUDACUDA 3 years
  • PyCharmPyCharm 3 years
  • GitHubGitHub 5 years
  • Jupyter NotebookJupyter Notebook 2 years
  • ScalaScala 2 years
  • SupabaseSupabase 1 years

AI & Automation

  • CursorCursor 2 years
  • GeminiGemini 2 years
  • ChatGPTChatGPT 2 years
  • PerplexityPerplexity 2 years

Major Projects

Intelligent Document Classification & Extraction Pipeline

Led the end‑to‑end development of an AI‑powered workflow for automatic document segmentation, classification, and data extraction. Implemented computer‑vision PDF splitting, integrated OCR with LLM‑driven field extraction, and added rigorous field‑level validation. Deployed the modular, scalable solution on Google Cloud Platform to support enterprise performance and easy future extensions.

Spain

Frontend:

  • FastAPI

Backend:

  • LangChain
  • Python

AI/ML/NLP:

  • Computer Vision
  • OCR
  • OpenAI GPT

Cloud Infrastructure:

  • GCP

AI Task Execution System with Agno Framework

Designed and developed a modular AI-powered task execution system using the Agno framework to automate daily productivity tasks via natural language commands. Architected multi-tool orchestration to enable seamless execution of actions like email handling, calendar management, shell operations, and data retrieval. Integrated specialized tools with a user-centric design, ensuring intuitive mapping from user input to backend operations. Rigorously tested for reliability, speed, and adaptability across diverse task scenarios.

USA

Frontend:

  • Natural language interface

Backend:

  • Agno framework

AI/ML/NLP:

  • multimodal workflow coordination
  • NLP

Integrations:

  • CalendarTools
  • EmailTools
  • MapTools
  • NewspaperTools
  • ShellTools
  • WikipediaTools
  • YFinanceTools
  • YouTubeTools

Design Anything – Advanced Image Editing System

Developed an AI-powered image editing platform enabling seamless content replacement within images. Designed the integration architecture combining Segment Anything Model (SAM) for precise segmentation with Stable Diffusion’s inpainting for high-quality visual edits. Focused on performance tuning, usability enhancements, and iterative improvements based on user feedback to deliver a robust and intuitive editing experience.

India

Backend:

  • Python

AI/ML:

  • Segment Anything Model (SAM)
  • Stable Diffusion

Image Processing:

  • OpenCV

Intelligent Context Manager for LLMs

Designed and built a Model Context Protocol (MCP) to dynamically manage and retrieve contextual memory for large language model applications. Implemented FAISS-based vector retrieval using embeddings, schema-driven validation with JSON Schemas, and modular pipelines for context ingestion, merging, and token overflow handling. Developed versioning and scalability features to support enterprise-level integration and personalized AI agent interactions.

Germany

Backend:

  • FAISS
  • PostgreSQL

AI/ML/NLP:

  • Embedding-based similarity search
  • LangChain
  • OpenAI GPT models

Data Management:

  • Context prioritization logic
  • JSON Schema
  • Version control

Domain Worked In

  • Document AutomationDocument Automation
  • NLP AutomationNLP Automation
  • Image ProcessingImage Processing
  • Multimodal AI agentsMultimodal AI agents
  • ShippingShipping
  • FinanceFinance
  • HealthcareHealthcare

Community Contributions

  • Active contributor to open-source projects in the AI/ML ecosystem
  • Experience taking part in hackathons and AI competitions, including an event organized at Bacancy
  • Contributor to open-source AI tools and frameworks
  • Active in prompt engineering and LLM tool benchmarking

Education

St. Xavier's College

MSc. Big Data Analytics

Client Testimonials

Ethan W

Ethan W

Simran brought technical clarity and innovation to our AI infrastructure. Her ability to design modular, LLM-integrated systems streamlined our internal tooling and improved productivity by over 40%. Highly professional and easy to collaborate with.

Maria S

Maria S

We were struggling with document automation at scale until Simran stepped in. She engineered a solution that intelligently classifies and extracts data from thousands of unstructured documents daily. Her attention to detail and system-level thinking are exceptional.

Jason B

Jason B

Simran’s implementation of a custom context management protocol for our LLMs significantly improved our financial chatbot's accuracy and contextual relevance. Her combination of strategic planning and deep technical skills is rare.

Ava T

Ava T

Simran helped us deploy a computer vision–based medical document parser that cut down our intake processing time by 60%. Her understanding of healthcare data, compliance, and model performance optimization made a critical impact on our outcomes.

Hire Our Proficient Developers As Per Your Needs

Rajiv M - Lead AI/ML Expert

Rajiv M

Lead AI/ML Expert GET STARTED NOW
Nayan G - Lead AI/ML Expert

Nayan G

Lead AI/ML Expert GET STARTED NOW
Sagar M - Lead Data Engineer

Sagar M

Lead Data Engineer GET STARTED NOW

How Can We Help?