S

Sandipan Basak

Senior Software Engineer

Bengaluru, Karnataka, India7 yrs 9 mos experience
Highly StableAI Enabled

Key Highlights

  • Expert in building scalable AI infrastructure and ML pipelines.
  • Proven track record in real-time feature development and optimization.
  • Strong leadership in cross-functional collaboration and project management.
Stackforce AI infers this person is a SaaS and AI-focused Fullstack Developer with strong expertise in MLOps and infrastructure.

Contact

Skills

Core Skills

Ai InfrastructureModel ServingMlopsFullstack DevelopmentDevopsSystems EngineeringTechnical DesignDeep LearningComputer VisionIot Development

Other Skills

AI AgentsAWS IoTAgent OpsAirflowAmazon Web Services (AWS)Android DevelopmentApache KafkaArtificial IntelligenceArtificial Neural NetworksBootstrap (Framework)CC++Cascading Style Sheets (CSS)Convolutional Neural Networks (CNN)Data Mining

About

Experienced AI Platform Engineer focused on enabling scale, reliability, and rapid iteration for ML & agentic systems. Over ~6 years, I’ve contributed to scalable model serving, orchestration, prompt governance, and observability via robust metrics, logging, evaluation, and validation. I deploy cloud-agnostic, Kubernetes-driven infrastructure, build AI gateway services, sdks and mcp services, delivering efficient, performance-oriented features for LLM-centric products. I thrive in growth-stage teams where platform foundations make the difference.

Experience

Autonomize ai

Senior Software Developer

Jun 2024Present · 1 yr 9 mos · Bengaluru, Karnataka, India · On-site

  • At Autonomize AI, I work as a core contributor on the platform team, building next-gen AI infrastructure, agentic systems, and intelligent automation pipelines from the ground up. My role spans across platform architecture, backend systems, and real-time ML deployment.
  • ⚙️ What I’ve Been Building:
  • 🚀 Agentic AI Frameworks
  • Co-designed and developed an event-driven agent orchestration system that powers 24/7 autonomous operations across distributed components.
  • 🧩 ML Pipelines & Automation
  • Built scalable pipelines for document intelligence (classification, extraction, enrichment) using Python microservices, async workflows, and Airflow DAGs.
  • 📦 Model Serving Infrastructure
  • Engineered Kubernetes-native model deployment infrastructure supporting 50+ live models with autoscaling and real-time inference.
  • 🌐 Multi-Region Infra & DR
  • Helped architect and deploy a resilient, multi-region Kafka streaming system with automated failover—supporting high availability for AI-powered search systems.
  • 📊 Observability & Governance
  • Set up complete observability stack using Prometheus, Grafana, and OpenTelemetry—improving reliability, traceability, and compliance across the platform.
  • 🤝 Collaboration & Design
  • Worked closely with the CTO, infra leads, and ML teams on systems design, architectural planning, and high-impact product launches.
  • 🧰 Tech Stack
  • Python · Kubernetes · Terraform · Kafka · FastAPI · Airflow · Prometheus · OpenTelemetry · TensorFlow Serving · GCP · PostgreSQL
PythonKubernetesTerraformKafkaFastAPIAirflow+7

Quizizz inc.

Fullstack Developer

Jul 2021Jul 2024 · 3 yrs · Bengaluru, Karnataka, India

  • My role encompassed leading a comprehensive Vue migration from an outdated MVC framework to a modern architecture, leveraging cutting-edge technologies like Pinia for state management and optimizing the initial load process. This transformation was aimed at enhancing performance, improving code maintainability, and facilitating a more agile development environment. I spearheaded the development of real-time features to enhance user interaction and engineered a sophisticated data response service to improve data handling efficiency. My efforts extended to refining store management practices and overseeing code quality through diligent PR reviews and bug resolution, significantly advancing the project's overall quality and efficiency.
  • Development Efficiency: Reduced development cycles by 2 days per engineer, directly translating into faster project turnaround times.
  • Performance Improvements: Achieved substantial enhancements in site performance metrics, including reducing the LCP from 11s to 5s and the FCP from 5s to 2s, which greatly improved UI/UX responsiveness and positively impacted site SEO and user engagement.
  • User Interaction: The introduction of real-time features and efficient server load management led to more responsive and engaging user interactions.
  • Data Handling: Through the use of advanced technologies for data aggregation and storage, I managed to significantly optimize API call efficiency and page responsiveness, which contributed to a more seamless user experience and supported the onboarding of over 20k new teachers.
  • Productivity and Code Quality: The adoption of Pinia for store management and the focused effort on PR reviews and bug fixes resulted in a marked reduction in development time and a 50% weekly decrease in bugs, enhancing productivity, system reliability, and accelerating the release of more robust and efficient software solutions.
Apache KafkaSQLDjangoDockerGrafanaNode.js+14

Vmware

Technical Staff Engineer

Jul 2019Jul 2021 · 2 yrs · Bengaluru Area, India

  • I managed VMware vSphere environments, emphasizing detailed Root Cause Analysis for vCenter and ESXi to improve performance. My optimization of the VMware infrastructure setup, including VM/resource allocation and automated certificate management, led to enhanced cloud integration and resource efficiency. Additionally, I enhanced vCenter and ESXi security and operational efficiency through better upgrade processes and advanced certificate management scripting, resulting in strengthened system security and streamlined maintenance processes.
  • Improved System Reliability: Reduced incident response times, enhancing reliability and uptime.
  • Enhanced Infrastructure Efficiency: Achieved better cloud integration and resource utilization, boosting performance.
  • Robust Security and Maintenance: Strengthened security and simplified processes, ensuring more efficient operations.
DockerPythonStorage VirtualizationShell ScriptingJavaScriptSystems Engineering+4

Eximius design

DL Intern

Oct 2018Jun 2019 · 8 mos · Bengaluru Area, India

  • I was instrumental in developing a cutting-edge end-to-end pipeline for employee facial recognition, encompassing everything from data collection to authorization. This project involved the enhancement of the RESNET model accuracy through meticulous fine-tuning with Transfer Learning techniques. Additionally, I pioneered the creation of an application designed for comprehensive facial data capture, utilizing OpenCV and MTCNN for data gathering, and subsequently trained the model on the YOLOv3 SSD network to improve angle detection capabilities. My work focused on optimizing deep neural network architectures and applying Transfer Learning with models like Inception-Net on custom datasets, including dog breeds and facial data, to push the boundaries of machine learning applications in authentication and recognition technologies.
  • Facial Recognition Accuracy: Through the fine-tuning of the RESNET model with Transfer Learning, I significantly enhanced the facial detection and recognition capabilities, achieving an 89% accuracy rate. This advancement notably refined the accuracy of employee authentication processes.
  • Advanced Detection Performance: The application developed for facial data capture, when trained on the YOLOv3 SSD network, attained an 80.19% mean Average Precision (mAP), outperforming both Faster R-CNN and YOLOv2. This breakthrough led to faster and more precise user authentication, showcasing the potential of optimizing deep neural network architectures for real-world applications.
Deep LearningImage ProcessingComputer VisionPandas (Software)PythonConvolutional Neural Networks (CNN)+3

Objectsol technologies pvt ltd

IoT Intern

Jun 2017Nov 2017 · 5 mos · Kolkata Area, India

  • I developed an IoT product that integrates sensors for humidity, temperature, soil moisture, pH, and nutrients, enabling precise agricultural management through wireless server connectivity. This solution provides real-time environmental monitoring, with data visualized on a website using Node-RED, InfluxDB, and Grafana for comprehensive insights. This streamlined approach enhances agricultural decision-making and resource optimization, significantly advancing crop management practices.
  • Water Usage Efficiency: By enabling precise irrigation based on real-time soil moisture and weather data, water usage has been reduced by up to 30%, conserving vital resources and reducing costs associated with over-irrigation.
  • Crop Yield Increase: The optimal environmental conditions maintained through the system have contributed to a 20% increase in crop yields, enhancing food production without additional land cultivation.
  • Reduction in Fertilizer Use: With nutrient sensors providing detailed soil chemistry data, fertilizer application has been optimized, leading to a 25% reduction in usage. This not only lowers the cost of inputs for farmers but also minimizes environmental pollution.
Node-REDGrafanaHTML5PythonJavaScriptMqtt+3

Education

Vellore Institute of Technology

Bachelor of Technology — Computer Science

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Ai Infrastructure & Model Serving

Explore similar profiles based on matching skills and experience