Deepak Soni

CTO

Málaga, Andalusia, Spain23 yrs 4 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • 20+ years of expertise in HPC and AI/ML environments
  • Managed €50 million GPU infrastructure deployment
  • Achieved 99.9% uptime for mission-critical workloads
Stackforce AI infers this person is a Cloud Computing and HPC Infrastructure Architect with extensive experience in AI/ML solutions.

Contact

Skills

Core Skills

Ai/ml & Hpc Infrastructure ArchitectureCloud Solutions ArchitectureHigh Performance Computing (hpc)

Other Skills

ANSYSAbaqusActive ListeningAnsibleAnsysArtificial Intelligence (AI)AutomationBackup & Recovery SystemsCUDACloud AdministrationCloud SecurityCloud-Native ArchitectureClusterCommunicationContinuous Improvement

About

It’s not enough to have a vision. It’s not enough to have a strategy. You need scalable AI-ready infrastructure that delivers. I bring 20+ years of expertise architecting and deploying GPU-accelerated HPC and AI/ML environments, including end-to-end solutions for LLM workloads across public and hybrid cloud platforms. My specialties include high-performance computing (HPC), AI model infrastructure optimization, GPU/accelerator benchmarking, and AI cluster design — with deep exposure to data center, automotive, finance, and healthcare verticals. Key Achievements: • Managed the deployment of GPU infrastructure worth over €50 million for more than 25 European clients • Reduced costs by 40% by optimising OCI for automotive CAE workloads • Made sure that GenAI platforms that handle sensitive customer data were compliant with GDPR • Provided 99.9% uptime for mission-critical AI/ML workloads Core Expertise - AI/ML & HPC Infrastructure Architecture: Leading the design and implementation of large-scale, high-performance computing environments for AI/ML workloads. - Large-Scale GPU Cluster Design: Architecting and deploying multi-node GPU clusters using NVIDIA A100/H100 for GenAI and LLM training. - Cloud Solutions Architecture: Designing and implementing robust, scalable, and cost-effective cloud solutions on Oracle Cloud Infrastructure (OCI). - Performance Optimization & Tuning: Enhancing system performance for HPC and AI workloads through platform scalability and fine-tuning. Domain Knowledge - Automotive HPC: Computer-Aided Engineering (CAE), and simulation workloads. - Financial and defence domain: monte Carlo simulation, financial app orchestration. - Generative AI & NVIDIA Ecosystem: Deep expertise in GenAI platforms, LLM training, and leveraging the full NVIDIA AI software/hardware stack. Cloud & DevOps Technologies - Cloud Platforms: Oracle Cloud Infrastructure (OCI), Hybrid & Multi-Cloud Architecture - Containerization & Orchestration: Kubernetes, Docker - Infrastructure as Code (IaC): Terraform, Ansible - CI/CD & Monitoring: Git, Prometheus, Grafana - Scripting: Python, Shell HPC & Technical Skills - Workload Managers: Slurm, IBM LSF - Networking: RDMA (RoCE v2) - High-Performance Storage: Lustre, GPFS - Operating Systems: Linux

Experience

Oracle

2 roles

AI Architect - AI Centre of Excellence

Aug 2025Present · 7 mos · Málaga, Andalusia, Spain · Remote

  • scoping and discovery session with the customers to figure out the use cases and design the solution and the end-to-end architecture, Define enterprise reference architectures, performance baselines, and optimization for GPU-accelerated GenAI/HPC on OCI. Architect and operate GPU-accelerated Gen AI and HPC/AI platforms on OCI (Kubernetes/OKE plus Slurm & PBS Pro), performance engineering & benchmarking, CUDA/NCCL micro-benchmarks, optimize GPU utilization and throughput, Enable distributed training & inference for LLMs. Build reusable IaC blueprints (Terraform/Resource Manager, Helm, OCI DevOps/OCIR) for rapid GPU cluster deployment. Partner with automotive CAE/simulation teams to map CFD/FEA/crash workloads to optimal shapes/schedulers
AI/ML & HPC Infrastructure ArchitectureCloud Solutions ArchitecturePerformance Optimization & Tuning

Principal Cloud Architect - AI Infra/GPUs and HPC

Feb 2021Jul 2025 · 4 yrs 5 mos · Málaga, Andalusia, Spain · Remote

  • Main activities and responsibilities:
  • Cloud Architect – AI/ML/HPC (Oracle Cloud Infrastructure) with extensive experience designing and deploying high-performance, cloud-native AI and HPC solutions across multiple industries.
  • Key Contributions:
  • Architected and delivered OCI-based AI/ML and HPC solutions tailored for clients across various sectors.
  • Led GenAI and machine learning proof-of-concepts, leveraging OCI Generative AI services, Data Science platform, and GPU-accelerated compute.
  • Designed and deployed RDMA-optimized HPC clusters using Slurm and NVIDIA GPUs for compute-intensive AI and CAE workloads.
  • Actively supported onshore LIFT engagements in Spain and EMEA, collaborating with customer infrastructure and cloud adoption teams.
  • Built and delivered customer-ready Oracle AI services, ensuring optimal performance on OCI's high-end GPU and AI infrastructure.
  • Developed enablement material, including blog posts, reference architectures, and Terraform automation scripts for deployment and reproducibility.
  • Led cloud migration projects for AI/HPC workloads, focusing on performance tuning, scalability, and hybrid integration.
  • Served as technical advisor and internal mentor, guiding teams on AI platform architecture, OCI best practices, and multi-region HPC design.
Oracle CloudArtificial Intelligence (AI)High Performance Computing (HPC)Cloud Solutions ArchitectureAI/ML & HPC Infrastructure Architecture

Dxc technology

HPC Consultant

Oct 2018Jan 2021 · 2 yrs 3 mos · Bangaluru · Hybrid

  • HPC and emerging technologies consultant supporting scientific computing across financial services, aerospace, and automotive industries throughout Europe and UK
  • Key Achievements: • €8M+ Project Portfolio: Delivered HPC infrastructure for 15+ European automotive and aerospace clients • Operational Excellence: Achieved 99.5% uptime for hybrid cloud HPC environments • Automation Success: Reduced cluster provisioning time by 70% through Ansible and Docker automation • Performance Optimization: Improved simulation workflow efficiency by 45% using Intel Cluster Checker • Hybrid Cloud Integration: Enabled seamless AWS/GCP integration for 5+ clients
  • European Market Focus: • Led GDPR-compliant data processing implementations for German automotive clients • Delivered hybrid cloud solutions meeting data residency requirements across UK and EU • Managed cross-cultural teams spanning UK, Germany, Netherlands, and France
HPCAutomationPerformance OptimizationHigh Performance Computing (HPC)

Citi

HPC Analyst

Aug 2016Oct 2018 · 2 yrs 2 mos · Pune Area, India · Hybrid

  • HPC Engineer supporting Financial Engineering Research Group, enabling high-performance simulation and analytics platforms for real-time financial trading and risk modeling supporting European markets.
  • Key Achievements: • Mission-Critical Support: Maintained 99.9% uptime for trading systems supporting €500M+ daily European volumes • Performance Improvement: Reduced simulation runtime by 60% through hardware optimization • Compliance Excellence: Ensured MiFID II compliance for algorithmic trading systems
HPCFinancial EngineeringSimulationHigh Performance Computing (HPC)

Tata technologies

Lead Solution Developer HPC

Jun 2008Aug 2016 · 8 yrs 2 mos · Pune Area, India · On-site

  • HPC operations lead managing in-house computing facility supporting Computer-Aided Engineering (CAE) Research Group. Focused on optimizing simulation workflows for automotive R&D and virtual testing.
  • Key Achievements: • Infrastructure Management: Led 200+ node heterogeneous HPC cluster supporting 500+ engineers • Application Performance: Optimized CAE applications (LS-DYNA, Abaqus, Ansys) achieving 30% performance gains • Custom Solutions: Developed CAE job submission portal integrated with PBS Pro, improving user productivity by 50% • Cost Efficiency: Implemented resource utilization tracking reducing infrastructure costs by 25%
Team ManagementHPC OptimizationHigh Performance Computing (HPC)

Sankalp venture

Linux System Administrator

Mar 2007Apr 2008 · 1 yr 1 mo · PUNE · On-site

  • Administered and maintained enterprise-level Linux infrastructure, ensuring optimal performance and security.
  • Lead a team of 5 developers in designing and implementing dynamic web projects for Indian Express news portals.
  • Executed end-to-end network troubleshooting, enhancing server uptime and reliability across multiple platforms.
Linux AdministrationNetwork Troubleshooting

Vindhya institute of technology & science, karhi road, amoudha, satna-485441

Programmer

Mar 2004Feb 2007 · 2 yrs 11 mos · SATNA M.P · On-site

  • Developed and delivered comprehensive coursework in core computer science subjects, enhancing student engagement and understanding.
  • Conducted hands-on lab sessions in Unix/Linux environments, ensuring practical application of theoretical concepts.
  • Mentored B.E. students on real-world projects, fostering problem-solving skills and professional readiness.

Freelance

Freelance Software Engineer

Jun 2002Feb 2004 · 1 yr 8 mos

  • i was a Freelance Software Engineer and did lot of small projects for institution related requirements

Bhabha atomic research centre (div of department of automic energy, governm

Project Team Member

Mar 2002May 2002 · 2 mos · India

  • At Bhabha Atomic Research Centre, I contributed to the development of a department-wide Personnel Database System, focusing on maintaining accurate employee and trainee records. My role involved designing a secure database and creating an intuitive GUI, which significantly improved data accessibility. I also implemented various database functionalities, enhancing overall system performance and reliability.

Education

Rajiv Gandhi Prodyogiki Vishwavidyalaya

Master of Computer Applications (MCA) — Computer Science

Jan 2000Jan 2002

Govt. Autonomous P.G. College,Satna

Bachelor of Science (BSc)

Jan 1995Jan 1998

Stackforce found 100+ more professionals with Ai/ml & Hpc Infrastructure Architecture & Cloud Solutions Architecture

Explore similar profiles based on matching skills and experience