Deepak Soni — CTO
It’s not enough to have a vision. It’s not enough to have a strategy. You need scalable AI-ready infrastructure that delivers. I bring 20+ years of expertise architecting and deploying GPU-accelerated HPC and AI/ML environments, including end-to-end solutions for LLM workloads across public and hybrid cloud platforms. My specialties include high-performance computing (HPC), AI model infrastructure optimization, GPU/accelerator benchmarking, and AI cluster design — with deep exposure to data center, automotive, finance, and healthcare verticals. Key Achievements: • Managed the deployment of GPU infrastructure worth over €50 million for more than 25 European clients • Reduced costs by 40% by optimising OCI for automotive CAE workloads • Made sure that GenAI platforms that handle sensitive customer data were compliant with GDPR • Provided 99.9% uptime for mission-critical AI/ML workloads Core Expertise - AI/ML & HPC Infrastructure Architecture: Leading the design and implementation of large-scale, high-performance computing environments for AI/ML workloads. - Large-Scale GPU Cluster Design: Architecting and deploying multi-node GPU clusters using NVIDIA A100/H100 for GenAI and LLM training. - Cloud Solutions Architecture: Designing and implementing robust, scalable, and cost-effective cloud solutions on Oracle Cloud Infrastructure (OCI). - Performance Optimization & Tuning: Enhancing system performance for HPC and AI workloads through platform scalability and fine-tuning. Domain Knowledge - Automotive HPC: Computer-Aided Engineering (CAE), and simulation workloads. - Financial and defence domain: monte Carlo simulation, financial app orchestration. - Generative AI & NVIDIA Ecosystem: Deep expertise in GenAI platforms, LLM training, and leveraging the full NVIDIA AI software/hardware stack. Cloud & DevOps Technologies - Cloud Platforms: Oracle Cloud Infrastructure (OCI), Hybrid & Multi-Cloud Architecture - Containerization & Orchestration: Kubernetes, Docker - Infrastructure as Code (IaC): Terraform, Ansible - CI/CD & Monitoring: Git, Prometheus, Grafana - Scripting: Python, Shell HPC & Technical Skills - Workload Managers: Slurm, IBM LSF - Networking: RDMA (RoCE v2) - High-Performance Storage: Lustre, GPFS - Operating Systems: Linux
Stackforce AI infers this person is a Cloud Computing and HPC Infrastructure Architect with extensive experience in AI/ML solutions.
Location: Málaga, Andalusia, Spain
Experience: 23 yrs 4 mos
Skills
- Ai/ml & Hpc Infrastructure Architecture
- Cloud Solutions Architecture
- High Performance Computing (hpc)
Career Highlights
- 20+ years of expertise in HPC and AI/ML environments
- Managed €50 million GPU infrastructure deployment
- Achieved 99.9% uptime for mission-critical workloads
Work Experience
Oracle
AI Architect - AI Centre of Excellence (7 mos)
Principal Cloud Architect - AI Infra/GPUs and HPC (4 yrs 5 mos)
DXC Technology
HPC Consultant (2 yrs 3 mos)
Citi
HPC Analyst (2 yrs 2 mos)
Tata Technologies
Lead Solution Developer HPC (8 yrs 2 mos)
Sankalp Venture
Linux System Administrator (1 yr 1 mo)
Vindhya Institute of Technology & Science, Karhi Road, Amoudha, Satna-485441
Programmer (2 yrs 11 mos)
Freelance
Freelance Software Engineer (1 yr 8 mos)
Bhabha Atomic Research Centre (Div Of Department Of Automic Energy, Governm
Project Team Member (2 mos)
Education
Master of Computer Applications (MCA) at Rajiv Gandhi Prodyogiki Vishwavidyalaya
Bachelor of Science (BSc) at Govt. Autonomous P.G. College,Satna