Nestor Colt

Founder

Barcelona, Catalonia, Spain7 yrs 5 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in deploying large-scale AI platforms on Kubernetes.
  • Proven track record in cost optimization and governance.
  • Strong background in Python automation and CI/CD.
Stackforce AI infers this person is a Cloud Architect specializing in AI and automation for enterprise solutions.

Contact

Skills

Core Skills

Cloud InfrastructureArtificial Intelligence (ai)ArchitectureDevops

Other Skills

Platform ArchitectureTechnical Project LeadershipPythonKubernetesAI GovernanceAutomationWorkflow DesignAPI IntegrationREST APIsIT ManagementLarge Language Models (LLM)Project ManagementLinuxServerless ComputingC#

About

Cloud Architect and AI Platform Engineer specialized in AWS-native GenAI, automation, and large-scale Kubernetes deployments. I design and implement resilient, cost-optimized cloud systems—bridging LLM infrastructure, orchestration, and governance. I’ve led enterprise initiatives building LLM and automation platforms (Onyx, LiteLLM, n8n, ChatGPT MCPs) across Kubernetes and AWS, with GPU-enabled clusters, Bedrock inference profiles, and company-wide automation frameworks. My background spans Python automation, CI/CD design, Helm/ArgoCD delivery, and multi-account AWS architecture, alongside LLMOPS practices: lifecycle management, fine-tuning, RAG integration, and operational observability. Focused on pragmatic, production-grade AI systems that balance performance, governance, and reliability—turning complex cloud and AI infrastructure into maintainable, automated platforms.

Experience

7 yrs 5 mos
Total Experience
11 mos
Average Tenure
1 yr 8 mos
Current Experience

The cloud clockwork

Solutions Architect | Founder

Oct 2024Present · 1 yr 8 mos · European Union · Remote

  • Onyx (Enterprise Gen AI Platform) – Deployed the full platform to production on Kubernetes with multi-node workers, scaling, and operational runbooks. Led enterprise onboarding with documentation and tutorials.
  • LiteLLM gateway – Centralized LLM access through automated key provisioning. Designed the workflow that generates virtual keys per team from a request form with usage tracking and quotas.
  • AWS Bedrock governance – Implemented inference profiles and tagging by team and business unit for cost tracking and accountability. Established basic guardrails and model routing policies.
  • ChatGPT Enterprise MCPs and internal GPTs – Created custom connectors for Jira, Confluence, and Backstage. Built internal GPTs exposing REST APIs for integration with internal systems.
  • n8n (company-wide LLM automations) – Deployed a multi-node Kubernetes cluster for large-scale workflow execution. Authored starter templates, documentation, and onboarding materials to drive adoption.
  • AKS GenAI enablement – Enabled GPU-based workloads across clusters and accounts to support LLM/ML inference. Standardized Helm deployments for LLM services and jobs.
  • Backups and DR – Implemented Velero for cluster and namespace backups with documented recovery procedures and tested restores.
  • Observability – Built Datadog dashboards for LLM gateways, GPU utilization, and performance metrics.
  • AI governance – Established practical processes for request handling, access issuance, tagging, and usage visibility across AI platforms.
  • Cost optimization – Introduced tag-based showback via Bedrock profiles and LiteLLM usage metrics; provided guidance on right-sizing and off-peak scheduling.
Platform ArchitectureCloud InfrastructureTechnical Project LeadershipPythonArtificial Intelligence (AI)

Clarivate

2 roles

Lead Cloud Engineer

Promoted

Dec 2023Apr 2026 · 2 yrs 4 mos

  • Organizational tagging strategy: optimized cost allocation (30%), enhanced resource recycling (25%), established chargeback models
  • AI agentic solutions and DAG orchestration (AutoGen, Llama-Index, LangChain): 40% boost in process automation
  • LLM expertise: training, fine-tuning (35% performance improvement), prompt engineering (28% accuracy increase), RAG (45% reduction in hallucinations), PEFT (20% efficiency optimization)
  • Tool consolidation and solution design: 22% cost reduction.
  • Multi-account compliance setup: 99.9% adherence to regulatory standards
  • Enterprise architecture for generative AI governance: 100% alignment with corporate policies and ethical guidelines
  • LLM-Ops platform development: 50% streamlined model deployment, 40% reduction in time-to-production
DevOpsIT ManagementArtificial Intelligence (AI)Large Language Models (LLM)ArchitectureProject Management

Senior Cloud Engineer

Jun 2022Dec 2023 · 1 yr 6 mos

  • Lead event-driven pipelines for infrastructure management + 300 AWS accounts (99.9% uptime achieved)
  • IAM lifecycle for all the accounts in the ORG (100% compliance rate maintained)
  • OpenSearch solution improving a complex company-wide logging platform (60% faster ingestion time)
  • Sustainability and cost optimization (15% reduction in cloud spending)
  • Solutions architecture and technical implementation (40% increase in system efficiency)
  • Fargate and container orchestration (70% improvement in resource utilization)
  • LeadBase AMI / Golden AMI CICD pipeline (80% faster deployment cycles)
  • Enforce, manage, and integrate cyber security products for VM and CIS.
DevOpsIT ManagementArchitectureLinuxServerless Computing

Outsystems

DevOps Engineer

Sep 2021May 2022 · 8 mos · Lisbon, Portugal · Remote

  • CICD Pipelines for Microservices
  • Azure, ADO, AWS, ACR, ECR, EKS
  • Event-driven solutions
  • Kubernetes administration
  • Operators for Kubernetes deployment and administration
  • Serverless for Kubernetes with OpenFaaS and NATS
  • Serverless in AWS, Lambda, SNS, SQS
  • Monitoring, Prometheus, Grafana, Cloudwatch
  • Infrastructure as Code - Terraform Cloud, Cloudformation
  • NoSQL - DynamoDB
  • On-Call & SRE

Block in service

DevOps Engineer

Oct 2020Sep 2021 · 11 mos · United States

  • Event-driven architecture using SQS, SNS, and Lambda
  • Metric monitoring and budget control using Cloudwatch and Cost Explorer
  • Web application served by Elastic Beanstalk
  • Logs archiving in S3
  • Lambda and serverless using Python and C# and Cloud-formation deployment
  • API Gateway access and Cognito security
  • IaC with Cloudformation
  • DB DynamoDB, RDS
  • Sys-Admin

Omnidrone

Pipeline Developer

Nov 2019Feb 2021 · 1 yr 3 mos · Greater Barcelona Metropolitan Area

  • Python developer and pipeline automation CI/Jenkins
  • REST API (API gateway)
  • Lambda (Serverless)
  • Dynamo DB
  • Sheel scripting
  • Batch scripting

Smartninja barcelona

Python Teacher

Oct 2019Sep 2020 · 11 mos · Greater Barcelona Metropolitan Area

B-water animation studios

Pipeline Technical Director & IT

Jul 2019Oct 2019 · 3 mos · Greater Barcelona Metropolitan Area

  • Linux server provisioning (CentOS, RedHat, ANSIBLE)
  • On-Premises Server management (backup, security)
  • Bash scripting and Python programming
  • SysAdmin

Minimo vfx

Python Programmer

Nov 2018Apr 2019 · 5 mos · Barcelona, Catalonia, Spain

  • Python development for internal production tools
  • bash scripting

La salle bcn

Rigging and Python Teacher

May 2018Jun 2019 · 1 yr 1 mo · Barcelona Area, Spain

  • 3D Rigging & Python Programming

Tinker group

Python Programmer

May 2018Oct 2018 · 5 mos · Barcelona Area, Spain

  • Python development for internal studio tooling
  • Bash scripting

Self-employed

Python Developer (Web, Full - Stack)

Apr 2017Apr 2018 · 1 yr · Málaga, Andalusia, Spain

  • Python development for web scraping
  • Django, Full-stack web development, back-end MySQL

Stackforce found 100+ more professionals with Cloud Infrastructure & Artificial Intelligence (ai)

Explore similar profiles based on matching skills and experience