Arun Raman

CTO

San Francisco, California, United States15 yrs 5 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in deploying AI solutions for enterprise clients.
  • Proven track record in optimizing cloud deployments.
  • Strong collaboration skills across technical teams.
Stackforce AI infers this person is a SaaS expert with a focus on AI and cloud solutions.

Contact

Skills

Core Skills

Large Language Models (llm)InferenceDockerOpenstack

Other Skills

BERTCCSSDLIDiameterDistributed SystemsElasticSearchExalogicGTPGitGithubIPv6IntegrationJavaKibana

About

Strong experience as a ”Voice of Customer” and ”Trusted Advisor” for the customer. Strong experience in finding the right customer workload, resulting in multi million dollar revenue wins. A technical focal point for customers on accelerating their DL/ML training, inference, and MLOps challenges. Strong experience in designing scalable solutions. Strong experience in internal and cross-team collaboration.

Experience

15 yrs 5 mos
Total Experience
3 yrs 1 mo
Average Tenure
7 yrs 11 mos
Current Experience

Nvidia

2 roles

Lead AI Solutions Architect

Promoted

Jan 2020Present · 6 yrs 3 mos

  • Leading the Nvidia Dynamo [https://github.com/ai-dynamo/dynamo] customer enablement.
  • Engineering lead on the Nvidia LLM Router Blueprint [https://github.com/NVIDIA-AI-Blueprints/llm-router/], designing it from concept to open-source release.
  • Enabled multiple green field Gen-AI and RAG deployments for customers, thereby enabling them to go to market with AI solutions.
  • Covered the full spectrum from customer workload identification, scoping, revenue analysis, POC, Production, and Post-Sale handoff.
  • Cross-functional collaboration with the product managers, engineering, and press communications to enable the customer with production-grade solutions with joint press releases.
  • Led the development for multiple strategic accounts on accelerating their Deep Learning inference workload by moving to Nvidia Triton server and GPU.
Voice of the CustomerTrusted AdvisorLarge Language Models (LLM)Inference

Lead Software Development Engineer

May 2018Jan 2020 · 1 yr 8 mos

Oracle

Senior Software Engineer

May 2015May 2018 · 3 yrs · San Francisco Bay Area

  • Improved deployment by 25% with parallelization, to deploy Oracle Object Storage container workloads as microservices in Docker for Oracle Cloud.
  • Effectively transitioned the legacy async message brokering model from RabbitMQ to Kafka/Zookeeper
  • cluster to improve latency and scalability.
  • Designed an HA solution to improve latency for Object Storage requests in the Oracle Cloud using Nginx.
  • Designed and developed the Oracle Private Cloud OpenStack Nova driver, VNC console proxy via SSH
  • tunnel in Python using Parakmiko.
  • Reduced the deployment time by 50% by building a CLI using click, jinja2, and fabric python modules
  • for deploying and upgrading OpenStack in production.Built Jenkins jobs in bash for CI/CD of Oracle Object Storage for gating new releases to customers.
  • Built Oracle Object Storage microservice using on Docker containers on Oracle Linux.

Vmware

Member Of Technical Staff

Jun 2013May 2015 · 1 yr 11 mos · San Francisco Bay Area

  • Worked on OVS on ESX integration.
  • Worked on SMP Fault Tolerance for Multi Core VMs.
  • Worked on Network IO Control(NetIOC).
  • Worked on Multi-VC/NSX for L2 networking

Juniper networks

2 roles

Software Engineer

Feb 2011May 2013 · 2 yrs 3 mos · Sunnyvale, CA

  • Worked on AnchorPFE and Anchor Service PIC redundancy.
  • Worked on a Call Model simulation tool for SGW for parallel BSD machines and scaling up to million sessions with X2, S1, Irat, Traffic area update, Routing area update.
  • Worked on PCRF and OCS server simulation tool, running FreeBSD machine.

Software Engineering Intern - Routing Protocol Daemon Group

Jun 2010Aug 2010 · 2 mos · Sunnyvale, CA

  • Set up generic API libraries for the RPD - FT team.
  • Created a framework for APCON switches for testing.

Ericsson - nortel labs

Software Release Engineering Intern

Aug 2010Dec 2010 · 4 mos · Richardson, Texas

  • * Created a CGI back end engine to churn out build metrics data from the database and process the data to create graphs using the data.

Imagestream internet solutions, inc

Software Development Intern

Feb 2010May 2010 · 3 mos

  • Built Firewall congurations for the Linux routers using the IP-tables with inbuilt and user defined chains and developed a DHCP lease data logging module and integrated to the ISC DHCP Daemon.
  • Built automation scripts for the Managed Services Program (MSP), using Shell scripts for network monitoring of OSPF, PPP, BGP routes using Nagios - an open source network monitoring system.

Education

The University of Texas at Dallas

M.S — Communication and Signal Processing

Jan 2008Jan 2010

ANNA UNIVERSITY - India

B.E — Electronics and Communication Engineering

Jan 2004Jan 2008

Stackforce found 100+ more professionals with Large Language Models (llm) & Inference

Explore similar profiles based on matching skills and experience