Vikrant Aggarwal

Software Engineer

Canada · 15 yrs 5 mos experience
Most Likely To Switch: Highly Stable

Key Highlights

  • Expert in managing large-scale cloud infrastructure.
  • Strong background in Kubernetes and Google Cloud Platform.
  • Proven track record in incident management and ITIL processes.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with expertise in high-performance computing and observability.

Skills

Core Skills

Kubernetes, Google Cloud Platform (GCP), Observability, Incident Management

Other Skills

High Performance Computing (HPC), Storage, Automation, Reliability, Kafka, Elasticsearch, Logging, Metrics, ITIL, Shell Scripting, OpenStack (IaaS), Red Hat Linux, Oracle Solaris, Oracle Virtualization Manager x86, VMware

About

Working as OpenStack (IaaS) STSE. Specialties: OpenStack (IaaS), Red Hat Linux, Oracle Solaris, Oracle Virtualization Manager x86, VMware

Experience

Tower Research Capital

Sr. Systems Engineer

Feb 2019 - Present · 7 yrs 1 mo · Montreal, Quebec, Canada · On-site

  • Driving end-to-end ownership of infrastructure at massive scale, from designing and deploying storage, compute, and k8s clusters to implementing observability, automation, and reliability practices that power mission-critical workloads across on-premises and cloud environments.
  • I manage petabyte-scale storage clusters that serve multiple protocols with high availability and performance, and I benchmark storage systems to find the best fit for our use cases. I designed and currently oversee compute schedulers running 100K+ cores across on-prem and cloud environments, and I am extending this infrastructure with all the high-end GPUs available on the market.
  • Actively contributing to building and optimizing Kubernetes clusters, with a focus on capacity planning, reliability, and storage availability. Strong expertise in Google Cloud.
  • In addition, I have strong knowledge of observability platforms and technologies such as Kafka, Elasticsearch, and other log and metrics pipelines. I designed and currently manage end-to-end logging and metrics solutions that provide deep visibility, reliability, and actionable insights into distributed systems at scale.
Kubernetes, High Performance Computing (HPC), Google Cloud Platform (GCP), Storage, Observability, Automation, +1

Ciena

System Engineer

Jun 2017 - Feb 2019 · 1 yr 8 mos · Gurgaon, India

Red Hat

2 roles

Senior Technical Support Engineer

Dec 2016 - Jun 2017 · 6 mos

  • Collaborating on complex issues.
  • Helping team members reproduce issues in house.
  • Joining multi-vendor calls.

Technical Support Engineer

Nov 2014 - Nov 2016 · 2 yrs

  • Providing support to Red Hat customers.
  • Primarily handling OpenStack (IaaS) cases.
  • Analyzing customer-provided data and providing solutions to fix the issues.
  • Joining multi-vendor calls in critical situations to resolve customer issues.
  • Preparing knowledge-base articles after fixing customer issues.
  • Running remote sessions to provide quick solutions to customers.
  • Coordinating with the engineering team to expedite bug fixes.
  • Reproducing customer issues in the test lab.

Sungard Availability Services

Sr. System Engineer

Oct 2013 - Nov 2014 · 1 yr 1 mo · Pune Area, India

  • Developing shell scripts for internal activities.
  • Daily calls with the onsite team on improvements to ongoing processes.
  • Mentoring the bridges for incident management.
  • Attending meetings with Service Managers.
  • Mentored freshers in the team on ITIL processes, services, and incident management.
  • Strong knowledge of Incident Management, Problem Management, Change Management, and Event Management.
  • Installing software, upgrading OS versions, and managing packages.
  • Configuring and maintaining services.
  • Implementing, configuring, and maintaining LVM (Logical Volume Manager).

Tata Consultancy Services

IT System Engineer

Sep 2010 - Sep 2013 · 3 yrs

  • Operating Systems: Solaris 10, RHEL
  • Ticketing System: ServiceNow
  • File Systems: ZFS, ext3, ext4, XFS
  • Server Virtualization: Solaris Zones, OVM x86, VMware
  • Job Schedulers: cron, at
  • System Health Checks: prstat, top, sar, vmstat, mpstat, iostat
  • Shell Scripting: sh, bash
  • System Hardening: Customization based on client's standard
  • Run Control Scripts: Administration of system run levels and deployment of custom scripts based on application requests.
Management Skills:
  • Key driver for business process design and solution identification.
  • Process vision and design, enhancement strategy and transition strategy.
  • Troubleshooting problems during installation.
  • Auditing customer systems, workflows, and processes to suggest solutions.
  • Providing technical support to the team.
  • Conducting weekly review meetings with the team.
  • Negotiating SLAs for large and medium-size business transformation projects.
Responsibilities:
  • Providing server environment management services to maintain performance at optimum levels on a 24x7 basis.
  • Operating system administration, including management of users and processes.
  • Reinstalling the OS after system crashes or failures.
  • Applying operating system updates, patches, and configuration changes.
  • Assisting the application, storage, and network teams with deployments and configuration as needed.
  • Actively monitoring systems for memory, CPU, and disk space issues and taking corrective action.
  • Creating a server knowledge base for the team.
  • Evaluating system performance and reporting metrics to management.

Education

Lovely Professional University

B.Tech — Electronics and Communication

Jan 2006 - Jan 2010

Punjab School Education Board

10+2 — PCM

Jan 2005 - Jan 2006

Punjab School Education Board

Matriculation — Non-Medical

Jan 2003 - Jan 2004