Syed ZahirUllah.M

Platform Engineer

Riyadh, Saudi Arabia · 13 yrs 9 mos experience

Key Highlights

  • 13 years of experience in DevOps and SRE solutions.
  • Expert in designing scalable cloud solutions.
  • Proficient in CI/CD and Infrastructure as Code.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with expertise in DevOps and Site Reliability Engineering.

Skills

Core Skills

Cloud Infrastructure Management · Kubernetes · Cloud Architecture · DevOps · Site Reliability Engineering · Infrastructure Automation · Operational Excellence · Cloud Solutions Design · Infrastructure Management · Configuration Management

Other Skills

AWS · AWS CloudFormation · Amazon Web Services (AWS) · Ansible · Apache Kafka · AppDynamics · Architecture · Argo · Chef · Confluent · Dynatrace · Elasticsearch · Fluentd · GCP · Google Cloud Platform (GCP)

About

  • 13 years of experience designing and implementing DevOps/SRE solutions.
  • Expert in DevOps, production engineering, operations and automation.
  • Expert in designing highly scalable solutions on cloud platforms.
  • Working with cloud platforms such as AWS, Joyent, OCI and GCP.
  • Results-oriented, passionate professional with 11 years across IT / networking, systems administration and maintenance.
  • Working for Vision Bank as a Senior Platform Engineer, located in Riyadh.
  • Mainly focusing on orchestration of cloud infrastructure services, from design and implementation to operation of systems that offer cost savings to the business. Current interests are big data, container (K8s) orchestration, concurrent and distributed systems, private and public cloud, monitoring systems and log analysis.
  • Continuous integration and deployment of applications using Git, Maven, Jenkins, Ruby, Chef and OpsWorks pipelines.
  • Setting up log analytics, including Elasticsearch tuning, index design and visualization with Kibana and other tools.
  • Ramping up on container (Docker) orchestration, concurrent and distributed systems, private and public cloud, monitoring systems, and Infrastructure as Code using Terraform and CloudFormation.
  • Experience with repositories such as SVN, Git and Perforce.
  • Practical experience working with databases such as MySQL, MariaDB, InfluxDB, Cassandra, Couchbase and MongoDB.

Experience

Vision Bank

Senior Platform Engineer

Jan 2025 - Present · 1 yr 2 mos · Riyadh, Saudi Arabia · On-site

  • Responsible for the design, development, upgrade and maintenance of the platform in public cloud, and for building a stable operation and maintenance platform.
  • Responsible for microservice deployment, configuration and maintenance on K8s: Kong API Gateway, Istio service mesh, Argo CD ApplicationSets, GitOps model.
  • Contributed to a set of best patterns and practices for deploying cloud-based infrastructure as code in a secure, reliable and efficient manner; improved the Terraform module structure so that it can be reused across multiple environments.
  • Provided subject matter expertise in troubleshooting issues impacting the performance, security, efficiency and reliability of cloud-based services.
  • Provisioned and scaled environments to meet production traffic variations (networking and HPA).
  • Gathered and analyzed metrics from cloud resources to assist in performance tuning and fault finding.
  • Implemented configuration management using Ansible.
  • Completed image standardization for all workloads based on the CIS standard, using technologies like Ansible.
  • Improved GitLab CI templates for deploying resources, with tools for security scanning.
  • Experience with Oracle and Google Cloud (IaaS and PaaS).
Terraform · OKE · Teleport · Confluent · PostgreSQL · Fluentd · +6

Banque Saudi Fransi

Senior DevOps Engineer

Feb 2023 - Dec 2024 · 1 yr 10 mos · Riyadh, Saudi Arabia · On-site

  • Working with OCI public cloud to manage applications.
  • Responsible for designing and implementing cloud-based solutions that meet the business requirements of an organisation.
  • Responsible for designing and implementing the infrastructure for a cloud environment, including the network, storage, and computing resources.
  • Responsible for implementing and maintaining security controls in a cloud environment to ensure the confidentiality, integrity, and availability of data and applications.
  • Responsible for developing and implementing automated processes for building, deploying, and testing software applications in a cloud environment.
  • Responsible for managing the day-to-day operations of a cloud environment, including monitoring, troubleshooting, and optimizing the performance of cloud-based applications and infrastructure.
  • Responsible for designing and implementing the network infrastructure for a cloud environment, including virtual cloud networks (VCNs), load balancers, and firewalls.
Terraform · Google Cloud Platform (GCP) · Apache Kafka · Architecture · OCI · Amazon Web Services (AWS) · +10

Palo Alto Networks

Principal Site Reliability Engineer

May 2021 - Jan 2023 · 1 yr 8 mos · Bangalore Urban, Karnataka, India

  • Coding experience in Python.
  • Proficient in CI/CD platforms like Jenkins, build and configuration management using Terraform and Ansible.
  • 9+ years experience in building infrastructure automation processes with a focus on scalability and reliability.
  • Self-disciplined, self-managed, self-motivated and strong sense of ownership, urgency, and drive.
  • Ability to diagnose and troubleshoot complex distributed systems handling high volume transactions.
  • Passionate to learn, understand, and dissect new technology stack.
  • Proficient in Linux, Git, AWS, Docker, K8s.
  • Passion for automation and monitoring instrumentation in the code.
  • Excellent communication skills and the ability to work well in a team.
  • Skills: Infrastructure Automation · Scripting · Site Reliability Engineering · Elasticsearch · Terraform · Amazon Web Services (AWS) · Continuous Integration and Continuous Delivery (CI/CD) · Grafana · Elastic Stack (ELK) · System Monitoring · MongoDB · Bash · Docker · Jenkins · Ansible · Kubernetes · Infrastructure as Code (IaC)
Terraform · Google Cloud Platform (GCP) · Apache Kafka · Amazon Web Services (AWS) · Python (Programming Language) · AppDynamics · +5

Intuit

Site Reliability Engineer

Mar 2019 - May 2021 · 2 yrs 2 mos · Bangalore

  • Responsible for driving operational excellence for the connected services that a business offers to its customers to deliver an "always on" operation, year-round, at the right cost
  • Uses knowledge of technology and operational best practices to drive the design, development and implementation of operational standards and capabilities for connected services that enable highly available, scalable & reliable customer experiences
  • Analyzes and synthesizes a variety of inputs to drive end-to-end incident management for multiple offerings
  • Includes creating, developing & managing the deployment architecture for applications
  • Developing the monitoring architecture and implementing monitoring agents, dashboards, escalations and alerts
  • Developing and driving incident management processes, playbooks and stakeholder communication mechanisms
  • Overseeing change management & configuration management operating mechanisms
  • Driving root cause analysis (RCA) and risk management processes
  • Driving ongoing improvements and efficiencies in operational practices, tools & processes BU and Intuit-wide
  • As part of the SRE team, responsible for the configuration, optimization, documentation and support of infrastructure components hosted in colocated facilities and cloud services such as AWS.
  • Work independently across multiple platforms and applications to understand dependencies
  • Evaluate new tools, technologies, and processes to improve speed, efficiency, and scalability of continuous integration environments
  • Design, build, and deliver cloud computing solutions, hosted services, and underlying software infrastructure.
  • Ability to support 12x7 on-call on a rotational basis along with other team members
Terraform · Google Cloud Platform (GCP) · Apache Kafka · Amazon Web Services (AWS) · Python (Programming Language) · AppDynamics · +5

Samsung R&D Institute India - Bangalore Private Limited

2 roles

Lead Engineer

Promoted

Mar 2017 - Feb 2019 · 1 yr 11 mos · Bangalore

  • Samsung provides platforms for the software systems that power mobile devices and take them to the next level in the industry.
  • Set up AWS environments from scratch through to production release using infrastructure as code, with tools like Terraform and CloudFormation.
  • Worked with custom Chef recipes on AWS OpsWorks for configuration management and code deployment, using Terraform, Boto and AWS CloudFormation.
  • Worked on Chef recipes to automate tasks like environment setup, hardening servers, centralized log collection, monitoring, continuous deployments etc.
  • Implemented various architecture designs for high availability, scalability and disaster recovery at region level, as per project requirements.
  • Handled weekly on-call duty to support production issues.
Terraform · Apache Kafka · Amazon Web Services (AWS) · Python (Programming Language) · Fluentd · AWS CloudFormation · +3

Senior Software Engineer

Nov 2015 - Feb 2017 · 1 yr 3 mos · Bangalore

  • Worked on Chef recipes to automate tasks such as setting up centralized log collection, monitoring, vulnerability patching and security audits.
  • Configured centralized log collection using the Fluentd, Logstash, Elasticsearch and Kibana stack to parse common web server and app server logs, and Syslog for intrusion detection. Configured WebShelter for DDoS attacks and ds_agent deep protection.
  • Implemented various architecture designs for high availability, scalability and disaster recovery at region level, as per client requirements. Worked on CloudFormation templates.
  • Set up AWS environments from scratch through to production release.
  • Worked with custom Chef recipes on AWS OpsWorks for configuration management and code deployment, using Jenkins and AWS CodeDeploy.
Terraform · Apache Kafka · Amazon Web Services (AWS) · Python (Programming Language) · AWS CloudFormation · Kubernetes · +2

SAP

Consultant

Apr 2015 - Nov 2015 · 7 mos · SAP Labs Bangalore

  • Expertise in UNIX administration activities, with excellent performance while consistently meeting SLAs.
  • LVM management on Linux servers.
  • Successfully worked on and resolved issues across various complex situations.
  • Mentoring & coaching the new joiners.
  • Work on disk usage, high CPU utilization and swap usage issues.
  • Client presentations and status reporting.
  • Managing the installation of monitoring tool at the client side.
  • Managing the Patch activity for Linux Servers.
  • Ensuring the healthy functioning of the entire process.
  • Efficiently worked on several urgent issues, which not only sharpened my skill set but also demonstrated my ability to multitask.
Amazon Web Services (AWS)

Cognizant

2 roles

Senior Systems Engineer

Promoted

Mar 2013 - Apr 2015 · 2 yrs 1 mo

  • Successfully performed the role of Acting Lead during the pilot phase of the project.
  • Was a key resource during the transition of the project.
  • Resolved issues in various complex situations.
  • Kept the server load average at a normal level by diagnosing performance problems related to memory/CPU utilization.
  • Day-to-day deployments and changes using SVN and .rb scripts.
  • Managing file security permissions.
  • Linux user account and group creation and modification.
  • Managing 3000+ Linux and 500+ Windows servers.
  • Good understanding of running services and of troubleshooting high resource utilization, such as memory and CPU.
  • Worked on disk usage, high CPU utilization and memory usage issues.
  • Managing the installation of the monitoring tool at the client side.
  • Managing patch activity for Linux servers using Spacewalk.
  • Server capacity reporting and other reports on a daily basis.
  • Basic scripting for automating daily activities.
  • Mentoring freshers and training less experienced team members.
  • Day-to-day communication with stakeholders for a smooth process flow. Client presentations and status reporting.
  • Consistent, extra-mile efforts earned me the Shift Lead position very early in my tenure at the company.
Amazon Web Services (AWS)

Engineer Level Trainee

Mar 2012 - Mar 2013 · 1 yr

Amazon Web Services (AWS)

Education

Birla Institute of Technology and Science, Pilani

Master of Technology - MTech — Software Engineering

Jan 2013 - Dec 2015

National College of Engineering

Bachelor’s Degree

Jan 2007 - Jan 2011

IIPE Laxmi Raman Higher Secondary School

High School

Jan 2005 - Jan 2007
