Bastian VS

SRE (Site Reliability Engineer)

Milpitas, California, United States17 yrs experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Over 13 years of experience in Site Reliability Engineering.
  • Led SRE teams at major tech companies like Cisco and Yahoo.
  • Expertise in cloud computing and automation.
Stackforce AI infers this person is a Site Reliability Engineering expert with a strong focus on cloud and big data solutions.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud ComputingBig DataAutomationProduction EngineeringData Management

Other Skills

AWSAnsibleApacheApache OozieApache PigBashCentOSChefConsulCouchDBCouchbaseDNSDockerDruidElasticsearch

About

Highly experienced and results-oriented Site Reliability Engineering (SRE) Manager with 13+ years of expertise in building, scaling, and maintaining mission-critical systems. Proven ability to lead and mentor SRE teams, optimize application performance, and ensure high availability. Passionate about leveraging cutting-edge technologies to solve complex challenges and drive innovation in the DevOps space. My background includes significant contributions at industry leaders like Cisco, Yahoo, Couchbase, and Fidelity, where I've had the opportunity to work on high-impact projects, including being one of the first engineers at Couchbase and Admod Technologies. I'm a firm believer in automation, continuous improvement, and fostering a collaborative environment. Always eager to explore the latest advancements in cloud computing, containerization, and observability.

Experience

17 yrs
Total Experience
2 yrs 4 mos
Average Tenure
2 yrs 5 mos
Current Experience

Cisco

Site Reliability Engineering Manager

Jan 2024Present · 2 yrs 5 mos · On-site

Cisco tetration analytics

2 roles

Techical leader Data Engineering

Nov 2021Present · 4 yrs 7 mos

Technical Leader Site Reliability Engineering (Tetration)

Jan 2018Apr 2022 · 4 yrs 3 mos

  • ● As one of the first engineers, played a crucial role in building the SRE team and scaling the infrastructure for the product
  • ● Lead and mentor a team of SREs responsible for the reliability and performance of Cisco Secure Workload (Tetration Accquired )
  • ● Developed and implemented SRE best practices, including incident management, capacity planning, and performance monitoring.
  • ● Working with Escalation Team to handle day to day maintenance for the customer clusters.
  • ● Testing the application releases before releasing to the production.
  • ● Deploy the new cloud application clusters and maintain.
  • ● Resolving the customer hdfs /mongo/hbase upgrade and solve other hardware related issues.
  • ● Debugging application issues and closely working with the engineering team.
  • ● Expertise in investigating and troubleshooting application issues by tracing python applications and ansible playbooks.
  • Technologies Used:
  • Consul, Vault, Anisible, Shell, Python, Ruby, KVM, Kubernates, Docker, Oracle Cloud, AWS, Hadoop, Mongodb, Haproxy, Centos, Postgress, Redis, Elasticsearch, Mapreduce, Hbase, Kafka, VMware.
ConsulVaultAnsibleShellPythonRuby+18

Fidelity investments

Tech lead

Jan 2017Jan 2018 · 1 yr · Bengaluru, Karnataka, India

  • ● Working with Big Data kafka team as a technical lead to build kafka as a service to Fidelity.
  • ● Build kafka service which can create a cluster less than 10 mins using nodejs and ansible.
  • ● Setup monitoring for kafka service using prometheus and grafana.
  • ● Completely designed the project were to serve kafka as a service similar to AWS Kenisis model (Serverless Architecture )
  • ● Completely automated the releases using jenkins and artifactory.
  • Technologies worked:
  • Kafka, Zookeeper ,Grafana -: Automated monitoring with grafana and promethues.
  • Promethus , Ansible, Nodejs, Dockers, Influxdb, jmx-trans, REST API
KafkaZookeeperGrafanaPrometheusAnsibleNode.js+5

Yahoo!

2 roles

Technical Lead

Promoted

Jan 2016Jan 2017 · 1 yr · Bengaluru, Karnataka, India

  • ● Working with User Targeting and Grid application Production Engieering team which generate the user profile datafor advertising systems.
  • ● Responsible to manage/upgrade 100 node Kafka clusters in 6 colos used in
  • Yahoo Advertising applications.
  • ● Assist dev to onboard new projects to Oozie.
  • ● Setup monitoring for new User targeting projects.
  • ● Configure and setup Splunk for monitoring.
  • ● Onboarding hosts to chef. Writing the cookbooks and deploying in production.
  • Technologies Used:
  • Chef, Kafka, Zookeeper, Apache Oozie. ,Druid.
ChefKafkaZookeeperApache OozieDruidProduction Engineering+1

Systems Engineer in Big Data (Hadoop)

Nov 2011Feb 2014 · 2 yrs 3 mos · Banglore

  • Designation : Systems Engineer in BIG DATA (Hadoop)
  • ● Working with User Data Analytics Team which generates reports for audience engagement for Yahoo websites.
  • ● Support for multiple clusters, with varied frameworks, that generate about 8TB data per day.
  • ● Managing and troubleshooting data analysis and warehousing clusters based on Torque/Maui, and internal software frameworks (PERL based).
  • ● Managing and troubleshooting issues in Hadoop based data analysis applications written in Java MapReduce and Pig.
  • ● Responsible for managing FreeBSD/RHEL servers which run user data analysis apps.
  • ● Configure Manage MySQL replication servers for data warehousing
  • ● Managing host configurations using CM3 (puppet), application configurations using Igor.
  • ● Responsible to replicate HDFS data across Yahoo Hadoop clusters.
  • ● Part of a team that provides 12/7 support for Yahoo’s traffic analysis processes.
  • ● Primary Service Engineer in Bangalore for Hadoop based ETL processes for data analysis.
  • ● Primary SE for Frontpage Analytics, Search and Marketing Analytics, among others.
  • ● Analyzing cluster utilization and planning capacity augmentation for data growth, etc.

Couchbase

Devops Engineer

Feb 2014Jan 2016 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • Providing support for Couchbase Server, an open-source, NoSQL, document-oriented database. Historically the product was based on memcached and couchdb.
  • Customers include Apple, Adobe, Amadeus, AT&T, Bally’s, Beats Music, Betfair, Blizzard Entertainment, Ebay, BMW, British Gas, Cisco, Comcast, SAP / Concur, Disney, Ebay, Electronic Arts, Honda, Intel, Mozilla, Major League Baseball, Nike, Nokia, Orbitz, Paypal, Rakuten / Viber, Sky / BskyB, Symantec, Tencent, Tesco, Thomson Reuters, Ubisoft, Verizon, Vodafone and Walmart

Admod technologies pvt ltd

Systems Engineer

May 2010Nov 2011 · 1 yr 6 mos · Cochin

  • Member of linux/Unix Adminstration Team Managing ecommeice websites remotely which has webserver like apache, nginx . Also worked as a Mysql DBA

Make-a-store, inc.

Unix System Enigineer TeamLead

Mar 2010Nov 2011 · 1 yr 8 mos · Cochin

  • Worked as a Unix System Engineer for Unix System Engineer who reponsable to manage live ecomerce websites.

Infunitum inc.

Linux Administrator

Jun 2009Apr 2010 · 10 mos

  • Worked as a Linux Administrator with Rapidvps.com Managing Linux Servers , OPenvz

Education

Vinayaka Mission's Research Foundation - University

BE — Commputer Science

Jan 2005Jan 2009

V.M.K.V Engineering College

BACHELOR — ENGINEERING

Jan 2005Jan 2009

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience