Jaishal Bansal

SRE (Site Reliability Engineer)

India · 11 yrs 5 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • Expert in Site Reliability Engineering with extensive Kafka experience.
  • Proven track record in incident management and process optimization.
  • Strong background in DevOps and automation solutions.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in SaaS infrastructure and automation.

Skills

Core Skills

Site Reliability Engineering · Kafka · Automation · Incident Management · Team Leadership · Web Development · Team Collaboration · DevOps · System Administration

Other Skills

Ansible · Apache · Bash · C++ · CCNA · CentOS · Cloud Computing · Communication · Computer Network Operations · Consul · DNS · Docker · Domain Name System (DNS) · EIGRP · Git

About

A computer engineer interested in exploring the value of technology to create change. I have strong analytical skills and work well with a team to troubleshoot complex issues. I am enthusiastic about the latest trends in technology and like to spend my time learning and trying new things, which helps me get better at my work every day. While I follow conventions and best practices, I like to hack on things until they work. Sometimes it's genius; other times it's, um, less than stellar.

Experience

IMC Trading

Senior Site Reliability Engineer (Trading Engineer)

Feb 2023 - Present · 3 yrs 1 mo · Mumbai, Maharashtra, India · On-site

LinkedIn

3 roles

Site Reliability Engineer

Aug 2021 - Feb 2023 · 1 yr 6 mos

  • Scaled and maintained the streaming infrastructure of LinkedIn.
  • Ensured that one of the world's largest Kafka deployments stayed up and running.
  • Led the initiative to automate the OS update lifecycle for the Kafka ecosystem.
  • Mentored new hires in coming up to speed and getting familiar with the LinkedIn application stack.
  • Worked on initiatives for spare health & capacity management.
Kafka · Automation · Incident Management · Site Reliability Engineering

Sr. Site Operations Engineer

Promoted

Oct 2018 - Aug 2021 · 2 yrs 10 mos

  • Led the team in handling all major operational issues and incidents and driving them to resolution.
  • Worked on redesigning & revamping the Incident Management process at LinkedIn.
  • Worked on projects to streamline the process, and to better optimize the efficiency of the team.
  • Played a part in hiring and building the team & mentoring new hires.
  • Worked with my manager to discuss, decide on, and take steps toward short-term and long-term goals for the team.
Incident Management · Team Leadership · Process Optimization

Site Operations Engineer

Nov 2016 - Sep 2018 · 1 yr 10 mos

  • Worked with the team in handling all major operational issues and incidents.
  • Worked on projects to streamline the process, and to better optimize the efficiency of the team.
  • Built scalable web applications using technologies like Python, Jinja, HTML, and JS to automate and streamline the team's processes and workflow.
  • Handled applications and deployments, contributing to the scaling of the app stack to multiple data centers.
  • Drove daily stand-up meetings to discuss critical production issues and assisted the teams in taking appropriate actions to resolve them.
Python · HTML · JavaScript · Team Collaboration · Web Development

Zycus

Sr Cloud Engineer

Jul 2016 - Oct 2016 · 3 mos · Mumbai Area, India

  • Part of a new DevOps team working towards automating and optimizing existing workflows.
  • Implemented tools and services to fix and automate common repetitive issues.
  • Wrote custom Ansible playbooks for managing the full application stack.
  • Built a proof of concept and implemented Consul for service discovery and key/value management.
  • Set up monitoring solutions using tools like Nagios/Icinga, InfluxDB, and Grafana.
Ansible · DevOps · Monitoring Solutions · Automation

Directi

System Administrator

Aug 2014 - Jun 2016 · 1 yr 10 mos · Mumbai Area, India

  • Managed products using Linux and Linux application stacks.
  • Automated and implemented permanent resolutions to prevent outages and downtime.
  • Configured and monitored regular backups, ensuring their integrity and consistency.
  • Handled incident response, troubleshooting, fixes, and escalations for various products and services.
  • Monitor the stability of Infrastructure.
Linux · Incident Response · Troubleshooting · System Administration · Incident Management

Education

Thakur College of Engineering & Technology, Shyamnarayan Thakur Marg, Thakur Village, Samata Nagar, Kandivli (E), Mumbai 400 101

Bachelor of Engineering (B.E.) — Computer Engineering

Jun 2011 - Jun 2014

Shree Bagubhai Mafatlal Polytechnic

Diploma — Computer Engineering

Jan 2007 - Jan 2010

Gokuldham High School & Jr. College

Jan 2001 - Jan 2007
