Jaishal Bansal

SRE (Site Reliability Engineer)

India · 11 yrs 5 mos experience

Most Likely To Switch: Highly Stable

Key Highlights

Expert in Site Reliability Engineering with extensive Kafka experience.
Proven track record in incident management and process optimization.
Strong background in DevOps and automation solutions.

Stackforce AI infers this person is a Site Reliability Engineer with expertise in SaaS infrastructure and automation.

Skills

Core Skills

Site Reliability Engineering, Kafka, Automation, Incident Management, Team Leadership, Web Development, Team Collaboration, DevOps, System Administration

Other Skills

Ansible, Apache, Bash, C++, CCNA, CentOS, Cloud Computing, Communication, Computer Network Operations, Consul, DNS, Docker, Domain Name System (DNS), EIGRP, Git

About

A Computer Engineer interested in exploring the value of technology to create change. I have strong analytical skills and work well with a team to troubleshoot complex issues. I am enthusiastic about the latest trends in technology and like to spend my time learning and trying new things, which helps me get better at my work every day. While I follow conventions and best practices, I like to hack on things until they work. Sometimes it's genius, other times it's, um, less than stellar.

Experience

IMC Trading

Senior Site Reliability Engineer (Trading Engineer)

Feb 2023 – Present · 3 yrs 1 mo · Mumbai, Maharashtra, India · On-site

LinkedIn

3 roles

Site Reliability Engineer

Aug 2021 – Feb 2023 · 1 yr 6 mos

Scaled and maintained LinkedIn's streaming infrastructure.
Kept one of the world's largest Kafka deployments up and running.
Led the initiative to automate the OS update lifecycle for the Kafka ecosystem.
Mentored new hires as they came up to speed on the LinkedIn application stack.
Worked on initiatives for spare health & capacity management.

Kafka, Automation, Incident Management, Site Reliability Engineering

Sr. Site Operations Engineer

Promoted

Oct 2018 – Aug 2021 · 2 yrs 10 mos

Led the team in handling all major operational issues and incidents and driving them to resolution.
Worked on redesigning & revamping the Incident Management process at LinkedIn.
Worked on projects to streamline processes and improve the team's efficiency.
Played a part in hiring and building the team and mentoring new hires.
Worked with my manager to discuss, decide on, and take steps to achieve short-term as well as long-term goals for the team.

Incident Management, Team Leadership, Process Optimization

Site Operations Engineer

Nov 2016 – Sep 2018 · 1 yr 10 mos

Worked with the team in handling all major operational issues and incidents.
Worked on projects to streamline processes and improve the team's efficiency.
Built scalable web applications using technologies like Python, Jinja, HTML, and JS to automate and streamline the team's processes and workflows.
Handled applications and deployments, contributing to the scaling of the app stack to multiple data centers.
Drove daily stand-up meetings to discuss critical production issues and assisted the teams in taking appropriate actions to resolve them.

Python, HTML, JavaScript, Team Collaboration, Web Development

Zycus

Sr Cloud Engineer

Jul 2016 – Oct 2016 · 3 mos · Mumbai Area, India

Part of a new DevOps team working towards automating and optimizing existing workflows.
Implemented tools and services to fix and automate common repetitive issues.
Wrote custom Ansible playbooks for managing the full application stack.
Built a proof of concept and implemented Consul for service discovery and key/value management.
Set up monitoring solutions using tools like Nagios/Icinga, InfluxDB, and Grafana.

Ansible, DevOps, Monitoring Solutions, Automation

Directi

System Administrator

Aug 2014 – Jun 2016 · 1 yr 10 mos · Mumbai Area, India

Managed products using Linux and Linux application stacks. Automated and implemented permanent resolutions to prevent outages and downtime.
Configured and monitored regular backups, ensuring their integrity and consistency.
Handled incident response, troubleshooting, fixes, and escalations for various products and services.
Monitored the stability of the infrastructure.

Linux, Incident Response, Troubleshooting, System Administration, Incident Management