
Naufal Jamal

SRE (Site Reliability Engineer)

Sunnyvale, California, United States · 19 yrs 4 mos experience
Highly Stable

Key Highlights

  • 15 years of network engineering and automation experience.
  • CCIE certified with advanced knowledge in routing and switching.
  • Expert in Python for network automation and remediation.
Stackforce AI infers this person is a Senior Infrastructure Engineer specializing in network automation and reliability.

Skills

Core Skills

Python · Leadership · Auto-remediation · Network Audits · Redis

Other Skills

Mentoring · GitHub · Prometheus.io · Grafana · Python (Programming Language) · Network Services · Networking · Cisco Technologies · Routing · Data Center · Firewalls · BGP · OSPF · Switches · Cisco Nexus

About

With over 15 years of network engineering and automation experience, I am a Senior Staff Site Reliability Engineer at LinkedIn, where I spearhead the development of a versatile framework that empowers teams to create customized audit plugins tailored to their infrastructure needs. I hold a Cisco Certified Internetwork Expert (CCIE) certification, which demonstrates my advanced knowledge and skills in routing and switching technologies. As an SRE, I leverage my extensive Python coding skills to design and implement network auto-remediation and audit processes that optimize network automation for efficiency and performance. I also mentor team members, conduct rigorous code reviews, and present at Tech Talks to advocate for Infrastructure as Code practices. Some of the projects that I have successfully delivered include a config backup system for network devices using GitHub, and a caching system using Redis to store operational data from network devices. I am passionate about solving complex network automation challenges and enhancing network infrastructure through innovation and collaboration.

Experience

Total Experience: 19 yrs 4 mos
Average Tenure: 5 yrs 8 mos
Current Experience: 2 yrs 3 mos

Nvidia

Site Reliability Engineering - Networks

Feb 2024 – Present · 2 yrs 3 mos · Santa Clara, California, United States · On-site

LinkedIn

5 roles

Senior Staff Site Reliability Engineer

Promoted

Sep 2022 – Dec 2023 · 1 yr 3 mos

  • Spearheaded the development of a versatile framework for creating customized audit plugins tailored to infrastructure needs.
  • Mentored team members and conducted rigorous code reviews to maintain high code quality standards.
  • Engaged in extensive Python coding for network auto-remediation and audit processes, focusing on optimizing network automation for efficiency and performance.
  • Regularly presented at Tech Talks and advocated for Infrastructure as Code practices.
  • Successfully implemented a config backup system for network devices using GitHub.
  • Designed a caching system using Redis to store operational data from network devices, making it accessible via internal APIs for auditing purposes.
  • Pioneered an automated system for provisioning and auditing network devices, eliminating manual intervention and accelerating scalability.
  • Developed an auto-remediation tool, "Networktrafficshift," in Python to proactively identify and address link errors/faults in data centers.
  • Led the development of "Audit360," a generic framework for onboarding and performing custom audits across LinkedIn, offering a plug-and-play architecture and ensuring audit consistency.
  • Overall, played a key role in advancing infrastructure and network automation, code quality, and efficiency through innovation and leadership.
Leadership · network audits · auto-remediation · Python · Redis · Mentoring

Staff Software Engineer

Feb 2021 – Sep 2022 · 1 yr 7 mos

  • Specialized in network automation, with a focus on network audits and monitoring using Python in combination with Grafana and Prometheus.
  • Implemented a Prometheus exporter service that takes data points from internal systems and converts them into metrics that Prometheus understands.
  • Led key projects, including "StateDB," which developed a caching system based on Redis Sentinel for storing and exposing operational health data from network devices via REST APIs, and "Configvault," which revamped the legacy backup system at LinkedIn using GitHub to maintain version control of device configurations.
  • Contributed to enhancing network efficiency, monitoring, and data management through innovative solutions and leadership in these projects.
Prometheus.io · Grafana · Python (Programming Language) · Python · network audits

Staff Network Engineer

Promoted

Mar 2017 – Sep 2022 · 5 yrs 6 mos

  • Data center migrations from Layer 2 to Layer 3 networks.
  • Data center and backbone build projects.
  • Peering turn-ups at IXPs and their automation.
  • Part of IPv6 migration projects.
  • Primarily focused on solving network operations problems using Python
  • Presented at international conferences on network problem-solving using Python, showcasing expertise in this field.
  • Successfully delivered multiple projects in network operations, providing valuable tools for network engineers to enhance daily operations.
  • Notable projects include "Prefixmon," a Python script for monitoring backbone data from peers, and "NetSMART," a tool for validating network changes after maintenance, aiding in the detection of undesirable alterations and preventive actions.

Senior Network Engineer

Promoted

Feb 2015 – Mar 2017 · 2 yrs 1 mo

  • Duties include, but are not limited to:
  • Handle data center build projects.
  • Currently part of the Anycast working group.
  • Run POCs for new network designs.
  • Monitor LinkedIn prefixes over the Internet via the BGPMon portal and ThousandEyes (self-driven initiative).
  • Write Python scripts to automate network-related tasks, including programs with more than 10K lines of code.
  • Turn up new VIPs on Citrix load balancers, monitor their SSL certificate expiry, and install SSL certificates on the load balancers.
  • Turn up new core switches in the network and perform QA before bringing them into production.
  • Perform transit/peering link turn-ups.
  • Implement network POPs.
  • Perform on-call duties; handle network outages, maintenance, and troubleshooting.
  • Implement routers, switches, and firewalls in the network, along with related tasks.
  • Platforms: Cisco Nexus/6500, ASA, Citrix NetScalers, A10 load balancers, Juniper MX80s, MX960s

Network Engineer

Aug 2012 – Feb 2015 · 2 yrs 6 mos

  • Managed network infrastructure at LinkedIn as part of the network engineering team to maintain smooth network operations. Worked mainly on A10/NetScaler load balancers, Cisco ASA firewalls, and the Cisco Nexus platform. Handled network outages and upgrades. Involved in implementing LinkedIn POPs, establishing LinkedIn's Internet presence across the globe.

Hewlett-Packard

Network Consultant

Jun 2011 – Aug 2012 · 1 yr 2 mos · Bangalore

  • Managed day-to-day network operations for a US financial company's data centers, focusing on Nexus 7k platform, 6500 switches, and various other data center product lines to ensure seamless network functionality.

HCL Comnet Systems and Services Limited

Network Engineer

Nov 2006 – Jun 2011 · 4 yrs 7 mos · Noida Area, India

  • Worked as a network engineer on significant data center refresh projects and deployments for US energy companies
  • Demonstrated expertise in technologies such as Cisco VSS, Cisco Nexus, ACE, and GSS, contributing to the optimization of critical modern networks for these clients.
