Roshan Bhatia

Senior Software Engineer

Portland, Oregon, United States · 6 yrs 5 mos experience

Key Highlights

  • Expert in platform engineering and cloud-native solutions.
  • Led development of scalable backend services and APIs.
  • Proven track record in optimizing cloud infrastructure.
Stackforce AI infers this person is a SaaS-focused Senior Software Engineer with expertise in cloud infrastructure and platform engineering.

Skills

Core Skills

Platform Engineering · Amazon Web Services (AWS) · Databases · Infrastructure · Kubernetes · Docker

Other Skills

AWS Elastic Kubernetes Service · AWS SQS · Amazon ECS · Ansible · Bash · C · Dgraph · Distributed Systems · Git · Go · Google Cloud Platform (GCP) · Grafana · High Performance Computing (HPC) · Java · JavaScript

About

As a Senior Software Engineer, I work on platform engineering projects that support and enhance the engineering productivity, reliability, and performance of business-critical applications and systems.

Experience

Total Experience: 6 yrs 5 mos
Average Tenure: 1 yr
Current Experience: 1 yr 10 mos

Nike

Senior Software Engineer, Nike Runtime Foundation

Jul 2024 - Present · 1 yr 10 mos · Portland, Oregon Metropolitan Area · Hybrid

Shipyard

Senior Software Engineer

Feb 2024 - Jun 2024 · 4 mos · Portland, Oregon, United States · Remote

  • Worked on everything from frontend development with React to custom Kubernetes controller logic in Go.

Laurel (previously Time by Ping)

Senior Software Engineer, Platform

Jul 2021 - Dec 2023 · 2 yrs 5 mos · Portland, Oregon, United States · Hybrid

  • Software Engineer on a US-based distributed team primarily focused on platform engineering through building backend services, shared libraries, infrastructure abstractions, observability tooling, and delivery mechanisms used across our engineering teams with a keen eye towards minimizing cloud spend while maximizing availability.
  • Set best practices and technical standards, educated developers on them, and established processes concerning infrastructure, observability, incident response, and build/release systems.
  • Developed a globally available, cloud native API Gateway built with Golang and custom Caddy modules, which performs intelligent routing, retries, authentication, and observability transparent to our internal services.
  • Lead developer for the application configuration service, a tier-1, globally available GraphQL-based Nest.js (TypeScript Node.js) backend service that stores versioned multi-tenant configuration in a globally replicated MongoDB cluster, with configuration creation done via a second event-driven Nest.js microservice using AWS SQS.
  • Co-owner (member of an engineering-wide subteam) of shared TypeScript Node.js/Nest.js libraries, primarily focusing on a standardized way to easily implement logging, tracing, resolving runtime configuration, and message queue utilization.
  • Helped drive the design of a modern, multi-region, global orchestration platform based on AWS Elastic Kubernetes Service, utilizing spot instances, Karpenter, AWS Secrets Manager, the AWS ALB ingress controller, AWS Global Accelerator, KEDA-based autoscaling, and OpenTelemetry, with associated abstractions in Terraform.
  • Developed a solution for using GPU-enabled nodes with ML solutions and ephemeral development environments for our data scientists.
  • Developed shared CircleCI tooling in the form of a CircleCI orb, used across all our build pipelines to automate actions such as git tagging and releasing, deployments, Docker image and artifact builds, etc.
  • This was a development-skewed role.
Infrastructure · Distributed Systems · OpenTelemetry · Platform Engineering · Kubernetes · Amazon Web Services (AWS) · +6

Dgraph Labs

Software Engineer, Site Reliability

Feb 2021 - Jun 2021 · 4 mos · Portland, Oregon, United States

  • Site reliability engineer on a distributed team; developed multi-cloud, cloud-native infrastructure to provide a managed solution for Dgraph's graph database.
  • Streamlined operations work with Golang, Terraform, and Ansible to automate tasks around configuration, CI/CD, and resource provisioning on bare-metal Rancher Kubernetes, AWS, and GCP.
  • Co-owner of metrics, alerting, and log aggregation pipeline, using Prometheus, Promtail, Thanos, and Loki to collect metrics and logs and forward them upstream to Grafana.
  • Primary owner of the bare-metal cluster resources. Automated system onboarding via Ansible, Terraform, and shell scripts to initialize system configurations, networking resources, storage solutions, Rancher Kubernetes cluster creation, and observability resources.
  • Worked alongside other SREs and Customer Success engineers to respond to production incidents and debug customer problems by participating in on-call rotation and authoring runbooks.
  • Maintained the instance of our developer-facing forums (https://discuss.dgraph.io/), which received tens of thousands of visits per month.
  • This was an operations-skewed role.
Infrastructure · Ansible · Kubernetes · Dgraph · Databases · Grafana · +5

Virtana

2 roles

Software Engineer, Site Reliability

Jul 2020 - Feb 2021 · 7 mos

  • Junior site reliability engineer on a distributed team, developed scalable AWS cloud infrastructure on a greenfield SaaS application.
  • Developed automation and services around reducing toil and operations work, including monitoring, alerting, instrumentation, and secrets management with Golang (primarily for services) and Python (for general scripting tasks).
  • Authored infrastructure-as-code with Terraform to deploy various microservices and infrastructure, ensuring that the application was scalable, self-healing, and resilient using modern serverless technologies.
  • Developed infrastructure abstractions for use in Jenkins using a Groovy-based shared library, abstracting complex workflows like automated releases, branching strategies, etc.
  • Worked to make DevOps culture a daily part of our engineering culture and introduced policies around on-call strategies, documentation, code conventions, logging, releases, etc.
  • This was a development-skewed role.
Infrastructure · Amazon Web Services (AWS) · System Monitoring · Amazon ECS

Software Engineer, Observability Integrations

Dec 2019 - Jul 2020 · 7 mos

  • Junior backend engineer developing bespoke Go-based observability microservices (deployed to physical devices in customer datacenters) which collected, batched, transformed, and forwarded time-series data from Solaris, Linux, and KVM (implementing the translator design pattern) using a schema provided at runtime by a (canonical) external service.
  • Developed corresponding Go simulator services which mocked the aforementioned operating systems' command outputs in order to enable faster iteration in both local development environments and CI.
  • Developed a Jenkins pipeline that automated unit and integration tests, Docker container builds, Git tagging and releases, and application packaging/signing.
  • Developed infrastructure abstractions for use in Jenkins using a Groovy-based shared library, creating simplified interfaces for complex workflows such as building Docker images from within ephemeral Docker build agents on persistent Jenkins worker nodes.
  • This was a development-skewed role.
Infrastructure · Golang · Docker · System Monitoring

Fiduciary Decisions

Software Engineer, Full Stack

Jun 2019 - Oct 2019 · 4 mos · Portland, Oregon Area

  • Junior full-stack developer working with technologies such as Vue.js, Angular, Postgres, Ruby, and Rails on a SaaS platform deployed to AWS.
  • Developed smaller-scoped features such as helping migrate individual components to Vue.js, database migration scripts, and integration tests.
  • Fixed rendering bugs present in an HTML to PDF export pipeline using SCSS and Ruby HTML templates.
  • Assisted SDETs with manual testing of the application prior to releases.

Lewis & Clark College

3 roles

Teacher’s Assistant for Computer and Network Security

Sep 2018 - Dec 2018 · 3 mos · Portland, Oregon Metropolitan Area

Digital Initiatives Assistant

May 2017 - May 2019 · 2 yrs · Portland, Oregon Metropolitan Area

  • Backend developer for Watzek’s Digital Initiatives team, primarily developing services deployed on AWS (EC2 instances, Lambda, S3) using Docker, Node.js, Python, MongoDB, and PostgreSQL.
  • Served as a system administrator for LC’s high performance computing infrastructure, working with students, faculty, and remote teams to implement and deploy scalable solutions to computationally intensive problems.

Teacher’s Assistant for CS 171

Jan 2017 - May 2017 · 4 mos · Portland, Oregon Metropolitan Area

CDK Global

Software Engineer Intern, Application Performance Management

Jun 2018 - Aug 2018 · 2 mos · Portland, Oregon

  • Full-stack developer on an internal-facing application performance management (APM) team, working on a project that used DevOps metric data to improve team development practices.
  • Developed various Node.js and Java Spring microservices to collect metrics from the Atlassian stack, AppDynamics, and other DevOps tools.
  • Implemented a front-end dashboard with React/Redux and D3.

Education

Lewis & Clark College

Bachelor of Arts (B.A.) — Computer Science

Jan 2015 - Jan 2019
