Daniel Serrão

SRE (Site Reliability Engineer)

Funchal, Madeira Island, Portugal · 10 yrs 7 mos experience
Most Likely To Switch: Highly Stable

Key Highlights

  • Over 10 years of experience in IT and engineering roles.
  • Expert in building scalable and reliable platforms.
  • Proven track record in mentoring and leading engineering teams.
Stackforce AI infers this person is a DevOps and Site Reliability Engineering expert in the SaaS industry.

Skills

Core Skills

Site Reliability Engineering · Kubernetes · DevOps · Continuous Integration and Continuous Delivery (CI/CD) · Infrastructure as Code

Other Skills

.NET · Amazon Web Services (AWS) · Angular2 · Ansible · Bash · C# · CI/CD · CSS · Cloud Applications · Cloud Computing · Crossplane · Docker · Fluentd · Git · GitOps

About

IT professional with 10+ years of experience across Site Reliability Engineering, DevOps, Platform Engineering and Software Engineering. Skilled in building scalable, secure, and reliable platforms with Kubernetes, Terraform, observability stacks (Prometheus, Thanos, Loki, Grafana, Alloy), and infrastructure tooling such as Helm and Kustomize — while adaptable to a wide range of technologies. I design and implement end-to-end architecture solutions, mentor engineers, and drive best practices to improve reliability and efficiency. Passionate about bridging strategy with execution and enabling teams to deliver at scale.

Experience

Total Experience: 10 yrs 7 mos
Average Tenure: 2 yrs 7 mos
Current Experience: 4 yrs 7 mos

Vitrifi

Senior Site Reliability Engineer

Nov 2021 - Present · 4 yrs 7 mos · Funchal, Madeira Island, Portugal · Remote

  • Led the design and implementation of an observability platform for all Kubernetes clusters and associated services, enabling insightful dashboards built on metrics and logs. This significantly improved our ability to identify and troubleshoot issues proactively.
  • Mentored junior and mid-level engineers on Kubernetes, Terraform, and SRE practices.
  • Reduced Kubernetes cluster operating costs by ~20% by monitoring and tuning pod CPU and memory requests and limits.
  • Orchestrated the setup of proactive alerting mechanisms, providing timely notifications for potential service disruptions and enabling swift response to mitigate risks and maintain uninterrupted operations.
  • Changed the infrastructure running in AWS to be compliant with AWS Security Hub.
  • Helped set up a modern GitOps framework across the whole organization.
  • Spearheaded the development of robust pipelines for automated testing and validation of clusters and application configurations before deployment, ensuring the reliability and stability of our infrastructure.
Promtail · Network Security · Go (Programming Language) · Kubernetes · Crossplane · Linux · +19

Penguin formula

4 roles

Senior DevOps Engineer (client Harlem Next)

Feb 2020 - Nov 2021 · 1 yr 9 mos

  • At Harlem Next, senior and junior developers join forces to build high-traffic platforms in a highly dynamic environment.
  • I was mostly responsible for:
    • Designing and implementing CI/CD flows.
    • Automating the deployment and update of infrastructure and many applications across multiple environments.
    • Implementing infrastructure monitoring.
    • Packaging applications in Docker containers.
    • Creating documentation for all of the above.
  • The above allowed:
    • The development teams to deploy and configure new production environments in less than one day, where it initially took around one week because of the number of manual tasks.
    • Fewer bugs to reach production, thanks to the automated tests.
    • New members to set up their local environments more easily and become productive sooner, thanks to the containerized applications and available documentation.
Kubernetes · Site Reliability Engineering · Helm Charts · Terraform · GitLab · Continuous Integration and Continuous Delivery (CI/CD) · +4

Senior DevOps Engineer (client ASML through Itility B.V.)

Promoted

Apr 2019 - Jan 2020 · 9 mos

  • Worked on a project called Datacenter Automation (DCA), where we reduced the delivery time of new servers from 1 week to 30 minutes by automating server deployments using Puppet, Ansible, Terraform, InSpec, RSpec, Rundeck and Jenkins. The server types we automated included web servers, sandboxes, proxies, and plain base RHEL and Windows servers.
  • I spent most of my time automating these server deployments and configurations, researching possible solutions to complex technical problems, and discussing requirements and the best solutions with colleagues and clients.
  • I also mentored new colleagues on the project so that they could start producing good work faster, and temporarily took on the Scrum Master role.
Terraform · Continuous Integration and Continuous Delivery (CI/CD) · Infrastructure as Code · DevOps

DevOps Engineer (client ASML through Itility B.V.)

Jan 2018 - Mar 2019 · 1 yr 2 mos

  • Completed a project that lets Splunk developers write a configuration once on a single server and have it applied automatically to more than 100 Splunk instances, without needing to access each instance.
  • Worked on a project called Datacenter Automation (DCA), where we reduced the delivery time of new servers from 1 week to 30 minutes by automating server deployments using Puppet, Ansible, Terraform, InSpec, RSpec, Rundeck and Jenkins. The server types we automated included web servers, sandboxes, proxies, and plain base RHEL and Windows servers.
Terraform · Continuous Integration and Continuous Delivery (CI/CD) · Infrastructure as Code · DevOps · Security

Software/DevOps Engineer (client Itility B.V.)

Sep 2017 - Jan 2018 · 4 mos

  • API of the Itility Cloud Control (ICC), which provides services for managing infrastructure such as servers, environments, applications and users.
  • Improvement of an alerting flow that checks the state of internal and client infrastructure. For example, when a server's disk is filling up, the DevOps team receives an alert in the VictorOps user interface, allowing them to act before the problem occurs.
Infrastructure as Code · DevOps

Syone

2 roles

Software Engineer (client GodtLevert)

May 2017 - Aug 2017 · 3 mos · Lisboa, Lisbon, Portugal

  • Responsible for the development of an e-commerce platform (GodtLevert). Activities included:
    • SQL Server development and integration of web applications with the database using Entity Framework.
    • API development and integration with other modules using AJAX requests (jQuery).
    • Improving the performance of existing web application functionality.
    • Functional testing.
    • Continuous integration and delivery with Git and Jenkins.
    • Layout corrections using ASP.NET MVC 5, HTML, CSS and Kendo UI.
    • Functional fixes using JavaScript and jQuery.

Software Engineer (client Brandbassador)

Aug 2016 - Apr 2017 · 8 mos · Lisboa, Lisbon, Portugal

  • Responsible for backend and frontend development of a web and mobile social commerce platform (Brandbassador). Activities included:
    • Database development and query writing in NoSQL (Couchbase).
    • Microservices development using Node.js (JavaScript).
    • Integration of microservices with the NoSQL databases and with social networks including Facebook, Twitter and Instagram.
    • Layout development: dashboard screens for financial and performance indicators.
    • HTML, CSS, Angular2 and JavaScript development.
    • Mobile development using Ionic.
    • Functional testing.

Roox

Junior Software Engineer

Jun 2015 - Jul 2016 · 1 yr 1 mo · Lisboa, Lisbon, Portugal

  • During this experience, I was responsible for the following activities:
    • Development and maintenance of a healthcare management application for desktop and web using VB.NET, Windows Forms, SQL Server and Crystal Reports.
    • Implementation on site at clients (clinics).
    • Customer support.
    • Development of new features for a web application that stores and manages information sent and received by beacons, allowing customers to personalize messages and decide when and where their clients receive them. Used ASP.NET MVC, C#, HTML, CSS, AngularJS and SQL Server.

Agroop

Junior Software Engineer (Freelancer)

Mar 2015 - May 2015 · 2 mos · Oeiras, Portugal

  • During this experience, I was responsible for the backend development of an agriculture management application using Java, IntelliJ, MySQL and JSON for HTTP requests.

Education

Instituto Superior Técnico

Master’s Degree — Computer Engineering

Jan 2014 - Jan 2016
