Site Reliability Engineer
Location
São Paulo, SP | Brazil
Job description
About you
You are someone who wants to influence your own development. You're looking for a company where you have the opportunity to pursue your interests and grow professionally.
You bring to Applaudo the following competencies:
- Bachelor's degree in Computer Science, Computer Engineering or a related field, or equivalent experience (desirable).
- 5+ years of experience as a software developer, including experience taking a leading role in significant technical projects.
- 2+ years of experience as a Site Reliability Engineer or in a similar role.
- Experience building complex web applications with modern languages and frameworks.
- Comfort building on top of relational databases and fluency in querying them with SQL.
- Experience making data-driven decisions.
- Expertise in Ruby on Rails and proficiency with Python.
- Scaling and ensuring reliability of large SaaS applications.
- SaaS application architecture (Amazon Web Services, Kubernetes and Docker).
- Cloud application monitoring, logging and telemetry (Datadog, Sumo Logic, OpenTelemetry, CloudWatch).
- Downtime SLO management and incident monitoring.
- Query optimization and database administration (MySQL, Redshift).
- Automated software testing and continuous integration (Travis CI, CircleCI, GitHub Actions, Cloud Build).
- Security frameworks such as SOC2, NIST and FedRAMP.
- Infrastructure as Code (IaC) tools (Pulumi, Ansible, CloudFormation).
- Love of problem-solving, ability and eagerness to constantly learn and teach others.
- Advanced English proficiency, as you'll be communicating directly with US clients.
You will be accountable for the following responsibilities:
- Automate infrastructure through code with tools such as Terraform, Ansible, Puppet, Chef, etc.
- Create processes and tools that streamline operational workflows and increase their automation.
- Handle, maintain and support all on-premises and cloud infrastructure.
- Document the current infrastructure in network, topology and service diagrams for each environment (Development, QA, Production).
- Define SLIs, SLOs and SLAs for all supported infrastructure.
- Continuously improve the reliability and management of Kubernetes-based clusters.
- Support the CI/CD lifecycle.
- Research and evaluate new technologies for their potential to deliver cost savings, simpler implementation and improved availability.
- Define internal physical IT infrastructure and work alongside service providers and vendors to implement new solutions or upgrades as required.
Additional Information
Here at Applaudo Studios, values such as trust, communication, respect, excellence and teamwork are our keys to success. We know we are working with the best, and so we treat each other with respect and admiration without being asked.
Submit your application today, and don't miss this opportunity to join the Best Digital team in the Region!
We truly appreciate all the hard and outstanding work our team does every day at Applaudo Studios, and that's why the perks we offer are carefully thought out and designed as a way to thank them for their commitment and excellence.
Some of our perks and benefits:
- Work from home
- Flexible schedule
- Celebrations
- Special discounts
- Entertainment area
- Flexible work spaces
- Great work environment
- Private medical insurance
*Benefits may vary according to your location and/or availability. Request further information when applying.