Senior Site Reliability Engineer
Location
Us, 50250 | France
Job description
We are looking for a Senior Site Reliability Engineer to join our team and develop software systems and automated solutions for operational aspects in an organisation.
Your responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems can experience. Ultimately you will work with our IT teams to ensure our organisation can contribute to delivering products and services in our computer system environment.
What is expected from me on a daytoday basis if I join Knowmax
- Gather and analyse metrics from operating systems as well as applications to assist in performance tuning and fault finding
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting platform management and capacity planning
- Create sustainable systems and services through automation and uplifts
- Balance feature development speed and reliability with welldefined servicelevel objectives
- Run the production environment by monitoring availability and taking a holistic view of system health
- Build software and systems to manage platform infrastructure and applications
- Improve reliability quality and timetomarket of our suite of software solutions
- Measure and optimize system performance with an eye toward pushing our capabilities forward getting ahead of customer needs and innovating for continual improvement
- Provide primary operational support and engineering for multiple largescale distributed software applications
What are the prerequisites and skill sets required to apply for this role
- Bachelor s degree (or equivalent) in computer science or related discipline
- Ability to program (structured and OOP) using one or more highlevel languages such as Python Java C/C Ruby and JavaScript
- Experience with distributed storage technologies such as NFS HDFS Ceph and Amazon S3 as well as dynamic resource management frameworks (Apache Mesos Kubernetes Yarn)
- Proactive approach to identifying problems performance bottlenecks and areas for improvement
apache,apache mesos,reliability,kubernetes,yarn
Job tags
Salary