Senior DevOps Engineer (Databases)
Location
Madrid | Spain
Job description
Hi, we’re Nexthink. We’re not just the leader in the digital employee experience category, we invented the category. Our solutions combine real-time analytics, automation and employee feedback across all endpoints to help IT teams delight people at work. Our cloud-native platform pinpoints issues and solutions, automates response, and helps companies continuously improve their employees’ experience, making them more productive, efficient, and happy at work. We have millions of endpoints deployed, we’ve surpassed $100M in ARR, and we’ve recently secured $180M in Series D financing for a company valuation of $1.1B, but we’re just getting started.
#LI-Hybrid
Job Description
The data infrastructure and services play a critical role in Nexthink’s cloud-native multi-tenant products. It streams, ingests, aggregates, and processes events collected from millions of endpoints every second, and stores and serves these enriched data to all Nexthink services.
Nexthink is looking for passionate and innovative professionals that are keen to join our Data Infrastructure team, which is part of the Developer Experience group. The team ensures our data infrastructure and services are reliable, scalable, cost-effective, and aligned with the evolving product requirements. This position is focused on operating, scaling, and automating the database and streaming infrastructure, including our Kafka, Clickhouse, and Amazon Aurora clusters. The new Platform Engineer should contribute to the team with their previous experience in Software Engineering, Platform Engineering or Site Reliability Engineering and have particular interest in large-scale data-intensive distributed systems and pipelines.
Responsibilities :
- Monitoring and reliability. Use and own the specifications of our tooling set related to monitoring, telemetry, reliability, and automation to assess the health of the data pipeline based on Nexthink workloads.
- Operation. Manage the availability of the databases and streaming platforms which empower Nexthink data pipelines. Understand and be able to communicate the scale, capacity, security, redundancy and performance attributes and requirements.
- Incident management and response. Detect, diagnose and fix incidents finding solutions to achieve required Service Levels. Owner of the post-mortem process of such incidents by writing technical content both for customers and internal stakeholders.
- Work with architects, team leads and developers in activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Contribute to Nexthink tooling and automation framework for provisioning and scaling the infrastructure, with particular focus on resiliency and elasticity strategies.
- Have on-call responsibilities in rotation with the engineering team.
#LI-Hybrid
Qualifications
- Min 5 years of experience as a Software Engineer, DevOps Engineer, Platform Engineer or Site Reliability Engineer with knowledge of best professional software development practices.
- Experience with distributed systems and streaming technologies in general, and familiarity with Apache Kafka in particular.
- Experience with database cluster deployment (Amazon Aurora, Clickhouse, etc), configuration, backup/restore, and production operations.
- Experience operating services on Linux systems.
- Experience with monitoring solutions such as Datadog, Prometheus, Grafana and others.
- Experience administering and deploying on cloud-based platforms (Azure, AWS, Google and/or others), using infrastructure as code (Cloud Formation, Terraform, etc.), configuration management tools (Ansible, Puppet) and pipeline creation tools (like Jenkins, GitHub Actions, GitLab).
- At ease with operating and managing production systems, solving issues striking the right balance between urgency and methodology.
- Excellent written and verbal skills in English.
Great to have :
- Experience with AWS MSK.
- Experience working with Kubernetes and writing custom operators.
- Experience with Kafka in-depth configuration and performance optimization.
Additional Information
We are 900+ employees strong in 21 countries across 8 different time zones speaking 60+ languages. We are positive, we get things done, we keep growing, and we are one team, we are Nexthink. We believe actions are stronger than words when it comes to diversity, inclusion, and equity in the workplace. Nexthinkers are multinational and multilingual, and come from all walks of life. We are committed to hiring a genuinely representative workforce that can create solutions and foster innovation for the modern digital employee experience.
If you are looking for a change and like a nice atmosphere, lots of challenges, and having fun while working, this is a great opportunity for you!
- Permanent Contract and a competitive compensation package (Stock Options also included)
- Private Health Insurance (Sanitas) and monthly restaurant tickets (Edenred) will be entirely covered by us.
- ♀️ Up to 25 EUR per month for a gym subscription.
- Flexible retribution plan for kindergarten & transport tickets.
- ️ Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 23 days of holidays we offer).
- We reimburse up to 50% of the cost of English & Spanish classes.
- Amazing centrally located offices near the Bernabeu Stadium.
- Fresh fruit, cookies, and occasionally some soft drinks as well.
- Regular company and team events like Pizza talks, Team Building activities, Christmas parties, hosting Meetups at the office and more!
- We offer a relocation package to people who are coming from another country.
Job tags
Salary