SRE Cloud Operations Engineer

Nexthink

Nexthink

Operations
Madrid, Spain
Posted on Oct 2, 2024

Company Description

Nexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any application or network, before employees notice the issue. As the first solution to allow IT to progress from reactive problem solving to proactive optimization, Nexthink enables its more than 1,200 customers to provide better digital experiences to more than 15 million employees. Dual headquartered in Lausanne, Switzerland and Boston, Massachusetts, Nexthink has 9 offices worldwide.

#LI-Hybrid

Job Description

Join Nexthink's vibrant Madrid team as a Site Reliability Engineer, where cutting-edge technology meets innovation. If you are a highly skilled professional with a flair for AWS (Amazon Web Services) and a profound grasp of GitOps and DevOps practices, we invite you to explore a role like no other. With a core focus on Kubernetes (k8s), you will orchestrate the deployment of our cloud infrastructure and services, utilizing Nexthink's trailblazing technology.

Your expertise will be essential in maintaining our services' high availability, performance, and efficiency. Be a part of Nexthink's technological revolution, ensuring our global customers enjoy a seamless user experience. Embrace the future with Nexthink in Madrid; apply now and become a key player in our dynamic SRE organization.

*Please note that this role is hybrid (2 days per week in the office)

Responsibilities:

  • Manage and maintain our Kubernetes clusters, including deployment, configuration, and upgrades. Ensure the stability and scalability of the clusters to accommodate increasing demands
  • Utilize your hands-on knowledge to automate routine tasks and streamline operations. Implement infrastructure as code (IaC) practices to facilitate rapid and reliable deployments, ensuring efficient resource provisioning and management
  • Participate in an on-call rotation, providing prompt responses and resolution to critical incidents. Your commitment to keeping the cloud infrastructure up and running will be crucial to maintaining high availability
  • Proactively identify potential issues and troubleshoot system anomalies. Collaborate with other teams to address incidents and implement preventive measures to reduce downtime
  • Set up and maintain comprehensive monitoring and alerting systems to detect anomalies, capacity constraints, and potential performance bottlenecks. Ensure timely responses to alerts and alarms
  • Continuously assess the performance of our cloud infrastructure and applications. Implement optimizations to enhance system efficiency and reduce response times
  • Maintain accurate and up-to-date documentation of processes, procedures, and troubleshooting guides to facilitate knowledge sharing and standardization

Qualifications

  • Strong hands-on experience in managing Kubernetes clusters in a production environment
  • Knowledge in config automation (Ansible), CI/CD (Jenkins), IaC (Terraform, Crossplane) for infrastructure management. Also proficient in at least one scripting language (bash, python)
  • Familiar with source code management solutions (GitHub, Bitbucket) and the Atlassian suite (JIRA, Confluence)
  • Experience working in an on-call rotation environment and running operations
  • Proven problem-solving skills and the ability to troubleshoot complex technical issues
  • Deep commitment to maintaining high system reliability and availability
  • Familiarity with AWS cloud computing platform and related services
  • Excellent communication and collaboration skills to work effectively with cross-functional teams
  • Excellent communication english skills.

Additional Information

We are the pioneers and trailblazers of a global IT Market Category (DEX) that is shaping the future of how the world works, giving our customers’ IT Teams total digital visibility across their enterprise. Our innovative solutions integrate real-time analytics, automation, and employee feedback across all endpoints. This enables our IT teams to solve complex technical challenges, create ever more productive workplaces, and deliver happy, satisfied employees in the digital workplace.

With over 1000 employees across 5 continents, Nexthink operates as One Team, connecting, collaborating and innovating to continuously grow. We call our employees ‘Nexthinkers’ and our commitment to diversity, inclusion, and equity is second to none. We currently have over 75 nationalities working with us, from all cultures and backgrounds, speaking many different languages.

If you are looking for a change and like a nice atmosphere, lots of challenges, and having fun while working, this is a great opportunity for you! Check what we offer:

  • 💼 Permanent Contract and a competitive compensation package (including stock options).
  • 📍 Very good location next to the Prilly-Malley train station.
  • 🏡 Hybrid work model balancing office and remote work, with a structured approach for new hires to foster connections and onboarding.
  • 🏖️ Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 25 days of holidays we offer) plus 3 company-paid volunteer days.
  • 🤸 Free access to a fitness centre inside the building.
  • 🚞 Reimbursement of the half-fare travel card for public transport.
  • 🧑‍🏫 Reimbursement up to 50% of the cost of French classes.
  • 🍉 Fresh fruit, cookies, and soft drinks as well.
  • 🤝 Regular company and team events like Volunteering Days, Pizza talks, Team Building activities, hosting Meetups at the office and more!
  • 📣 Bonuses for referring successful hires after three months of continuous employment.
  • 🚚 We offer a relocation package to people who are coming from another country.

Please note that not all the benefits listed above are available for temporary, contract, and internship roles. To ensure you have the most up-to-date information, we recommend checking with your Recruitment Partner.

Please note that not all the benefits listed above are available for temporary, contract, and internship roles. To ensure you have the most up-to-date information, we recommend checking with your Recruitment Partner.