DevOps Engineer

  • Department:Engineering
  • Location:London, UK
  • Type:Full Time

Job Description:

In this role, you will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure. The ideal candidate has a strong background in software development, system administration, and a deep understanding of modern DevOps practices. You should be passionate about automation, monitoring, and incident response, and have a commitment to building and maintaining robust and efficient systems.

Responsibilities:

  • Design, develop, and maintain scalable and reliable infrastructure solutions that support our applications and services.
  • Implement monitoring and alerting systems to ensure early detection of issues and automate incident response.
  • Collaborate with cross-functional teams, including software engineers, product managers, and business stakeholders, to define system requirements and deliverables.
  • Develop and implement automation strategies to minimize manual intervention and improve system reliability.
  • Continuously analyze and optimize system performance, identifying bottlenecks and implementing improvements.
  • Participate in on-call rotations to ensure timely resolution of incidents and maintain system availability.
  • Stay current with the latest trends and technologies in the SRE field, continuously improving your skills and contributing to the team's growth.
  • Mentor and guide junior engineers, fostering a collaborative team environment and promoting a culture of continuous learning and improvement.

Requirements:

  • Bachelor's degree in computer science, engineering, or a related field, or equivalent experience.
  • Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role, with a strong track record of maintaining highly available systems.
  • Strong understanding of software development, system administration, and modern DevOps practices.
  • Experience with infrastructure automation tools, such as Terraform, Ansible, or Chef.
  • Proficient in using containerization and orchestration technologies, such as Docker and Kubernetes.
  • Experience with monitoring and alerting tools, such as Prometheus, Grafana, or ELK Stack.
  • Excellent problem-solving skills, attention to detail, and the ability to work independently and in a team environment.
  • Strong communication and interpersonal skills, with the ability to work effectively in a collaborative environment.

Nice to Have:

  • Experience with cloud platforms, such as AWS, Azure, or GCP.
  • Knowledge of programming languages, such as Python, Go, or Ruby.
  • Experience with database technologies, such as SQL, NoSQL, or caching systems.

This position is posted on Rachael Recruiting on behalf of another company. To find out more, please apply now to register your interest in the position.

Apply now