Site Reliability Engineer
Portugal - Lisbon
Posted: 06/04/2022
Salary: £70K to £90K per year
ID: 24696_BH
Site Reliability Engineer
Remote
Your Position
Join a leading international technology consultancy! Our client specialises in software development, data analytics, machine learning and cloud development. We are trusted cloud partners across Google, Amazon and Microsoft.
We use a combination of these technologies to help businesses improve efficiency, productivity and decision making. We encourage an innovative and dynamic workplace committed to actively supporting our employees in their growth.
Your Responsibilities
As a Site Reliability Engineer you will work on improving the reliability and stability of production-level components. This involves setting up alerting and monitoring for solutions, performing risk analysis to ensure uptime targets are met, and automating infrastructure deployments through the various stages of the development pipeline to ensure continuous delivery.
There is an opportunity to grow your skill set in interesting technologies like Kubernetes, Prometheus, Hadoop, Trifacta, Grafana and more. With a focus on DevOps and SysOps, you will get the opportunity to grow your career with like-minded team members all working towards the same goal of constantly improving the platform.
Your background might be in SysOps / Systems Administration / Network Administration / IT Infrastructure / Operations Support / Infrastructure Engineering / Support Engineering, or you’re starting to explore the DevOps environment.
Core Tasks
● Collaborate with other SREs and other technology functions to deliver secure, reliable, robust, scalable solutions that can be built, tested and deployed through the Route to Live and into production using continuous integration / deployment.
● Allocate your team’s workload and manage the expectations of key stakeholders.
● Identify and implement DevOps/SysOps engineering best practices in conjunction with your peers.
● Visualize metrics through dashboards hosted in Grafana.
● Ensure platform uptime targets are met by using monitoring tools to automatically surface valuable alerts.
● Ensure the use of continuous delivery pipelines and tools to fully automate deployment.
● Troubleshoot and take ownership of issues in our production environments, including performance optimization and continuous tuning.
● Continuously learn about and evaluate the latest approaches, tools, and technologies.
The Ideal Candidate
● An independent thinker, not afraid to think outside the box and to challenge preconceived ideas.
● A self-starter with the discipline to take ownership of critical areas for continuous improvement.
● A passionate advocate of continuous deployment.
● Ability to quickly learn and apply emerging techniques, frameworks, and platforms.
● Working experience with Docker and/or Kubernetes is an advantage.
● Good communication and collaboration skills.
Technical Skills
● UNIX / Linux background
● Experience with or understanding of configuration management tooling (Chef, Ansible, Puppet)
● Experience in container management technologies (Kubernetes or any other)
● Knowledge of Infrastructure as Code
● Basic scripting skills (bash/sh/ksh/Perl/Python)
● Experience working with or following runbooks
● Experience in one or more popular CI platforms (e.g. GitHub Actions, Jenkins, Bamboo, or Travis)
● SysOps
● Understanding of infrastructure deployment and templating (Puppet / Chef / Ansible / Terraform)
● Good infrastructure principles
● Experience in advanced monitoring models (Prometheus an advantage)
● Knowledge of continuous integration and automated testing
● Visualization experience an advantage (ELK, Grafana, Splunk)
● RHCSA / RHCE certification an advantage but not a requirement