Two-Up Digital is a vibrant full-service digital design and development agency. We specialise in the betting & gaming and entertainment industries. Our two founders, Robbie Morris and Martyn Miller, have amassed more than 30 years experience in the sector working with or for many household names such as Ladbrokes, Coral, William Hill, Betfair, Racing Post and ITV. Based in Shoreditch, the creative heartbeat of London, we nurture talent and value passionate, ambitious people. Openness and trust in a collegiate and flexible way of working is a cornerstone of our business but the bottom line is we never miss deadlines. We believe success stems from understanding people and their objectives, utilising our experience and knowledge to challenge views to ensure the finest deliverable.
Salary - up to 18000 PLN
We are looking for a passionate Site Reliability Engineer to join our platform team.
- Maintenence and development of product infrastructure strategy
- Continuous Performance Management. Measuring performance and working with developers to improve it
- Detect and resolve security, performance and availability issues to ensure maximum uptime and performance
- Investigate, evaluate and recommend new tools and technologies for faster fault finding
- Administration of web servers, Load Balancing (haproxy, nginx)
- Network and Linux virtual machines administration
- Containerisation / packaging. Distributed component integration/troubleshooting (Docker, Kubernetes)
- Log aggregation (Kibana / Graylog / Elasticsearch)
- Monitoring (Prometheus, Grafana)
- Troubleshooting of various issues in cloud environment in a measured, methodical way, often under pressure
- 2+ years of experience in SRE, DevOps/Ops role, administration of production software environment,
- strong practical knowledge of Linux/Unix, networking/administration
- practical knowledge of HTTP protocol (cache, debugging, monitoring)
- good understanding of cloud computing paradigm (distributed logging, service discovery, stateless applications, scaling, HA)
- understanding of Infrastructure as a Code paradigm
- experience in troubleshooting issues in distributed systems
- practical knowledge of git SCM and good understanding of git flow concepts
- hands-on experience of writing scripts with at least one of scripting language
- incentive to propose improvements for the development teams (software design patterns, best practices, code styles)
- experience with real-life deployments to any of the major cloud providers (i.e. AWS, Google Cloud)
- understanding of Kubernetes
- experience with developing in-house IaaS/PaaS solution
- work experience with configuration management tools such as Ansible, Chef
- GCP understanding
- experience in building pipelines for automating of applications scaffolding, testing, building, auto-scaling and integration
- experience in work with developers on middleware/frontend tier
- experience in infrastructure provisioning with Terraform