Description
The Client Site Reliability Engineer role in Infrastructure, Client Services will be responsible for enabling and supporting our clients to deliver a best-in-class, cloud-native implementation of Thought Machine Vault products on client-hosted or Thought Machine-hosted infrastructure, from presales to production at scale. This role supports clients with cloud infrastructure preparation, deployment, optimisation and troubleshooting.
- Hands-on cloud infrastructure consulting, both on client sites and remotely
- Working with customers and external partners to design and prepare suitable cloud infrastructure so that Thought Machine Vault products can be tested and run successfully at scale. This includes planning for high availability, disaster recovery, backup, redundancy, capacity and security
- Deploying and configuring Thought Machine Vault products on client, SaaS and internal cloud infrastructure
- Developing a deep understanding of, and advising clients on, the optimisation of cloud infrastructure for their overall Vault implementation, including advising on systems outside of Vault to support holistic digital transformation, in collaboration with Thought Machine Client Architects
- Supporting and troubleshooting client, SaaS and internal cloud infrastructure, both remotely and on site, including by promoting and deploying suitable monitoring, logging and alerting tools
- Working closely with internal product and engineering teams to ensure client feedback is incorporated into improvements to the product and platform
- Supporting the Thought Machine Commercial Team and Cloud Provider Partners in answering infrastructure queries and challenges in the presales cycle
Requirements
Ability to explain technical concepts to technical and non-technical stakeholders.
Hands-on experience of some or all of the following:
Linux/Unix administration (e.g. Ubuntu, Debian) and related technologies, e.g. Kafka, PostgreSQL, Kubernetes, Istio
Experience with automation/configuration management, e.g. Terraform
Experience with AWS or GCP
Experience with, and an associated certification for, at least one of the following:
AWS (ideally certified Solutions Architect Professional)
GCP (ideally certified Professional Cloud Architect)
Azure (ideally certified Solutions Architect Expert)
Experience of enterprise secrets management systems, e.g. HashiCorp Vault, AWS Secrets Manager
Experience supporting high-profile, mission-critical production systems.
Experience with hybrid cloud technologies including OpenShift, Google Anthos, AWS EKS Anywhere, AWS Outposts
A strong background in Go, Python or Java
Experience with Postgres
Experience with observability tools, e.g. Prometheus, Grafana
Benefits
- Highly competitive salary
- Pension plan (match up to 5%)
- Life insurance - three times annual salary
- Competitive maternity (six months fully paid) and paternity leave (four weeks fully paid)
- Shared parental leave (matched to our maternity leave for the same point in time)
- 25 days holiday and bank holidays
- Flexible working hours
- Cycle-to-work scheme
- Electric car scheme
- Season ticket loan
- Access to outstanding learning materials and courses
- Sports and hobby clubs, subsidised by Thought Machine
- All the latest tech you need
- Start the day properly with fresh fruit and cereals
- Huge range of healthy (and not-so-healthy) snacks, smoothies and drinks
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress
- Two charity days a year
- Weekly food pop-up
We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn't exactly match the job description. We also encourage applications from those with different abilities, including candidates with ADHD, autism, dyslexia or dyspraxia.