Lead Site Reliability Engineer role at Thomson Reuters, requiring 5+ years of experience in Site Reliability Engineering or related position on AWS Cloud Platform. Key responsibilities include hands-on engineering, mentorship, incident lifecycle management, and automation. Must have deep experience with AWS, Docker, Kubernetes, and related technologies.
Requirements
- Hands-on experience with cloud operations/administration in Amazon AWS
- 5+ years of experience in Site Reliability Engineering or related position on AWS Cloud Platform
- At least 2 AWS Certifications
- Experience working with SQL, Windows Servers, Load balancers, Linux
- Deep experience with AWS, Docker, Kubernetes, CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS, Amazon FSX, Elastic Search and networking concepts
- Program at a high level in at least one language such as: Java, C#, Javascript, Python or Ruby
- Integration experience with PagerDuty, ServiceNow, Datadog, CloudWatch
- Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation;
- Ability to explain technical concepts in clear, non-technical language
- Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
- Knowledge of security and compliance standards such as SOC/PCI is a plus
Benefits
- Flexible vacation
- Two company-wide Mental Health Days off
- Access to the Headspace app
- Retirement savings
- Tuition reimbursement
- Employee incentive programs
- Resources for mental, physical, and financial wellbeing