The Lead Site Reliability Engineer will mentor other team members on core SRE principles and tools, participate in the end-to-end operation of the production environment, and work on cloud systems, networks, and databases.
Requirements
- Bachelor's or Master's degree in a Computer Science discipline.
- 5+ years of experience in Site Reliability Engineering or a related position on the AWS cloud platform.
- At least 2 AWS certifications (AWS SysOps Administrator and Solutions Architect certifications preferred).
- Deep experience with AWS, Docker, Kubernetes, CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS, Amazon FSx, Elasticsearch, and networking concepts.
- Ability to program at a high level in at least one language such as Java, C#, JavaScript, Python, or Ruby.
- Integration experience with PagerDuty, ServiceNow, Datadog, and CloudWatch.
- Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms, and tools, including SLO management, incident resolution, and automation.
- Ability to explain technical concepts in clear, non-technical language.
- Working knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage, and networks).
- Knowledge of security and compliance standards such as SOC/PCI is a plus.
Benefits
- Hybrid Work Model: Flexible hybrid working environment (2-3 days a week in the office depending on the role).
- Flexibility & Work-Life Balance: Flexible work arrangements, including work from anywhere for up to 8 weeks per year.
- Career Development and Growth: Continuous learning and skill development, Grow My Way programming, and skills-first approach.
- Industry Competitive Benefits: Flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing.
- Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more.
- Social Impact: Two paid volunteer days off annually, plus opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives.