Lead – Site Reliability Engineer is a hands-on role that requires a highly skilled technology professional with excellent communication skills, strategic mindset, and strong analytical and troubleshooting skills on AWS Cloud Platform. The role involves participating in end-to-end operational aspects of Production environment, working with internal business partners, and collaborating with Architects, DevOps, Product, and development teams to ensure uptime and meet customer SLA.
Requirements
- Skilled with cloud operations/administration in Amazon AWS.
- Tax/Accounting domain experience
- Bachelors or Master’s in Computer Science discipline.
- 5+ years’ experience focussed on Site Reliability Engineering or related position in AWS Cloud Platform.
- At least 2 AWS Certifications are must. (AWS Sysops Admin and Architects certifications preferred).
- Experience working with SQL, Windows Servers, Load balancers, Linux
- Deep experience with AWS, Docker and Kubernetes, CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS, Amazon FSX, Elastic Search and networking concepts are must.
- Program at a high level in at least one language such as: Java, C#, Javascript, Python or Ruby.
- Integration experience with PagerDuty, ServiceNow, Datadog, CloudWatch.
- Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation;
- Ability to explain technical concepts in clear, non-technical language
- Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
- Knowledge of security and compliance standards such as SOC/PCI is a plus
Benefits
- Hybrid Work Model: We’ve adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.
- Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset.
- Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow’s challenges and deliver real-world solutions.
- Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing.
- Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more.
- Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives.