Attain Finance is seeking a strategic and people-first Site Reliability Engineer Manager to lead their Site Reliability Engineering team. This role involves building scalable, resilient, and high-performing infrastructure, ensuring reliable systems while empowering engineers. The position requires a blend of technical rigor and empathetic leadership, focused on incident management, root cause remediation, and fostering a culture of trust and continuous improvement.
Requirements
- 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles
- 2+ years of experience managing technical teams with a focus on mentorship and collaboration
- Strong proficiency in observability tools (Grafana, Thousand Eyes), and alerting systems (Go Alert)
- Experience configuring storage alarms in Grafana and CloudWatch
- Deep understanding of distributed systems, service-level objectives, and incident response frameworks
- Demonstrated familiarity with Python, SQL, and PowerShell
- Familiarity with ITSM platforms (Ivanti, ServiceNow) and device management tools (Intune, JAMF)
- Proven ability to drive cross-functional initiatives and foster a culture of reliability and ownership
Benefits
- Medical
- Dental
- Vision
- Life Insurance
- Disability
- 401k program
- Flexible Paid Time Off