The Manager, Service Reliability Engineering will lead the team responsible for ensuring the reliability of Navitaire's products and services. The role will work closely with engineers, architects, and product owners to design and implement efficient and scalable systems. The ideal candidate will have technical knowledge of DevOps, Linux, and Windows operating systems, as well as experience with cloud solutions, automation, and monitoring tools.
Requirements
- Bachelor's or graduate degree in engineering
- Working knowledge of the Linux and Windows operating systems
- Experience with SOP, SU, SLO, Automation, Capacity Management, Operational Improvement, Operational Readiness Testing
- Ability to technically troubleshoot cloud solutions, analyzing technical problems within the application, server and operating systems logs to identify the root cause and resolving the issue creating an impact to system's availability in production
- Experience supporting monitoring, alerting, or pipeline analysis tool while optimizing the current configuration of those monitoring tools and technically maintaining their availability
- General networking knowledge
- Knowledge and practical exposure to IT and Cloud operations, ideally in mission-critical environments
- Knowledge of standard automation tools and scripting: Terraform, FLUX
- Experience in implementing measurements and alerting in complex environments using standard tools like Splunk Grafana, Prometheus, Argos, ServiceNow
- Knowledge of Kubernetes, OpenShift, and Azure AKS is a plus
Benefits
- Competitive remuneration
- Individual and company annual bonus
- Vacation and holiday paid time off
- Health insurances
- Professional development opportunities
- Diverse and inclusive workplace
- Global opportunities to learn and grow