We're looking for a Site Reliability Engineer - Incident Management to join our team. The ideal candidate will have experience in IT operations, Linux, and monitoring tools. You will be responsible for monitoring, maintaining, and managing our infrastructure, troubleshooting issues, and collaborating with other teams to resolve them faster. You will also handle incident management, documentation, and task automation.
Requirements
- One to two years of experience in IT operations (infrastructure, system administration, Linux), or equivalent experience or certification
- Knowledge of or familiarity with monitoring and integration tools such as Splunk, Prometheus, Grafana, Kibana, PagerDuty, and Runscope, plus Jira or ServiceNow for incident management
- Good experience (or familiarity) with the main ITSM functions and the tools that support them
- Strong interpersonal skills and the ability to interact professionally with employees at all levels
- Certifications are highly recommended, along with a strong knowledge of computer functionality. Any technical certification in Linux, system administration, VMware, or IT security, or a certification in ITSM/ITIL, will be an added advantage.
- Basic knowledge of DevOps/SRE, Python, and cloud is also good to have
Benefits
- No mention of benefits in the job description