As a Platform Observability Lead, you will play a key role in ensuring that Interac’s applications, infrastructure, and services are robust and that their current state is visible and available to stakeholders for troubleshooting, performance analysis, capacity planning, and reporting. You will provide the vision and expertise needed to implement and administer the enterprise solutions that enable our support teams, developers, and system administrators to detect and remediate incidents efficiently as they arise and to address issues proactively before they become incidents.
Requirements
- Provide the overall vision and roadmap for Interac’s core Observability functions and capabilities
- Ensure the successful implementation and maintenance of infrastructure and configurations for Splunk, Dynatrace, and ServiceNow integrations
- Develop and maintain automation for provisioning, management, and maintenance of Observability in our enterprise pipelines
- Oversee onboarding of new data sources and users in Splunk and Dynatrace
- Manage Splunk Knowledge Objects (e.g., fields, extractions, tags, event types, lookups, workflow actions, aliases, and macros)
- Model data to enable normalization across a variety of data sources
- Develop and support the writing of Splunk and Dynatrace queries for alerts, dashboards, and reporting.
- Provide oversight and direction in the development of Customer Journey dashboards and reporting to drive enhanced availability
- Optimize the Observability suite to monitor applications and infrastructure
- Track new releases of monitoring solutions and ensure that patches and upgrades are deployed regularly
- Build advanced visualizations in Splunk and Dynatrace that enhance our ability to identify and respond to issues
- Partnering closely with our key vendors for service enablement, troubleshooting, coordinating maintenance windows, etc.
- Creating and maintaining operational process documentation for monitoring solutions.
- Develop observability for application flows in a containerized/microservice environment.
- Manage, maintain, and ensure the performance of all monitoring systems, including data retention, capacity management, and performance analytics
- Work with external business partners and customers to ensure that end-to-end service mapping and views are created to drive improved availability across the entire ecosystem
Benefits
- Connection: You’re surrounded by talented people every day who are driven by their passion for a common goal.
- Core Values: They define us. Living them helps us be the best at what we do.
- Compensation & Benefits: Pay is driven by individual and corporate performance and we provide a multitude of benefits and perks.
- Education: To ensure you are the best at what you do, we invest in you.