Coupang

Staff Reliability Engineer

Staff Reliability Engineer at Coupang, Bengaluru. Drive observability strategy, design monitoring platforms, lead incident management integration. 15+ yrs required. 4-day week, visa sponsorship.

Direct Hire
Expert/Leadership
ServiceNow Role Type:
Implementer
ServiceNow Modules:
DevOps
IT Service Management
Incident Management
Security Operations

Job description

Posted on: December 4, 2025

Coupang is looking for a Staff Reliability Engineer to ensure stable IT services by operating monitoring systems and processes for IT infrastructure and applications. The role involves defining and driving observability strategy, leading the design and implementation of observability platforms, and conducting gap assessments in existing monitoring setups.

Responsibilities

  • Define and drive the observability strategy and roadmap
  • Establish a mature observability framework
  • Advocate for observability best practices across engineering, operations, and product teams
  • Lead the design, implementation, and optimization of observability platforms
  • Evaluate and onboard new tools and technologies
  • Ensure scalable and resilient monitoring architectures
  • Conduct gap assessments in existing monitoring setups
  • Implement automated solutions for quick-win improvements
  • Continuously refine monitoring configurations
  • Build and maintain end-to-end visibility across infrastructure, network, applications, and user journeys
  • Integrate observability tools with incident management, ticketing, and reporting systems
  • Develop and enforce tagging strategies, metrics standards, and log enrichment practices
  • Partner with DevOps, SRE, and application teams
  • Provide technical guidance and training to teams
  • Support incident response and post-mortem analysis
  • Leverage observability data to generate actionable insights
  • Create dashboards and reports that provide meaningful visibility to stakeholders

Benefits

  • Generous Paid Time Off
  • 401k Matching
  • Retirement Plan
  • Visa Sponsorship
  • Four Day Work Week
  • Generous Parental Leave
  • Tuition Reimbursement
  • Relocation Assistance

Requirements Summary

15+ years of hands-on experience in monitoring, observability, and infrastructure operations. Proven track record of designing and implementing observability platforms in complex environments. Experience in gap analysis and optimization of monitoring setups across infrastructure, network, application, and end-user layers.