ServiceNow

Senior Site Reliability Engineer - Cloud Operations

Join ServiceNow in San Diego as a Senior Site Reliability Engineer. Leverage your SRE skills and ServiceNow expertise to enhance cloud infrastructure reliability.

ServiceNow Role Type:
System Administrator
ServiceNow Modules:
DevOps
IT Service Management
Incident Management
Security Operations
ServiceNow Certifications (nice to have):
Certified System Administrator

Job description

Posted on: July 16, 2025

We are looking for a Senior Site Reliability Engineer to join our Cloud Operations team. As an SRE, you will be responsible for maintaining and improving the reliability, scalability, and performance of our cloud infrastructure. You will use your software development, systems engineering, and networking expertise to prevent recurring issues and to drive initiatives that improve the reliability and performance of our infrastructure.

Requirements

  • Experience using AI, or thinking critically about how to integrate it, in work processes, decision-making, or problem-solving
  • Solid understanding of Linux systems, networking, and container security
  • Proficiency with infrastructure-as-code tools like Terraform and Ansible
  • 4+ years of experience in an SRE, DevOps, or cloud infrastructure role
  • 4+ years of programming/scripting experience in Python, Go, Bash, and JavaScript
  • 4+ years of Linux system administration experience, with deep knowledge of Linux systems
  • 4+ years of experience operating and scaling Kubernetes in production environments
  • Knowledge of database technologies including MySQL, MariaDB, and PostgreSQL
  • Expertise with GitLab CI/CD and modern software delivery practices
  • Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, etc.)
  • Experience with cloud technologies, including Azure, AWS, and GCP
  • Ability to leverage AI technologies to enhance system reliability, automate operational tasks, and optimize performance monitoring and incident response processes
  • Team-first attitude and an uncompromising attention to detail
  • Excellent collaboration and communication skills
  • Experience developing on the ServiceNow Platform is a bonus!

Benefits

  • Base pay of $126,700 - $215,400
  • Equity (when applicable)
  • Variable/incentive compensation
  • Benefits
  • 401(k) Plan with company match
  • ESPP
  • Matching donations
  • Flexible time away plan
  • Family leave programs

Requirements Summary

4+ years of experience in an SRE, DevOps, or cloud infrastructure role; programming/scripting skills in Python, Go, Bash, and JavaScript; Linux system administration with deep knowledge of Linux systems; and experience operating and scaling Kubernetes in production environments.