Senior Staff Site Reliability Engineer at Synopsys - ScoutJobs - The AI-curated global job board
Skip to content
Synopsys
Posted 2 days ago

Senior Staff Site Reliability Engineer

SynopsysSenior Staff, Site Reliability Engineer

Requirements

7+ years SRE or DevOps experience, Experience with LLMs, Azure OpenAI, or LangChain, Deep Azure knowledge (AKS, Blob Storage, Redis), Proficiency in Python and TypeScript, Expertise in Terraform, Kubernetes, Helm, and Docker, Experience with OpenTelemetry and ELK stack, Strong Linux and networking knowledge

Skills

AzureKubernetesPythonTerraformTypeScriptOpenTelemetryLLM

About the role

Responsibilities

  • Own availability, latency, performance, and capacity for Cloud-native SaaS products running on Azure AKS
  • Design and deploy AI agents using LLMs, Azure OpenAI, or LangChain to automate complex operational workflows
  • Build self-healing internal services instrumented with OpenTelemetry to detect and resolve incidents automatically
  • Define and enforce SLIs, SLOs, and error budgets using the ELK stack and Azure Monitor
  • Lead post-incident reviews and drive continuous improvement to turn outages into automation opportunities
  • Participate in a rotational on-call schedule to maintain high-availability for global customers

Requirements

  • 7+ years of experience in a dedicated SRE or DevOps role managing high-traffic SaaS environments
  • Hands-on experience implementing AI agents using LLMs, Azure OpenAI, or LangChain
  • Deep architectural knowledge of Azure (AKS, Blob Storage, Redis Cache, Azure Monitor, and Azure Automation)
  • Proficiency in Python and TypeScript for writing clean, testable automation code
  • Expert-level command of Terraform, Kubernetes, Helm, Docker, and GitHub Actions
  • Deep experience with OpenTelemetry and the ELK stack for distributed tracing and observability
  • Strong understanding of Linux internals and networking protocols

Benefits

  • Comprehensive range of health, wellness, and financial benefits
  • Opportunities to work on cutting-edge AI-driven reliability engineering
  • A collaborative environment focused on scaling next-generation cloud solutions

About the Company

Synopsys is the leader in engineering solutions from silicon to systems, enabling customers to rapidly innovate AI-powered products. We deliver industry-leading silicon design, IP, simulation, and analysis solutions to power innovation across a wide range of industries.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

Senior Staff Site Reliability Engineer

Synopsys · Bengaluru

Sign up to apply