
Posted a day ago
Senior Staff Site Reliability Engineer
SynopsysSenior Staff, Site Reliability Engineer
Requirements
7+ years SRE or DevOps experience, Experience with LLMs, Azure OpenAI, or LangChain, Deep Azure knowledge (AKS, Blob Storage, Redis), Proficiency in Python and TypeScript, Expertise in Terraform, Kubernetes, Helm, and Docker, Experience with OpenTelemetry and ELK stack, Strong Linux and networking knowledge
Skills
AzureKubernetesPythonTerraformTypeScriptOpenTelemetryLLM
About the role
Responsibilities
- Own availability, latency, performance, and capacity for Cloud-native SaaS products running on Azure AKS
- Design and deploy AI agents using LLMs, Azure OpenAI, or LangChain to automate complex operational workflows
- Build self-healing internal services instrumented with OpenTelemetry to detect and resolve incidents automatically
- Define and enforce SLIs, SLOs, and error budgets using the ELK stack and Azure Monitor
- Lead post-incident reviews and drive continuous improvement to turn outages into automation opportunities
- Participate in a rotational on-call schedule to maintain high-availability for global customers
Requirements
- 7+ years of experience in a dedicated SRE or DevOps role managing high-traffic SaaS environments
- Hands-on experience implementing AI agents using LLMs, Azure OpenAI, or LangChain
- Deep architectural knowledge of Azure (AKS, Blob Storage, Redis Cache, Azure Monitor, and Azure Automation)
- Proficiency in Python and TypeScript for writing clean, testable automation code
- Expert-level command of Terraform, Kubernetes, Helm, Docker, and GitHub Actions
- Deep experience with OpenTelemetry and the ELK stack for distributed tracing and observability
- Strong understanding of Linux internals and networking protocols
Benefits
- Comprehensive range of health, wellness, and financial benefits
- Opportunities to work on cutting-edge AI-driven reliability engineering
- A collaborative environment focused on scaling next-generation cloud solutions
About the Company
Synopsys is the leader in engineering solutions from silicon to systems, enabling customers to rapidly innovate AI-powered products. We deliver industry-leading silicon design, IP, simulation, and analysis solutions to power innovation across a wide range of industries.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSenior Staff Site Reliability Engineer
Synopsys · Bengaluru
