
Posted a day ago
IT SRE Team Lead
Cerebras SystemsIT SRE Team Lead
Requirements
8+ years in SRE, DevOps, or IT engineering, 2+ years in leadership, Experience with AI coding tools and AI agents, Proficiency in Python or Go, Experience with Okta, Entra, Jamf, or Intune, Hands-on with Terraform and CI/CD
Skills
PythonGoTerraformOKTAJamfSRE
About the role
Responsibilities
- Define and own the reliability strategy for internal IT systems, including SLOs, error budgets, and operational health reporting.
- Build and lead a team of IT SRE engineers focused on automation, observability, and incident response for corporate systems.
- Design and implement automation to eliminate manual IT work across provisioning, access management, patching, and lifecycle operations.
- Instrument internal services and SaaS integrations with monitoring, alerting, and on-call workflows.
- Run incident response for IT outages, including root cause analysis and durable remediation.
- Drive infrastructure-as-code and GitOps practices across IT-owned systems.
- Partner with security and networking teams on identity, access, and network reliability.
Requirements
- Minimum 8 years of experience in SRE, DevOps, or IT engineering roles, with at least 2 years in a leadership capacity.
- Direct hands-on experience with AI coding tools, building and deploying AI agents for triage and bug fixes.
- Strong software engineering background with hands-on experience in Python, Go, or similar.
- Deep experience with identity platforms (Okta, Entra), endpoint management (Jamf, Intune), and SaaS integration patterns.
- Hands-on experience with infrastructure-as-code tools (Terraform) and CI/CD pipelines applied to IT systems.
- Proven track record of running on-call rotations and driving operational maturity in fast-moving environments.
About the Company
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, empowering machine learning users to effortlessly run large-scale ML applications. Our technology is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via agentic computation.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeIT SRE Team Lead
Cerebras Systems · Sunnyvale
