
Posted 4 hours ago
Principal AI Site Reliability Engineer
OraclePrincipal AI Site Reliability Engineer
Requirements
7+ years software engineering or SRE experience, Experience with distributed systems, Proficiency in Python, Java, or Go, Experience with CI/CD and Infrastructure as Code, U.S. citizenship required, Ability to obtain security clearance
Skills
KubernetesTerraformPythonDockerPrometheusGrafanaOCIAWS
About the role
Responsibilities
- Design, build, and operate highly reliable, scalable, and secure infrastructure for the Oracle Health Patient Portal.
- Advance automation, observability, and AI-assisted reliability practices to support cloud operations.
- Improve system reliability through performance optimization, monitoring, and automated remediation.
- Partner with development teams to enhance service architecture, scalability, and operability.
- Participate in on-call rotations and perform root cause analysis for complex production issues.
- Drive continuous improvement in DevOps/SRE practices, including CI/CD and Infrastructure as Code.
Requirements
- 7+ years of software engineering, cloud infrastructure, SRE, or DevOps experience.
- Proven experience with distributed systems, performance monitoring, and resiliency patterns.
- Proficiency in Python, Java, or Go.
- Experience with CI/CD pipelines (Jenkins, Kubernetes) and Infrastructure as Code (Terraform).
- Experience with observability tools such as Prometheus and Grafana.
- U.S. citizenship is required.
- Ability to obtain and maintain a U.S. government security clearance.
Preferred Qualifications
- Experience in healthcare or regulated environments (HIPAA, compliance frameworks).
- Experience building self-healing or autonomous infrastructure systems.
- Experience working in environments requiring security clearance.
Benefits
- Medical, dental, and vision insurance.
- 401(k) Savings and Investment Plan with company match.
- Flexible Vacation and paid time off.
- Paid parental leave and adoption assistance.
- Employee Stock Purchase Plan.
About the Company
Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. With AI embedded across our products and services, we help customers turn that promise into a better future for all.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freePrincipal AI Site Reliability Engineer
Oracle · United States
