
Posted 4 days ago
Software Engineer, Platform Infrastructure (Foundations)
AnyscaleSoftware Engineer, Platform Infrastructure (Foundations)
Requirements
Bachelor's degree in Computer Science or equivalent, 3+ years of production code experience, Experience with distributed systems, Expertise in cloud-native technologies (AWS, Azure, GCP), Kubernetes expertise, Proficiency in Go and Python, Knowledge of Linux kernel and containers
Skills
GoPythonKubernetesAWSDistributed Systems
About the role
Responsibilities
- Design, build, and scale services that orchestrate Ray clusters across cloud and on-prem environments
- Optimize control plane components for large-scale, distributed AI/ML workloads
- Build intelligent scheduling and resource management systems for heterogeneous compute clusters
- Develop features to enhance the reliability, performance, scalability, and observability of managed workloads
- Support and optimize accelerator integration, such as GPUs and TPUs
- Handle container image management and dependency resolution for distributed workloads
- Participate in code reviews, design discussions, and provide on-call support to troubleshoot infrastructure issues
Requirements
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
- 3+ years of experience writing high-quality production code
- Hands-on experience building and maintaining highly available, scalable, and performant distributed systems
- Expertise in cloud-native technologies (AWS, Azure, GCP) and Kubernetes-based deployments
- Proficiency in Go and Python
- Knowledge of low-level operating system foundations, including Linux kernel, file systems, and containers
Preferred Qualifications
- Deep understanding of networking, security, and authentication mechanisms in cloud environments
- Familiarity with observability stacks such as Prometheus and Grafana
About the Company
Anyscale is on a mission to democratize distributed computing. We are commercializing Ray, a popular open-source project that creates an ecosystem of libraries for scalable machine learning. Our platform enables developers and data scientists to scale ML applications from a laptop to a massive cluster seamlessly. We are backed by leading investors including Andreessen Horowitz, NEA, and Addition.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSoftware Engineer, Platform Infrastructure (Foundations)
Anyscale · Bengaluru
