Staff Infrastructure Engineer at TensorWave - ScoutJobs - The AI-curated global job board
Skip to content
TensorWave
Posted 24 days ago

Staff Infrastructure Engineer

TensorWaveStaff Infrastructure Engineer – Kubernetes Platform

Perks & benefits

Health InsuranceMedical Insurance

Requirements

7+ years infrastructure or platform engineering experience, Deep experience operating Kubernetes at scale, Strong understanding of Kubernetes internals (API server, etcd, scheduler), Expertise in Linux systems and networking stacks, Experience with CNI plugins, Experience with multi-tenant cluster models

Skills

KubernetesLinux

About the role

Responsibilities

  • Design and evolve Kubernetes control plane architecture across multiple regions
  • Define and implement multi-tenant cluster models, including shared control planes and virtual cluster approaches
  • Own the reliability and operational behavior of Kubernetes platforms in production
  • Diagnose and resolve control plane instability, API server saturation, and scheduling issues
  • Design ingress/egress architectures and optimize pod-to-pod networking and CNI behavior
  • Drive the transition from standalone clusters to regionally managed platform models
  • Collaborate with DevOps and Infrastructure teams to align platform design with compute and networking capabilities

Requirements

  • 7+ years of experience in infrastructure, platform engineering, or distributed systems
  • Deep experience operating Kubernetes at scale in production environments
  • Strong understanding of Kubernetes internals, including API server, etcd, and scheduler
  • Expertise in Linux systems and the networking stack
  • Experience with CNI plugins (Cilium preferred)
  • Proven experience with multi-tenant cluster models and resource isolation

Preferred Qualifications

  • Experience with virtual cluster technologies such as vcluster or Kamaji
  • Experience supporting GPU workloads in Kubernetes
  • Familiarity with NUMA-aware scheduling and topology-aware workloads
  • Awareness of RDMA and high-throughput networking environments
  • Experience with observability platforms like Prometheus and Grafana

Benefits

  • Stock Options
  • 100% paid Medical, Dental, and Vision insurance
  • 401(k) and Flexible Spending Account
  • Flexible PTO and Paid Holidays
  • Parental Leave
  • Company Health Savings Account contributions

About the Company

TensorWave delivers seamless, secure, reliable, and resilient AI compute at scale. We have built a versatile cloud platform that eliminates infrastructure barriers, empowering builders to focus on innovation instead of fighting their stack.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

Staff Infrastructure Engineer

TensorWave · Las Vegas

Sign up to apply