AI SW Stack Deployment Architect at Sandisk - ScoutJobs - The AI-curated global job board
Skip to content
Sandisk
Posted 18 days ago

AI SW Stack Deployment Architect

SandiskAI SW Stack Deployment Architect

Requirements

10+ years in AI/ML systems or software architecture, Strong experience with PyTorch, Transformers, and LLMs, Hands-on experience with LLM deployment and scalable inference engines, Expertise in system design, APIs, and cross-layer integration, Experience building scalable AI platforms

Skills

PyTorchTensorFlowLLMAI/ML

About the role

Responsibilities

  • Architect the integration of vLLM, PyTorch, TensorFlow, and JAX/XLA into the Next Generation Accelerator stack
  • Define framework, compiler, runtime APIs, and technical contracts
  • Own LLM execution behavior, including batching, KV cache, and streaming inference
  • Design and implement end-to-end deployment workflows for packaging, versioning, and reproducibility
  • Drive performance optimization across the model, framework, and runtime layers
  • Collaborate cross-functionally with compiler, runtime, and low-level software teams
  • Support customer workloads, model onboarding, and debugging processes

Requirements

  • 10+ years of experience in AI/ML systems or software architecture
  • Strong experience with PyTorch, Transformers, and LLMs
  • Hands-on experience with LLM deployment and scalable inference engine systems (e.g., vLLM, Triton, SGLang)
  • Proven experience building scalable AI platforms for cloud or edge environments
  • Expertise in system design, APIs, and cross-layer integration

Preferred Qualifications

  • Experience with vLLM or similar LLM serving systems
  • Familiarity with XLA, MLIR, or other compiler frameworks
  • Exposure to AI accelerators (GPU/NPU) and runtime systems
  • Experience working with distributed or multi-agent AI systems

About the Company

Sandisk relentlessly innovates to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions serve as the beating heart of the digital world. We combine powerhouse manufacturing capabilities with an industry-leading portfolio of products recognized globally for innovation, performance, and quality.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

AI SW Stack Deployment Architect

Sandisk · Bangalore

Sign up to apply