
Posted 18 days ago
AI SW Stack Deployment Architect
Sandisk
Requirements
10+ years in AI/ML systems or software architecture; strong experience with PyTorch, Transformers, and LLMs; hands-on experience with LLM deployment and scalable inference engines; expertise in system design, APIs, and cross-layer integration; experience building scalable AI platforms
Skills
PyTorch, TensorFlow, LLM, AI/ML
About the role
Responsibilities
- Architect the integration of vLLM, PyTorch, TensorFlow, and JAX/XLA into the Next Generation Accelerator stack
- Define the framework, compiler, and runtime APIs and the technical contracts between them
- Own LLM execution behavior, including batching, KV cache, and streaming inference
- Design and implement end-to-end deployment workflows for packaging, versioning, and reproducibility
- Drive performance optimization across the model, framework, and runtime layers
- Collaborate cross-functionally with compiler, runtime, and low-level software teams
- Support customer workloads, model onboarding, and debugging processes
Requirements
- 10+ years of experience in AI/ML systems or software architecture
- Strong experience with PyTorch, Transformers, and LLMs
- Hands-on experience with LLM deployment and scalable inference engines (e.g., vLLM, Triton, SGLang)
- Proven experience building scalable AI platforms for cloud or edge environments
- Expertise in system design, APIs, and cross-layer integration
Preferred Qualifications
- Experience with vLLM or similar LLM serving systems
- Familiarity with XLA, MLIR, or other compiler frameworks
- Exposure to AI accelerators (GPU/NPU) and runtime systems
- Experience working with distributed or multi-agent AI systems
About the Company
Sandisk relentlessly innovates to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions serve as the beating heart of the digital world. We combine powerhouse manufacturing capabilities with an industry-leading portfolio of products recognized globally for innovation, performance, and quality.
Sandisk · Bangalore
