
Posted 6 hours ago
Senior Solutions Architect, Infiniband and Networking Ethernet
NVIDIA Corporation
Requirements
BS/MS/PhD in CS, EE, Physics, or Math; 5+ years of networking experience; proficiency in LAN and InfiniBand; knowledge of EVPN, BGP, OSPF, and VXLAN; experience with Cumulus Linux, SONiC, IOS, Junos OS, or EOS; automation with Ansible, Salt, or Python; CI/CD pipeline development; willingness to travel ~30%
Skills
InfiniBand, Ethernet, Python, Ansible, BGP, TCP/IP, Linux
About the role
Responsibilities
- Build AI/HPC infrastructure for new and existing customers.
- Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, real-time monitoring, logging, and alerting.
- Engage in the full service lifecycle, from inception and design through deployment, operation, and refinement.
- Maintain services by measuring and monitoring availability, latency, and overall system health.
- Provide technical feedback to internal teams, including documenting workarounds and suggesting product improvements.
Requirements
- BS/MS/PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or a related field.
- 5+ years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture.
- Proficiency in configuring, testing, and resolving issues in LAN and InfiniBand networks within HPC/AI environments.
- Advanced knowledge of EVPN, BGP, OSPF, and VXLAN protocols.
- Hands-on experience with network platforms such as Cumulus Linux, SONiC, IOS, Junos OS, or EOS.
- Extensive experience with automated network provisioning using Ansible, Salt, or Python.
- Ability to develop CI/CD pipelines for network operations.
- Willingness to travel approximately 30%.
Preferred Qualifications
- Familiarity with cloud networks (AWS, GCP, Azure).
- Linux or Networking certifications.
- Experience with high-performance computing architectures and job schedulers such as Slurm or PBS.
- Knowledge of cluster management technologies (e.g., Base Command Manager).
- Experience with GPU-focused hardware and software.
About the Company
NVIDIA is a global leader in AI computing. Our Infrastructure Specialist Team builds and supports some of the largest and fastest AI/HPC systems in the world, helping academic and commercial groups revolutionize deep learning and data analytics.
NVIDIA Corporation · Hsinchu
