
Posted a day ago
Senior Solutions Architect, Infiniband and Networking Ethernet
NVIDIA CorporationSenior Solutions Architect, Infiniband and Networking Ethernet - NVIS
Requirements
BS/MS/PhD in Computer Science or related field, 5+ years professional networking experience, Proficiency in LAN and InfiniBand networks, Advanced knowledge of EVPN, BGP, OSPF, VXLAN, Experience with Cumulus Linux, SONiC, IOS, JunosOS, or EOS, Automated network provisioning with Ansible, Salt, or Python, CI/CD pipeline development for network operations
Skills
InfiniBandEthernetPythonAnsibleBGPTCP/IP
About the role
Responsibilities
- Build AI/HPC infrastructure for new and existing customers.
- Support operational and reliability aspects of large-scale AI clusters, focusing on performance, real-time monitoring, logging, and alerting.
- Manage the full service lifecycle from inception and design through deployment, operation, and refinement.
- Maintain live services by measuring and monitoring availability, latency, and overall system health.
- Provide technical feedback to internal teams, including documenting workarounds and suggesting product improvements.
- Act as the technical face to customers, partners, and internal teams to implement large-scale networking projects.
Requirements
- BS/MS/PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or a related field.
- 5+ years of professional experience in networking fundamentals, TCP/IP stack, and data center architecture.
- Proficiency in configuring, testing, and resolving issues in LAN and InfiniBand networks within HPC/AI environments.
- Advanced knowledge of EVPN, BGP, OSPF, and VXLAN protocols.
- Hands-on experience with network platforms such as Cumulus Linux, SONiC, IOS, JunosOS, or EOS.
- Extensive experience with automated network provisioning using Ansible, Salt, or Python.
- Ability to develop CI/CD pipelines for network operations.
- Strong English communication skills (written, verbal, and listening).
Preferred Qualifications
- Familiarity with cloud networking environments (AWS, GCP, Azure).
- Linux or Networking industry certifications.
- Experience with High-Performance Computing (HPC) architectures and job schedulers like Slurm or PBS.
- Knowledge of Lustre management technologies or Base Command Manager (BCM).
- Experience with GPU-focused hardware and software.
About the Company
NVIDIA is a global leader in AI computing. Our NVIDIA Infrastructure Specialist Team builds some of the largest and fastest AI/HPC systems in the world, helping academic and commercial groups revolutionize deep learning, data analytics, and data center operations.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSenior Solutions Architect, Infiniband and Networking Ethernet
NVIDIA Corporation · Singapore
