Network Engineer (HPC/RDMA)

Apply Now

Company: TensorWave

Location: Las Vegas, NV 89110

Description:

At Tensorwave, we're leading the charge in AI compute, building a versatile cloud platform that's driving the next generation of AI innovation. We're focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what's possible in the AI landscape.

Job Description

We are looking for a HPC/RDMA Engineer with a passion for AI and advanced networking technologies. The ideal candidate will support our vision by developing and managing a networking infrastructure that underpins our innovative AI cloud services. This role involves exploring and integrating new types of network fabrics to enhance our platform's performance and scalability, ensuring optimal operation for our clients' AI projects.

Responsibilities
    • Collaborate with a dynamic IT team to design and implement innovative networking solutions that meet the demands of high-performance AI workloads.
    • Lead initiatives to explore and integrate new types of network fabrics, enhancing the scalability and efficiency of our AI infrastructure
    • Ensure network reliability, performance, and security for cloud services, optimizing for both AMD and NVIDIA GPU technologies.
    • Work closely with the AI development team to align networking strategies with the overall goals of TensorWave's cloud platform.
    • Troubleshoot and resolve complex networking issues, providing expert guidance and solutions to maintain high service levels.


Essential Skills & Qualifications
    • Bachelor's degree in Computer Science, Information Technology, or related field.
    • At least 5 years of relevant experience in network engineering, with a focus on supporting high-performance computing (HPC) and AI applications.
    • Strong knowledge of BGP, Ethernet protocols, RoCEv2, and network security practices.
    • Experience with or keen interest in exploring new network fabrics and technologies, particularly in the context of AI and cloud computing.
    • Familiarity with AMD and NVIDIA GPU ecosystems and their impact on network performance and configuration.
    • Exceptional problem-solving abilities and a commitment to innovation in networking for AI applications.


We're looking for resilient, adaptable people to join our team-folks who enjoy collaborating and tackling tough challenges. We're all about offering real opportunities for growth, letting you dive into complex problems and make a meaningful impact through creative solutions. If you're a driven contributor, we encourage you to explore opportunities to make an impact at Tensorwave. Join us as we redefine the possibilities of intelligent computing.

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

Stock Options

100% paid Medical, Dental, and Vision insurance

Life and Voluntary Supplemental Insurance

Short Term Disability Insurance

Flexible Spending Account

401(k)

Flexible PTO

Paid Holidays

Parental Leave

Mental Health Benefits through Spring Health

Similar Jobs