Magic (magic.dev)

HPC Networking Lead

Posted 15 Days Ago
Remote
2 Locations
100K-550K Annually
Senior level
The HPC Networking Lead will optimize communication in distributed training by implementing algorithms, designing monitoring systems, and enhancing overall network performance.
The summary above was generated by AI

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As the HPC Networking Lead, you will be the lead technical contributor to an internal NCCL-like library, optimizing performance for the communication patterns our workloads require.

What you might work on: 

  • Implement and tune custom collective communication algorithms for specific topologies

  • Design and implement network monitoring systems for our training clusters

  • Implement and benchmark collective communication primitives to achieve low latency and high throughput

  • Contribute to the development of debugging and profiling tools for network communication performance analysis

  • Integrate new communication techniques into the overall system architecture

What we’re looking for: 

  • Deep understanding of sharding techniques used in distributed training (pipeline/tensor/data parallelism) 

  • Experience contributing to a collective communication library such as NCCL or an MPI implementation

  • Expert understanding of RoCE/IB RDMA networks and experience writing distributed algorithms using RDMA

  • A track record of contributing to open-source projects related to high-performance networking

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience. 

Our culture:

  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building 

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):

  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

Top Skills

Distributed Algorithms
IB RDMA
MPI
NCCL
Networking Monitoring Systems
RoCE

HQ

Magic (magic.dev) San Francisco, California, USA Office

San Francisco, CA, United States
