Share this job
Head of Infrastructure - Neocloud
Apply for this job

Head of Infrastructure - Neocloud

Remote, Nationwide


A rapidly scaling infrastructure innovator is seeking a proven leader to drive the next generation of high-performance compute environments. As Head of Infrastructure for this stealth-mode neocloud pioneer, you’ll be responsible for delivering world-class AI and HPC infrastructure that supports some of the most demanding workloads on the planet. This is your opportunity to shape large-scale, GPU-based systems from the ground up, while steering the roadmap for emerging compute technologies in a mission-driven and fast-paced environment.


In this influential role, you'll lead an elite team of systems, networking, and storage engineers, all united around the goal of unlocking intelligence at scale. The work is dynamic, hands-on, and high-impact—ideal for someone who thrives at the intersection of technical precision, operational execution, and executive-level influence.


What You’ll Tackle:

  • Drive end-to-end design, deployment, and operations of global-scale, bare-metal infrastructure spanning compute, storage, and networking layers.
  • Lead a team of senior infrastructure engineers and architects with a relentless focus on scalability, performance, and resiliency.
  • Architect GPU-dense clusters using state-of-the-art accelerators (NVIDIA, AMD, etc.), optimizing for rack design, fabric topology, and power/cooling considerations.
  • Align hardware roadmaps with sourcing and procurement strategies, ensuring just-in-time readiness for rapidly scaling workloads.
  • Partner with data center operations teams for seamless site launches, covering everything from physical layout to ICT and cluster readiness.
  • Define infrastructure standards to support containerized, managed environments like Kubernetes and SLURM for AI/ML workloads.
  • Build automation and lifecycle tooling to accelerate deployment velocity and maintain systems at hyperscale standards.
  • Represent infrastructure strategy in cross-functional discussions across software, security, finance, and business teams.
  • Stay engaged at the ground level—personally joining bring-ups, participating in architecture reviews, and resolving complex technical challenges.
  • Travel occasionally to data centers, OEM partners, and industry events to maintain deep market and vendor insight.

 

What Sets You Apart:

  • 10+ years of hands-on infrastructure engineering experience, including 3+ years in senior technical leadership over systems, network, or hardware teams.
  • Proven track record leading deployment of 10,000+ GPU clusters—deep understanding of timelines, hardware nuances, and execution at scale.
  • Expertise in high-performance networking (InfiniBand, RoCEv2, BGP, ECMP) for latency-sensitive, large-scale compute environments.
  • Strong expertise in server architecture, including interconnect frameworks like PCIe and NVLink, baseboard management controllers, and low-level firmware, along with experience in high-performance storage platforms such as WekaFS, VAST Data, and DDN.
  • Familiar with datacenter-specific infrastructure such as direct liquid cooling, high-density power solutions, and environmental constraints for GPU-heavy racks.
  • Strong collaboration skills and experience working with procurement, finance, and security on infrastructure delivery.
  • Outstanding ability to communicate effectively, simplifying complex technical concepts for audiences without a technical background, including clients and business stakeholders.

 

Preferred Experience:

  • Prior experience leading teams within a neocloud environment, hyperscale organization, or GPU and compute hardware manufacturer.
  • Working knowledge of platforms such as NetBox, MAAS, or Redfish, along with experience streamlining firmware updates, system imaging, and deployment processes through automation.
  • Understanding of how AI and machine learning frameworks impact system performance, including experience with technologies such as Kubernetes, SLURM, JAX, and PyTorch.
  • Experience leading infrastructure efforts in both customer-facing and internal engineering settings.

 

Why You Should Join:

  • Compensation package includes a competitive base salary, performance bonus, and equity with high upside.
  • Comprehensive health, vision, and dental benefits.
  • Competitive retirement program and ample paid time off, comparable to leading technology organizations.
  • Join a tight-knit, mission-driven team transforming how compute is delivered for advanced intelligence systems.
  • Be part of a company where infrastructure is not just a function, but a product differentiator and core focus.


About Blue Signal:  

Blue Signal is an award-winning, executive search firm specializing in various specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS 


Apply for this job