Head of AI Infrastructure - Neocloud
Location – Remote | Open to qualified candidates working remotely anywhere in the United States
Our confidential partner is an early-stage hyperscale innovator on a mission to build a next-generation “neocloud” optimized for large-scale AI workloads. Backed by prominent investors and already powering live customer pilots, they are assembling a world-class leadership team that will define how massive GPU fleets are designed, deployed, and operated. As Head of AI Infrastructure, you will own the technical roadmap for a truly global, security-first GPU cloud platform—shaping everything from hardware topology to customer onboarding. If you thrive on blank-sheet architecture, love solving problems at multi-petaflop scale, and want equity in a company poised to challenge the status quo, this role is for you.
Key Responsibilities
- Create and evolve an elastic GPU cloud fabric capable of scaling from hundreds to many thousands of accelerators while maintaining low-latency performance for training and inference.
- Define compute, storage, and high-speed network blueprints that meet rigorous security and compliance requirements across multiple geographies.
- Own Kubernetes-based scheduling, multi-tenant isolation, and capacity-planning strategies for a global fleet.
- Guide enterprise customers through architecture reviews, proof-of-concept deployments, and production cut-overs.
- Evaluate and negotiate with hardware, colocation, and network vendors to balance performance, cost, and supply-chain resiliency.
- Build, mentor, and inspire a distributed team of infrastructure architects and site-reliability engineers.
- Serve as the senior technical voice in executive discussions, investor updates, and strategic partnerships.
What You Bring
- 10+ years of progressive experience in cloud infrastructure, platform engineering, or systems architecture, with demonstrable success operating large GPU clusters.
- Expertise in PCIe or NVLink topologies, high-performance networking (InfiniBand/RoCE), and distributed storage for AI workloads.
- Deep production experience with Kubernetes (or similar schedulers) for GPU environments, including GPU virtualization and multi-tenant isolation.
- Proven track record leading customer-facing solution architecture or technical sales engagements.
- Comfort working in fast-moving, venture-backed environments where you set the roadmap and the pace.
- Bonus points for hands-on exposure to liquid cooling, hybrid or multi-cloud deployments, and large-scale model training frameworks.
Why You’ll Love It Here
- Immediate impact – You will be the first senior leader dedicated to infrastructure and will directly unlock revenue-generating customer deals.
- Equity with upside – Join before hyper-growth and share in the value you create.
- Cutting-edge tech stack – Design from a clean slate using the newest GPUs, fabrics, and sustainability-minded data-center solutions.
- Global vision, remote flexibility – Collaborate with distributed teams and travel to high-growth AI regions (APAC, NA) as needed while enjoying a remote-first culture.
- Backed to win – Supported by seasoned founders and significant capital commitments, giving you the resources to move fast and think big.
Compensation & Benefits
This role offers a highly competitive base salary, performance bonus, and meaningful equity grant, commensurate with experience. The comprehensive benefits package includes medical, dental, vision, 401(k), unlimited PTO, and a home-office stipend.
About Blue Signal
Blue Signal is an award-winning executive search firm serving a range of specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in professional services. Learn more at bit.ly/46Gs4yS