Head of Infrastructure
Location: Hybrid – San Jose, CA
An emerging player in the AI infrastructure space is transforming how the world accesses and deploys compute at scale. With a stealth-mode launch and powerful backing, this innovative startup is pioneering GPU-as-a-Service solutions to support AI workloads worldwide. They’re searching for a dynamic and experienced infrastructure leader to guide the physical buildout of their next-generation GPU clusters.
This high-impact position offers the rare opportunity to shape infrastructure strategy at a formative stage and help lay the physical foundation for some of the world’s most advanced machine learning systems. If you’re looking to join a trailblazing company during a period of exponential growth, this is your opportunity to make a defining career move.
Key Responsibilities
- Drive the full lifecycle of physical infrastructure projects, from site selection and vendor negotiation to data center integration and GPU system layout.
- Architect modular systems and standards to accommodate explosive growth across global regions and availability zones.
- Serve as the key liaison between infrastructure, engineering, operations, and hardware teams to ensure seamless collaboration and platform interoperability.
- Build and manage relationships with colocation providers, hardware OEMs, and critical service vendors to enable cost-effective, high-performance deployments.
- Establish systems for monitoring, capacity forecasting, disaster recovery, and reliability that match hyperscale quality expectations.
- Build and lead a world-class team of engineers and deployment specialists as the infrastructure function scales with the business.
Qualifications
- 10+ years of experience with large-scale data center and compute infrastructure, preferably in AI/ML, HPC, or hyperscale environments.
- Deep technical background in GPU clusters, high-density computing, power and thermal management, and network design.
- Proven success launching major physical infrastructure initiatives from scratch, ideally in startup or high-growth scenarios.
- Expertise with NVIDIA GPUs (e.g., H100, A100) and associated technologies (InfiniBand, NVLink, immersion cooling, smart PDUs).
- Strong leadership and vendor management skills across hardware, colocation, and systems integration domains.
- Business-minded approach to cost/performance tradeoffs, with a track record of delivering high-performance infrastructure under aggressive timelines.
- Comfortable with ambiguity, rapid scaling, and fast-moving environments where infrastructure defines company success.
What’s In It for You
- Be part of an elite, mission-driven team building infrastructure from the ground up.
- Play a foundational role in the development of one of the largest AI-focused GPU deployments globally.
- Influence architecture and operational strategy in a category-defining company.
- Receive competitive compensation and meaningful equity with significant upside.
- Thrive in a culture of technical excellence, agility, and innovation.
About Blue Signal:
Blue Signal is an award-winning, executive search firm specializing in various specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS