Share this job
DC Cluster Lead
Buffalo, NY
Apply for this job

DC Cluster Lead

Locations: Hybrid – Buffalo, NY (Open to relocation for those outside of Buffalo)

Industry: AI Infrastructure | Data Center Operations | Hyperscale Deployment

Employment Type: Full-Time


A pioneering force in next-gen compute infrastructure is seeking a DC Cluster Lead to lead critical deployment efforts of hyperscale AI-focused data centers across the U.S. This is a high-impact, hands-on leadership opportunity with a stealth-mode, high-growth company building the foundation for tomorrow’s artificial intelligence breakthroughs.


As they undergo hypergrowth and expand into new regions, the company is scaling its technical operations team to support nationwide deployments of custom AI/ML fabrics. Having secured collaborations with leading AI research groups and major companies, they are shifting their attention to carrying out large-scale, highly technical field work. The DC Cluster Lead will be at the helm of this expansion, responsible for hands-on buildouts, leading small elite teams, and crafting deployment playbooks that scale from dozens to hundreds of megawatts. Equity packages for this role are projected to significantly exceed industry benchmarks (2–4x base salary), providing rare upside potential.

 

Key Responsibilities:

  • Lead end-to-end deployment of AI-centric data center network fabrics, including front-end, back-end, BMS, and management networks.
  • Build and mentor a growing field deployment team responsible for datacenter setup, configuration, fiber coordination, validation, and remediation.
  • Create and implement thorough guides for deployment, quality checkpoints, operational procedures, and transition requirements.
  • Coordinate efforts across data center operations, ICT, network engineering, hardware, and external partners.
  • Drive field-level troubleshooting and incident response, ensuring production readiness with full logical and physical validation.
  • Own operational excellence KPIs across multiple sites, including MTTR, uptime, cost efficiency, and SLA compliance.
  • Act as a cross-functional leader and primary escalation point during critical deployment phases.

 

Qualifications:

  • 8+ years in data center engineering or field operations, with at least 3+ years of direct team leadership experience.
  • Proven expertise deploying AI/ML or hyperscale fabrics using technologies such as EVPN/VXLAN, BGP, CLOS, and high-radix switching.
  • Experience leading on-site deployments at hyperscale campuses (ideal candidates come from cloud, hyperscale, or AI infra providers).
  • Strong hands-on skills across configuration, remediation, RMA, and cable/fiber management in production environments.
  • Excellent communication and cross-functional coordination abilities.
  • Willingness to travel up to 70–80% as needed for on-site deployments across data center campuses.
  • Extreme self-sufficiency and comfort with ambiguity, startup pace, and high ownership culture.

Preferred:

  • Experience leading deployments in Tier III/IV or 50MW+ campuses.
  • Familiarity with RoCE, ECN, RDMA, PFC, and real-time validation frameworks.
  • Background at companies such as Meta, Google, Oracle, or similar-scale environments.

 

Why Apply:

  • Own a Career-Defining Buildout: Be the architect of first-of-their-kind AI data center fabrics at true hyperscale.
  • Massive Growth: Join a mission-driven, founder-led company that’s scaling from stealth to global impact.
  • High-Upside Equity: With equity targets exceeding 2–4x base comp, this role offers meaningful long-term ownership.
  • Elite Culture: High trust, high accountability, and high-impact work in an environment built for speed.


About Blue Signal:  

Blue Signal is an award-winning, executive search firm specializing in various specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS 



Apply for this job