Network Automation Lead
USA

Location: Nationwide/Remote, Bay Area Candidates Preferred


Join a stealth-mode trailblazer that is building hyperscale AI compute platforms for some of the most demanding machine-learning workloads on the planet. Backed by multi-billion-dollar funding and partnered closely with top silicon and cloud innovators, this company is racing to deliver next-generation GPU fabrics that will power the future of artificial intelligence. As the Network Automation Lead, you will own the code, pipelines, and observability stack that turn thousands of ports and petabits of bandwidth into a self-driving, lights-out network. Your work will directly accelerate time-to-model for cutting-edge AI research, giving you a front-row seat to breakthroughs that will define the next decade.


What You’ll Tackle

  • Design, build, and maintain end-to-end automation pipelines for provisioning, validating, and operating multi-terabit data-center fabrics exceeding 1,000 devices.
  • Develop CI/CD workflows (e.g., GitLab CI, Jenkins, Argo CD) that enable safe, rapid, and repeatable network changes.
  • Implement zero-touch onboarding and “golden state” configuration management at scale, eliminating manual intervention during device turn-up.
  • Stand up streaming-telemetry and observability frameworks leveraging gNMI, gRPC, OpenConfig, Prometheus, and Grafana to provide real-time insight into network health.
  • Create controls to detect configuration drift, validate intent, and automate rollback or remediation.
  • Partner with architecture, platform, and data-center teams to ensure fault-tolerant operation across geographically distributed clusters.
  • Define internal standards, document best practices, and mentor engineers on network-as-code principles and DevOps culture.


Ideal Profile

  • 10+ years in large-scale network engineering or automation, including hands-on ownership of deployments with 1,000+ nodes.
  • Expert-level scripting in Python, Go, or similar, plus deep facility with tools such as Nornir, Ansible, Terraform, or SaltStack.
  • Proven experience designing CI/CD pipelines for infrastructure, and familiarity with container-based workflows (Docker/K8s).
  • Strong understanding of modern DC protocols and architectures (BGP EVPN, VXLAN, MPLS, RoCEv2, or InfiniBand).
  • Comfortable building and scaling telemetry systems that feed time-series databases and dashboarding platforms.
  • Passion for automation, reliability engineering, and high-velocity startup environments.
  • US work authorization required; no current visa sponsorship.


Why You’ll Love It

  • Massive Impact: Your automation will light up clusters housing tens of thousands of GPUs that train frontier AI models.
  • Top-Tier Rewards: Competitive seven-figure total-comp potential including base, sign-on, performance bonus, and significant equity.
  • Remote First: Work from anywhere in the United States, with occasional visits to flagship data-center sites. Bay Area residents receive priority consideration for on-site collaboration.
  • Velocity & Ownership: Flat org, zero bureaucracy, direct line to executive leadership, and the freedom to build the systems you’ve always wanted.
  • Mission & Culture: Join builders who thrive on solving scale, speed, and reliability challenges at the bleeding edge of AI infrastructure.


About Blue Signal:  

Blue Signal is an award-winning executive search firm that recruits across a range of specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS


