Sr. Hardware Ops Specialist – Data Center GPU (Level 3)

Overview

On-site | Buffalo, NY • Houston, TX • Aberdeen, TX

This role is 100% on-site. You will be embedded on the data-center floor and take part in an on-call rotation to meet aggressive SLAs. Expect occasional travel between campuses for special projects or new-site launches.

Responsibilities

  • Own the rack from rail to NIC. Rack, cable, power on, and burn in GPU servers, network switches, and storage nodes, logging every asset change in the CMDB.
  • Be the first responder. Triage and resolve hardware or Layer 1/2 network incidents, escalating to remote engineering SMEs only for code-level fixes.
  • Swap anything. Replace DIMMs, GPUs, SSDs, PSUs, fans, and NICs, even if you have never seen the exact failure mode before, and validate fixes with diagnostic tools.
  • Maintain uptime. Execute structured change windows, follow ESD and OSHA safety practices, and document each action for audit and compliance.
  • Prevent before it breaks. Run capacity checks, preventive maintenance, and inventory audits, with the goal of zero surprise outages.
  • Rotate and collaborate. Work a 24×7 shift rotation with on-call duty, coordinating with facilities, network, and vendor partners on expansions and retrofits.

Requirements

  • 3+ years in data-center or large-scale lab operations with direct GPU-server troubleshooting (BIOS, BMC/IPMI, PXE, firmware flashing, etc.).
  • Hands-on experience with NVIDIA accelerators (H100, B200, A100, or similar).
  • Solid Linux CLI and basic scripting to automate diagnostics or asset updates.
  • Working knowledge of copper and fiber cabling, switches, and optics.
  • Proven ability to resolve unfamiliar hardware faults independently.
  • Ability to lift 50 lbs, climb ladders, and work safely in hot/cold aisles.

Nice to Have

  • Exposure to emerging platforms (Grace Blackwell, MI300, Gaudi 3).
  • Technical diploma or certifications in electrical, mechanical, or IT disciplines.
  • Vendor-management or project-coordination experience in a hyperscale build-out.

What We Offer

  • Competitive base salary (dependent on experience and location).
  • Equity participation in a fast-growth AI-infrastructure company.
  • Comprehensive medical, dental, and vision coverage.
  • Retirement plan with company match.
  • Generous PTO plus paid holidays aligned with local norms.
  • Professional development budget and clear technical-leadership career path.

Blue Signal is an award-winning executive search firm serving a range of specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS


Location & Eligibility

Where is the job: Buffalo, United States (on-site at the office)
Who can apply: US

Listing Details

First seen: May 6, 2026
Last seen: May 9, 2026
