Sr. Hardware Ops Specialist – Data Center GPU (Level 3)
Quick Summary
Sr. Hardware Ops Specialist – Data Center GPU (Level 3) On-site | Buffalo, NY • Houston, TX • Aberdeen, TX This role is 100% on-site. You will be embedded in the data-center floor, participating in an on-call rotation to meet aggressive SLAs.
Exposure to emerging platforms (Grace Blackwell, MI300, Gaudi 3). Technical diploma or certifications in electrical, mechanical, or IT disciplines. Vendor-management or project-coordination experience in a hyperscale build-out.
This role is 100% on-site. You will be embedded in the data-center floor, participating in an on-call rotation to meet aggressive SLAs. Expect occasional travel between campuses for special projects or new-site launches.
Responsibilities
~1 min read- → Own the rack from rail to NIC. Rack, cable, power-on, and burn-in GPU servers, network switches, and storage nodes, logging every asset change in the CMDB.
- → Be the first-responder. Triage and resolve hardware or Layer 1/2 network incidents, escalating to remote engineering SMEs only for code-level fixes.
- → Swap anything. Replace DIMMs, GPUs, SSDs, PSUs, fans, and NICs—even if you have never seen the exact failure mode before—and validate fixes with diagnostic tools.
- → Maintain uptime. Execute structured change windows, follow ESD and OSHA safety practices, and document each action for audit and compliance.
- → Prevent before it breaks. Run capacity checks, preventive maintenance, and inventory audits, ensuring zero surprise outages.
- → Rotate and collaborate. Work a 24 × 7 shift rotation with on-call, coordinating with facilities, network, and vendor partners on expansions and retrofits.
Requirements
~1 min read- 3+ years in data-center or large-scale lab operations with direct GPU-server troubleshooting (BIOS, BMC/IPMI, PXE, firmware flashing, etc.).
- Hands-on experience with NVIDIA accelerators (H100, B200, A100, or similar).
- Solid Linux CLI and basic scripting to automate diagnostics or asset updates.
- Working knowledge of copper and fiber cabling, switches, and optics
- Proven ability to resolve unfamiliar hardware faults independently.
- Ability to lift 50 lbs, climb ladders, and work safely in hot/cold aisles.
Nice to Have
~1 min read- Exposure to emerging platforms (Grace Blackwell, MI300, Gaudi 3).
- Technical diploma or certifications in electrical, mechanical, or IT disciplines.
- Vendor-management or project-coordination experience in a hyperscale build-out.
What We Offer
~1 min readBlue Signal is an award-winning, executive search firm specializing in various specialties. Our recruiters have a proven track record of placing top-tier talent across industry verticals, with deep expertise in numerous professional services. Learn more at bit.ly/46Gs4yS
Location & Eligibility
Listing Details
- First seen
- May 6, 2026
- Last seen
- May 9, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 42%
- Scored at
- May 6, 2026
Signal breakdown
Please let Blue-Signal-Search know you found this job on Jobera.
4 other jobs at Blue-Signal-Search
View all →Explore open roles at Blue-Signal-Search.
Similar Data Center jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.