Job Summary
We are seeking a dependable and detail-oriented Infrastructure Administrator to oversee the day-to-day operational health of our on-premises IT infrastructure. This includes physical servers, enterprise storage systems, Fibre Channel switching, and data backup environments. The successful candidate will play a critical role in maintaining system uptime, executing backup and recovery operations, and proactively monitoring and remediating infrastructure issues before they impact business operations. This position requires hands-on experience with physical server and storage hardware, as well as familiarity with enterprise backup technologies such as Commvault. The Infrastructure Administrator will be responsible for supporting patching and firmware updates, managing hardware lifecycles, maintaining asset inventories, and ensuring the stability and availability of production systems.
Job Location
Remote: Candidates must be legal residents of one of the following states: AK, AL, AR, AZ, CT, DE, FL, GA, IA, ID, IN, KS, KY, LA, MD, ME, MI, MN, MO, MS, NC, ND, NH, NM, NV, OH, OK, PA, SC, SD, TN, TX, UT, VA, VT, WI, WV, or WY.
We only accept W-2 candidates; H-1B sponsorship is not available.
Responsibilities
- Proactively administer and maintain physical server and enterprise storage infrastructure across on-premises and co-located data centers.
- Manage and maintain Commvault backup environments, including job scheduling, policy enforcement, and disaster recovery readiness.
- Oversee Fibre Channel switch infrastructure, including zoning, switch firmware, port usage, and error monitoring.
- Perform routine infrastructure maintenance including patching, firmware upgrades, and health checks on servers, storage arrays, and switches.
- Monitor system performance and storage capacity to detect and resolve issues before they impact business operations.
- Support Windows and Linux environments, ensuring timely patching, performance tuning, and secure configuration management.
- Maintain detailed infrastructure documentation including configuration baselines, SOPs, runbooks, diagrams, and asset inventories.
- Collaborate with Infrastructure Engineering on system upgrades, refresh projects, migrations, and capacity expansion planning.
- Participate in incident response, root cause analysis (RCA), and continual service improvement activities.
- Support after-hours maintenance and participate in the on-call rotation as required.
Physical Requirements
- Work is performed while sitting/standing and interfacing with a personal computer.
- Requires the ability to communicate effectively using speech, vision, and hearing.
- Requires the regular use of hands for simple grasping and fine manipulations.
- Requires occasional bending, squatting, crawling, climbing, and reaching.
- Requires the ability to occasionally lift, carry, push, or pull medium weights, up to 50 lbs.
Qualifications
Experience
- Proven ability to support on-premises data center infrastructure, including both physical and virtual systems
- Strong experience maintaining enterprise server hardware, including firmware/BIOS updates, diagnostics, and hardware replacement
- Practical involvement in disaster recovery planning and execution, including backup verification, restore testing, and site failover exercises
- Proficiency in monitoring infrastructure health, identifying performance issues, and assisting with root cause analysis using observability and monitoring tools
- Familiarity with full infrastructure lifecycle tasks such as hardware provisioning, system upgrades, and decommissioning of end-of-life equipment
- Contributions to infrastructure initiatives such as system migrations, hardware refresh projects, and deployment of new physical and virtual infrastructure
Education
This role does not require a degree. We value relevant skills, experience, and alignment with our core values above all else.
Desired Traits & Skills
Datacenter Management
- Physical server installation, structured cabling, and racking best practices
- Environmental monitoring (temperature, humidity, airflow) and capacity planning (power/cooling)
- Hands-on use of out-of-band (OOB) management tools (iDRAC, iLO) for remote diagnostics
- Rack elevation planning, vendor coordination, and asset lifecycle documentation
Server Hardware Management
- Dell PowerEdge and modular infrastructure
- RAID configuration, BIOS/firmware updates, and hardware diagnostics
- Predictive health monitoring and preventative hardware maintenance
Storage, Backup & Disaster Recovery
- SAN/NAS systems including Dell EMC, NetApp, and Dell ECS
- Fibre Channel zoning, fabric management, and switch administration
- Commvault configuration, backup validation, retention policies, and DR testing
- Snapshot, replication, deduplication, and archival strategies
Operating Systems & Virtualization
- Windows Server (2016–2025) and Linux (RHEL, Ubuntu, Rocky Linux)
- VMware vSphere, ESXi, and vCenter operations and health monitoring
Monitoring & Observability
- Zabbix, Grafana, Prometheus, Splunk
- Custom dashboards, alerting, and system telemetry
- RCA participation and performance tuning
Automation & Scripting
- PowerShell, Bash, Python scripting
- Experience with Ansible, Terraform, or other IaC tools
- Git version control and automated provisioning
Tooling & Documentation
- CMDB systems and ITSM platforms (e.g., HaloITSM, ServiceNow, Jira)
- Confluence, SharePoint, Markdown for SOPs and documentation
- Change management, asset tracking, and incident workflows aligned with ITIL