Job Summary
We are seeking a highly skilled and reliable Infrastructure Engineer to manage and maintain our physical IT infrastructure, including servers, storage systems, and data backup solutions. This role is crucial for ensuring the performance, reliability, and integrity of our on-premises hardware and for supporting the organization’s data protection and recovery strategies.
The ideal candidate will have hands-on experience with enterprise server and storage hardware, as well as expertise in implementing and maintaining backup and disaster recovery solutions. Responsibilities include installing and configuring physical servers, managing storage environments, monitoring system health, and performing regular data backups and restores. This position plays a key role in ensuring system uptime, safeguarding critical data, and supporting ongoing infrastructure improvements to meet organizational needs.
Job Location
Remote: Legal residents of one of the following states: AK, AL, AR, AZ, CT, DE, FL, GA, IA, ID, IN, KS, KY, LA, MD, ME, MI, MN, MO, MS, NC, ND, NH, NM, NV, OH, OK, PA, SC, SD, TN, TX, UT, VA, VT, WI, WV, or WY
We only accept W-2 candidates, H-1B sponsorship is not available.
Responsibilities
- Assist in the daily management, maintenance, and optimization of physical server and storage infrastructure.
- Participate in infrastructure projects and initiatives under the direction of senior engineers, including planning, implementation, and testing.
- Troubleshoot and resolve hardware and system-level issues related to servers and storage devices in a timely manner.
- Support the operation, monitoring, and administration of data backup and recovery systems to ensure data integrity and availability.
- Perform regular system updates, patches, and routine maintenance to ensure infrastructure security and performance.
- Collaborate with cross-functional teams to support infrastructure requirements and ensure high availability and reliability of critical systems.
- Maintain accurate and up-to-date documentation of procedures, system configurations, and hardware inventories
Physical Requirements
- Work is performed while sitting/standing and interfacing with a personal computer.
- Requires the ability to communicate effectively using speech, vision, and hearing.
- Requires the regular use of hands for simple grasping and fine manipulations.
- Requires occasional bending, squatting, crawling, climbing, and reaching.
- Requires the ability to occasionally lift, carry, push, or pull medium weights, up to 50lbs.
Qualifications
Experience
- Minimum of 3 years of relevant experience in a system or infrastructure engineering role.
- Experience supporting on-premises/co-located data center environments with both physical and virtual infrastructure.
- Hands-on experience with enterprise server hardware, firmware/BIOS management, and hardware diagnostics.
- Disaster recovery planning, including failover testing and backup/recovery verification.
- Applied knowledge of monitoring performance metrics, conducting RCA, and using observability tools.
- Exposure to lifecycle management tasks such as provisioning, upgrades, and decommissioning.
- Project involvement in infrastructure rollouts, system migrations, and hardware refresh initiatives.
Education
- This role does not require a degree. We value relevant skills and experience and alignment with our core values above all else.
Desired Traits & Skills
Datacenter Management
- Physical server installation, racking, and cable management
- Power and cooling capacity planning
- Asset labeling, inventory tracking, and lifecycle documentation
- Environmental monitoring (temperature, humidity, airflow)
- Structured cabling standards (fiber, copper, patch panels)
- Equipment decommissioning and secure hardware disposal
- Physical access control and vendor coordination
- Hands-on use of remote management tools (e.g., iDRAC, iLO) for out-of-band access
- Maintenance of equipment racks, PDUs, and UPS systems
- Coordination of change windows and physical maintenance schedules
Storage, Backup & Disaster Recovery
- SAN, NAS, & object storage systems including NetApp, Dell EMC, Dell ECS
- Backup products including Commvault, Veeam, Rubrik
- Storage replication and deduplication technologies
- Backup policy configuration and retention management
- Disaster recovery planning and testing
- Backup validation and offsite replication
Enterprise Server Management
- Dell PowerEdge server hardware
- RAID array configuration
- Hardware standardization and diagnostics
- Capacity planning and resource forecasting
- Server lifecycle management (provisioning, patching, decommissioning)
- Data center operations and server racking
Operating Systems & Virtualization
- Windows Server (2016–2025)
- Linux (RHEL, Ubuntu, Rocky Linux)
- VMware vSphere/ESXi
- vCenter HA
Monitoring, Performance & Observability
- Zabbix, Prometheus, Grafana
- Event Viewer, Splunk, etc.
- Log aggregation and alerting
- Custom dashboard creation in Grafana
- System telemetry and metric collection
- Root cause analysis (RCA)
- Performance tuning and optimization
Read Full Description