Job Title: Data Center/Systems Administrator
Work Location: US- Hillsboro, OR 97124
The main duties are building, configuring, and maintaining high-performance computing clusters and cluster test beds in an internal test lab, with predominantly Linux-based clusters and high-performance fabrics such as InfiniBand and 100-400Gb Data Center Ethernet. The work includes installation and troubleshooting of both production and preproduction servers, GPUs, switches, and software, as well as development of scripts for automation of cluster provisioning and configuration.
In addition, the role includes support for users of the systems, internal documentation of practices, and general maintenance of the lab spaces.
Scope:
• Custom OS provisioning including Linux and Windows for client and server platforms, using Deploy Commander, FOG and AWX.
• Own infrastructure incident ticket resolution for escalations from first-level support teams and DSE engineers
• Own infrastructure request resolution from DSE partners and Lab Compute
• Hardware support for pre-production and production hardware: system assembly, parts replacement, firmware updates, break/fix, and inventory management, including storage and tracking in systems of record (Lab ServiceNow and others where required).
• Labs Capacity Management - Provision new systems in DSE Labs to optimize capacity and maintain system life cycle to drive efficiencies
• Partnership with Network & IT security teams to design and implement lab networks and firewall rules
• Update and maintain IT service management systems (ServiceNow)
• Document and communicate findings to all key stakeholders. Work with Infrastructure & Application Owners to create and maintain troubleshooting documentation.
• Work with Lab Compute to improve system usability, quality, and maintainability
• Engage with DSE & Lab Compute engineers to drive projects
• Physical system assembly, including adding and removing components
• Support Oregon DSE Lab infrastructure computer systems
Qualification/Skills, not limited to the following:
• Must have experience in Enterprise Linux and Windows Server Support Administration with Hardware Experience (Windows 2016/2019/2022; Linux: Red Hat, SUSE, and Ubuntu)
• Must have 2+ years of experience in Storage Support Administration with Hardware Experience (SAN/3PAR / EMC / NetApp NFS/CIFS)
• Must have 2+ years of experience in Client Support Administration with Hardware Experience (WinXP, Win7, Win10, Win11)
• Must have 2+ years of experience in Virtualization (MS Hyper-V, VMware, etc.)
• Experience with enterprise automated OS provisioning software (Altiris, BigFix, WDE, FOG, etc.) for Client & Server build platforms
• Experience working in Lab and Datacenter Environments
Preferred requirements:
• Enterprise/Intermediate OS high performance Linux clustering
• Advanced TCP/IP management (DNS, DHCP)
• Experience in one or more scripting languages, preferably Perl, Python, and/or shell scripting
• Network file storage and backup administration
• Building non-standard Linux kernels; batch system setup, tuning, and administration; LDAP setup and administration; server management using out-of-band tools; server platform performance tools
• Jenkins CI testing environment
• Working knowledge of software engineering practices, testing methodologies, etc.
• Knowledge of labs processes and business environments, and the ability to follow business processes in a disciplined manner
• BS Degree in computer science or related field.
The candidate should also exhibit the following behavioral traits and/or skills: analytical, diagnostic, and problem-solving skills; strong communication (both verbal and written); people skills; and a customer service ethic. The candidate needs to be able to work independently, readily adapt to change, and balance multiple priorities in a self-directed manner.
The candidate should also be able to integrate well into a dynamic, professional, and high-performing team whose members provide mutual support and coverage for one another.
Shift Hours
8am-5pm
Shift Days
Monday-Friday