Provide Data Center Infrastructure Management for Lehigh University data centers. Provide input to and execute the design, implementation, maintenance, enhancement and monitoring of the data center infrastructure, which is composed of power, cooling, fire and security systems, while planning for future growth and changes in technology. Implement, perform and maintain hardware and software installations in the data centers. Collaborate with LTS architects to support their designs. The Lehigh community takes seriously our commitment to antiracism and The Principles of our Equitable Community.
1. Data Center Infrastructure Management (DCIM)
* Design, monitor, measure, optimize, manage and/or control data center utilization and energy consumption of all IT-related equipment (such as servers, storage and network switches) and facility infrastructure components (such as power distribution units [PDUs] and computer room air conditioners [CRACs])
* Research, design and implement data center improvements to support Disaster Recovery and Business Continuity Planning (DR/BCP) and continued growth
* Design and document data center processes and procedures
* Maintain the data center environment, ensuring that all guests working in the data center comply with approved processes
* Conduct capacity assessments of existing infrastructure to ensure that it can support future growth
* Manage and lead DC projects, including capacity planning for power and cooling
* Plan, research, formulate and prepare project analyses for the replacement of outdated technology
2. Leverage and support existing monitoring and metrics and define new ones to improve efficiency and scalability, and proactively analyze data to quantify risks and opportunities for efficiency gains
* Understand diagnostic methods for troubleshooting data center equipment
* Produce periodic status reports on utilization, progress, downtime, performance and costs
* Focus on on-premises data centers but incorporate cloud infrastructures
* Monitor energy and cooling usage across the data center to ensure efficient operation
3. Support Network Engineering, Research/HPC and System Engineering to perform required work within the data center environment
* Install and configure new servers, storage devices, routers, switches and other network equipment as designated in the data centers (Linux, Windows OS, firmware updates, repurposing hardware by removing and/or installing additional components)
* Repair servers (replace hard drives, memory, GPUs, fiber SFPs, cabling, etc.)
* Configure IPMI, update firmware and install operating systems (Linux, Windows, etc.)
* Implement automation via scripting (shell, Python)
* Independently perform server diagnostics and troubleshooting
* Assist with HPC grants by reviewing quotes to ensure conformity and to prepare for installation
* Review designs and quotes for recommended equipment to ensure conformity within DC environments
4. Disaster Recovery and Business Continuity Planning (DR/BCP)
* Lead DR improvement initiatives to restore normal business-critical operations after a disaster
* Lead BCP improvement initiatives to maintain essential functions during and after a natural or man-made disruption
* Participate in exercises
* Assist with documentation of processes
Grade: 11 - 40
Position Number: S97920
The duties of this position do not initially allow for a remote work option; the employee will be required to work on campus, where they can be fully accessible to the Lehigh community. The position will become eligible for partial remote work after six months to a year in the position.
This position has contact with minors
Will sometimes need to climb or balance, pull or push, see, stand, use hands to touch, handle or feel, reach with hands and arms, sit, stoop, kneel, crouch or crawl, walk, work near moving mechanical parts, lift 25 to 50 pounds and be subject to electrical hazards and frequent loud noise
Bachelor’s Degree in Electrical/Mechanical Engineering, Computer Science, Information Technology or a related field, or an equivalent combination of education and experience
Five to eight years of related work experience
Understanding of hardware, software and their installation
Ability to read and interpret data center diagrams and schematics
Problem-solving, troubleshooting and analytical skills
Strong attention to detail
Customer-focused mindset, with demonstrated skill in managing expectations, providing proactive status updates and producing high-quality work product
Ability to use independent judgment to make sound, justifiable decisions and take action to solve problems
Ability to communicate effectively with customers and internal staff and to work effectively in a team environment
A strong desire and aptitude for solving problems and performing deep technical dives to resolve issues quickly and efficiently
Electrical and HVAC knowledge
Successful completion of standard background checks including but not limited to: social security verification, education verification, national criminal background checks, motor vehicle checks, PATCH, FBI fingerprinting, Child Abuse Clearance and credit history based upon the requirements of the position
All Lehigh faculty and staff are required to be fully vaccinated and receive a booster shot six months after their second vaccine, unless they receive an approved medical or religious exemption from the requirement.
Only complete applications will be considered, so please complete the application in its entirety. Once the posting is removed from the website, applications can no longer be completed.
Lehigh is a premier residential research university, ranked in the top tier of national research universities each year. We are a coeducational, nondenominational, private university that offers a distinct academic environment of undergraduate and graduate students from across the globe.