Research Computing System Administrator University of Surrey

The University of Surrey Surrey Institute for People-Centred AI (PAI) is recruiting for a Research Computing System Administrator to primarily contribute to managing CoSTAR AI compute, including HPC compute facilities, user
support and management of user access/scheduling led by the University of Surrey IT support as part of the UKRI AHRC CoSTAR National Lab, as well as other PAI (people centred AI institute) research programmes as required.
The role is part of the wider PAI/CVSSP professional services team, and the post holder will work closely with the
PAI/CVSSP Facilities Manager. Day to day the post holder will be embedded within the Research computing
services team in IT services, who manage the CoStar/AI compute infrastructure on behalf of PAI/CVSSP and will
report to the Lead Research Computing system administrator.
Required skills and experience
- Full range of system administration skills including user management, building/deployment, installing scientific software packages, performance benchmarking, resource utilisation/performance and availability monitoring and incident response.
- automation of repetitive tasks in the form of developing and maintaining Ansible playbooks and roles
- using git for version control, change management and collaboration.
- Pro-actively and reactively identify, diagnose, and resolve faults and areas of suboptimal performance across research platforms.
- Communicate directly with customers providing specialist guidance and support.
- Practical working knowledge of Linux Systems & High-Performance Computing (HPC).