Vacancy terms: Full Time, Fixed Term until 31 March 2023
Hours per week: 37.5
Advert closes midnight on: 30 Jul 2020
We are seeking a HPC Systems Specialist to join the Research Computing Team in our IT Services Department. The Research Computing Team delivers a range of services including local and national High-Performance Computing (HPC) facilities, a multi-petabyte Research Data Storage service, virtualised PaaS capability for hosting standard or bespoke servers, as well as a range of research support tools such as wikis, blogs and version control. The Team manages an estate of 900 servers, with 24000 CPU cores across three HPC systems and 6PB data.
In this role, you will design, develop and support the High Performance Computing and other infrastructure supporting the University’s critical research computing systems ensuring that requirements for functionality, security, performance, scalability, availability and service continuity are met. This role will focus in particular on providing support for DiRAC’s Data-Intensive service and Leicester’s ARM Catalyst HPC systems.
You will be responsible for running this infrastructure, and support the research groups that exploit it, taking responsibility for design and procurement, complex incident and problem resolution, preventative maintenance and upgrade work, capacity management and change management.
To be successful in this role you will have significant experience of working in a complex IT organisation using Linux systems design, being educated to degree level or equivalent in a relevant subject. You will experienced in working with HPC software and tools such as:
- Job schedulers (Slurm, PBS, Torque/Moab)
- Parallel file systems (Lustre, BeeGFS, SpectrumScale, Panasas)
- Cluster management software (Bright, OpenStack, xCAT)
- HPC software (Compilers, Infiniband, Spack/EasyBuild)
You will also have experience of working with the following technologies:
- Server virtualisation (VMware, OpenStack, Azure, AWS)
- Authentication and Authorization solutions (LDAP / Active Directory, PAM)
- Web servers (Apache, Nginx, HA Proxy, reverse-proxies, load-balancers)
- Open source databases (for example MySQL, Postgres, MongoDB)
- Network protocols and applications (DNS, DHCP, IPAM, Infoblox)
- Monitoring (e.g. Grafana, Zabbix, Prometheus)
- Configuration management (Puppet, Ansible, Chef, Salt, etc)
- Scripting languages (ideally Python)
For more information on the role and its requirements, please see the Job Summary Form.
Leicester is a leading University committed to international excellence, world-changing research and high quality, inspirational teaching. The University is ranked within the top 25 UK Universities in all international league tables and we are strongly committed to inclusivity, promoting equality and celebrating diversity among our staff. Located in the heart of the UK, you will have the opportunity to progress your career in a supportive and varied environment that values hard work and recognises individual achievement.
We offer a working environment that is committed to inclusivity, through promoting equality and valuing diversity. We offer a competitive salary package with excellent pension scheme and a generous annual leave allowance. Located close to Leicester city centre, our award winning campus benefits from a wide range of cafes, a fully equipped sports centre and nursery facilities. Further information regarding our extensive range of staff benefits is available here.