Data Scientist (Large Language Model Officer) - SeRP
| Posting date: | 27 November 2025 |
|---|---|
| Salary: | £34,132 to £38,249 per year |
| Additional salary information: | together with USS pension benefits |
| Hours: | Full time |
| Closing date: | 11 December 2025 |
| Location: | Swansea, Wales |
| Remote working: | On-site only |
| Company: | Swansea University |
| Job type: | Contract |
| Job reference: | SU01256 |
Summary
This is a full-time, fixed-term role until December 2026.
The purpose of the role is to support Swansea University’s role within the Dementias Platform UK (DPUK) collaboration, a £53 million public-private endeavour funded by the Medical Research Council to create a wide-reaching and innovative dementias research facility, incorporating research disciplines from stem-cell research to data analysis. DPUK is a world-leading resource for person-focussed dementias research, designed to fast-track scientific knowledge, treatments and the prevention of the disease. DPUK enables researchers to access data via a virtual desktop environment. The post offers a unique opportunity to work on a project that uses the latest AI techniques and large language models (LLMs) to develop data discovery and feasibility tools, as well as detection of personally identifiable information (PII).
As a data scientist you will:
o Work on research projects within the DPUK data portal team, including:
• Taking part in team planning and management
• Research design
• Data preparation
• Statistical analysis
• Writing results for publication
o Contribute to DPUK support activities, including provisioning data to projects, reviewing project outputs, handling researcher queries about DPUK, and helping researchers develop project ideas.
o Work with the DPUK team to develop and test various LLM methods using data and metadata within the data portal, compare the methods, and explore the opportunities and challenges of different approaches.
o Support the development of a user interface that lets users query across DPUK data and metadata, and work on PII detection projects.