Research led by Queen Mary’s Professor Claudia Langenberg and her team has helped to prove the effectiveness of large-scale protein studies using UK Biobank data to understand disease
UK Biobank is the world’s most comprehensive source of biomedical data available for health research in the public interest. In 2023 it released data on nearly 3,000 circulating proteins from 54,000 participants. In 2024, Professor Claudia Langenberg, Director of the Precision Healthcare University Research Institute (PHURI) at Queen Mary, and her colleagues published a landmark study using UK Biobank proteomic data to identify disease risk. This research was one of a small number of studies to use this unique data.
Building on this pilot data release, UK Biobank announced today a project to measure up to 5,400 proteins in each of 600,000 samples, including those taken from half a million UK Biobank participants and 100,000 second samples taken from these volunteers up to 15 years later. The new unique dataset is ten times larger than that used in the pilot, and is being funded by a consortium of 14 leading biopharmaceutical companies, known as the UK Biobank Pharma Proteomics Project.
This new project will allow researchers to explore a first-of-its-kind database, detailing how changes to an individual’s protein levels over mid-to-late life influence disease. The study will begin by analysing the first 300,000 samples, which will include initial samples from 250,000 UK Biobank volunteers and 50,000 second samples taken at follow-up assessments.
UK Biobank’s proteomics dataset will allow researchers to:
Professor Langenberg said: “Adding proteomic data for the full UK Biobank cohort will be an absolute game-changer for prediction of disease onset and prognosis, particularly for the many neglected diseases for which good prospective data are lacking. These include debilitating and life threating diseases, such as polycystic ovary syndrome and motor neurone disease. Just imagine if we could detect these and many other conditions much earlier than is currently possible.”
Professor Sir Rory Collins, Principal Investigator and Chief Executive of UK Biobank, said: “For the first time at this scale, researchers will be able to detect the exact causes of diseases by comparing how protein levels change over mid-to-late life in a large group of people. Proteomic data has already paved the way for better cancer, autoimmune and dementia diagnostics, and this truly exciting study of proteins will significantly speed up drug discovery, leading to major improvements in public health and care everywhere.”
It will take about a year to measure the protein levels in 300,000 participant samples. The proteomic data will be made available to UK Biobank-approved researchers in staggered releases from 2026, with the full dataset expected to be added to the UK Biobank Research Analysis Platform by 2027. During this time, additional funding will be sought to analyse samples from all remaining UK Biobank volunteers (an additional 250,000 participants, including second samples from a further 50,000).
For media information, contact: