kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
The blood proteome as a window into human health and disease
KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Protein Science, Systems Biology. KTH, Centres, Science for Life Laboratory, SciLifeLab.ORCID iD: 0000-0002-2669-7796
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The circulating proteome is a dynamic and accessible window into the biological state of the human body, reflecting its physiological and pathological processes. Advances in technologies to measure the plasma proteome now enable the measurement of thousands of proteins at population scale, opening new opportunities for the discovery of clinically relevant and minimally invasive biomarkers. These approaches hold promise for improving disease detection, patient stratification, and disease monitoring, positioning plasma proteomics at the forefront of precision medicine. Key to these advances are bioinformatic methods that identify candidate proteins associated with specific health and disease states from high-dimensional datasets. Despite the efforts combining large-scale proteomics with computational analyses, relatively few biomarkers have translated into clinical practice. This highlights the need for investigations that incorporate diverse cohorts and expand on classic comparisons against healthy controls, alongside an increased focus on validation strategies.

This thesis contributes to biomarker discovery by broadening the biological contexts that are profiled and compared. The first studies focus on cancer, starting by predicting the presence of cancer in patients with non-specific symptoms in Paper I, and identifying a protein panel able to distinguish between twelve cancer types in Paper II. Building on these findings, Paper III provides a deeper perspective of the circulating proteome across healthy individuals, during development, adulthood and aging, and a wide range of diseases. This is followed by Paper IV, which focuses on comparing the two main affinity proteomics platforms by assessing their complementarity and applicability in biomarker studies. Finally, the analysis of these large-scale datasets led to the development of streamlined bioinformatics pipelines, which are presented as an open-access package in Paper V.

Together, this work illustrates the potential of combining affinity proteomics with bioinformatics pipelines to profile the circulating proteome and derive biological insights. This thesis focuses on pan-disease comparisons, evaluates the complementarity of affinity proteomics platforms, and highlights the importance of reproducible biomarker discovery workflows. Developed within the framework of the Human Disease Blood Resource, the resulting data and insights are integrated into the Human Protein Atlas (www.proteinatlas.org), providing a resource for precision medicine research.

Abstract [sv]

Proteiner i blodplasma återspeglar de fysiologiska och patologiska processer som sker i kroppen och utgör därmed en unik källa till information om människokroppens hälsotillstånd. Den tekniska utvecklingen inom analys av plasmaproteomet har möjliggjort att tusentals proteiner kan kvantifieras i tusentals prover. Detta har skapat goda förutsättningar för identifiering av nya kliniskt relevanta biomarkörer som kan mätas i ett enkelt blodprov och användas inom diagnostik, riskstratifiering och prognos. Tillämpningen av bioinformatiska metoder på högdimensionell data har varit en nyckel till identifiering av proteiner kopplade till specifika hälso- och sjukdomstillstånd. Trots omfattande metodutveckling inom både bioinformatik och olika proteomikplattformar har få biomarkörer introducerats inom klinisk kemi, vilket understryker behovet av bredare studier, utökande jämförelsegrupper och ett större fokus på validering.

Syftet med denna avhandling är att bidra till nya lovande biomarkörspaneler genom att skräddarsy och optimera de sammanhang där blodplasmaproteomet studeras. De inledande studierna fokuserar på cancerdiagnostik: i Artikel I identifieras biomarkörspaneler som kan ge ledtrådar om tidig utvecklad cancer hos patienter med diffusa symptom, och i Artikel II identifieras proteiner med potential att urskilja tolv olika cancerformer från varandra. Artikel III bygger vidare på de tidigare nämnda studierna genom att även inkludera friska vuxna som kombinerats med uppföljningsstudier hos barn som växer upp, äldre samt ett stort antal sjukdomstillstånd. Detta följs upp med Artikel IV som undersöker hur väl olika affinitetsproteomikplattformar stämmer överens, samt hur resultatet översätts till biologisk kontext. Slutligen presenterar Artikel V ett programmeringsbibliotek som innehåller flera analysmetoder för att effektivisera biomarkörsforskning baserad på storskalig data.

Sammantaget visar avhandlingen hur affinitetsproteomik och bioinformatiska arbetsflöden kan kombineras för att utforska det cirkulerande proteomet i olika hälso- och sjukdomstillstånd. Det övergripande arbetet belyser värdet av sjukdomsöverskridande jämförelser samt vikten av reproducerbara arbetsflöden vid biomarkörsstudier. Avhandlingen har genomförts inom ramen för Human Disease Blood Resource, som är en del av Human Protein Atlas (www.proteinatlas.org) där data och resultat som genererats som en resurs inom precisionsmedicin.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2026. , p. 77
Series
TRITA-CBH-FOU ; 2026:22
Keywords [en]
plasma proteomics, biomarker discovery, protein profiling, affinity proteomics, pan-disease analysis, disease signatures, precision medicine, differential expression, machine learning, feature selection, classification models, Proximity Extension Assay, Olink proteomics, Human Protein Atlas, Human Disease Blood Atlas
National Category
Medical Biotechnology
Research subject
Biotechnology
Identifiers
URN: urn:nbn:se:kth:diva-380258ISBN: 978-91-8106-593-0 (print)OAI: oai:DiVA.org:kth-380258DiVA, id: diva2:2056032
Public defence
2026-05-22, Eva & George Klein, via Zoom: https://kth-se.zoom.us/j/69364610322, Solnavägen, 9, Solna, 13:30 (English)
Opponent
Supervisors
Note

QC 2026-04-28

Available from: 2026-04-28 Created: 2026-04-27 Last updated: 2026-05-11Bibliographically approved
List of papers
1. Plasma protein profiling predicts cancer in patients with non-specific symptoms
Open this publication in new window or tab >>Plasma protein profiling predicts cancer in patients with non-specific symptoms
Show others...
2025 (English)In: Nature Communications, E-ISSN 2041-1723, Vol. 17, no 1, article id 151Article in journal (Refereed) Published
Abstract [en]

Cancer detection is challenging, especially in patients with diffuse symptoms that overlap with non-malignant conditions. Here we show that plasma protein profiling can identify cancer among patients with non-specific symptoms. Using proximity extension assay-based proteomics of 1463 plasma proteins from 456 patients presenting with non-specific symptoms sampled prior to cancer diagnostic work-up and diagnosis, we identify 29 proteins associated with new cancer diagnoses. We develop a model able to stratify 160 cancer cases and 296 non-cancer cases with an area under the curve of 0.80, maintaining performance (0.82) in an independent replication cohort of 238 patients. The model also distinguishes cancer from autoimmune, inflammatory and infectious diseases. Designed as a triage tool, our model based on a blood test could help prioritize patients at higher cancer risk for rapid and highly sensitive diagnostic modalities such as positron emission tomography-computed tomography. These findings emphasize the potential of blood proteome profiling to support timely diagnosis and transform clinical medicine.

Place, publisher, year, edition, pages
Springer Nature, 2025
National Category
Cancer and Oncology
Identifiers
urn:nbn:se:kth:diva-378073 (URN)10.1038/s41467-025-67688-3 (DOI)001655544200002 ()41457066 (PubMedID)2-s2.0-105026913946 (Scopus ID)
Note

QC 20260318

Available from: 2026-03-18 Created: 2026-03-18 Last updated: 2026-04-27Bibliographically approved
2. Next generation pan-cancer blood proteome profiling using proximity extension assay
Open this publication in new window or tab >>Next generation pan-cancer blood proteome profiling using proximity extension assay
Show others...
2023 (English)In: Nature Communications, E-ISSN 2041-1723, Vol. 14, no 1, article id 4308Article in journal (Refereed) Published
Abstract [en]

A comprehensive characterization of blood proteome profiles in cancer patients can contribute to a better understanding of the disease etiology, resulting in earlier diagnosis, risk stratification and better monitoring of the different cancer subtypes. Here, we describe the use of next generation protein profiling to explore the proteome signature in blood across patients representing many of the major cancer types. Plasma profiles of 1463 proteins from more than 1400 cancer patients are measured in minute amounts of blood collected at the time of diagnosis and before treatment. An open access Disease Blood Atlas resource allows the exploration of the individual protein profiles in blood collected from the individual cancer patients. We also present studies in which classification models based on machine learning have been used for the identification of a set of proteins associated with each of the analyzed cancers. The implication for cancer precision medicine of next generation plasma profiling is discussed.

Place, publisher, year, edition, pages
Springer Nature, 2023
National Category
Cancer and Oncology Medical Biotechnology (with a focus on Cell Biology (including Stem Cell Biology), Molecular Biology, Microbiology, Biochemistry or Biopharmacy)
Identifiers
urn:nbn:se:kth:diva-333884 (URN)10.1038/s41467-023-39765-y (DOI)001037322100032 ()37463882 (PubMedID)2-s2.0-85165345608 (Scopus ID)
Note

QC 20230815

Available from: 2023-08-15 Created: 2023-08-15 Last updated: 2026-04-27Bibliographically approved
3. A human pan-disease blood atlas of the circulating proteome
Open this publication in new window or tab >>A human pan-disease blood atlas of the circulating proteome
Show others...
2025 (English)In: Science, ISSN 0036-8075, E-ISSN 1095-9203, Vol. 390, no 6779, article id eadx2678Article in journal (Refereed) Published
Abstract [en]

The human blood proteome provides a holistic readout of health states through the assessment of thousands of circulating proteins. In this study, we present a pan-disease resource to enable the study of diverse disease phenotypes within a harmonized proteomics dataset. By profiling protein concentrations across 59 diseases and healthy cohorts, we identified proteins associated with age, sex, and body mass index, as well as disease-specific signatures. This study highlights shared and distinct protein patterns across conditions, demonstrating the power of a unified proteomics approach to uncover biological insights. The dataset, covering 8262 individuals and up to 5416 proteins, serves as an online resource for exploring disease-specific protein profiles and advancing precision medicine research.

Place, publisher, year, edition, pages
American Association for the Advancement of Science (AAAS), 2025
National Category
Medical Biotechnology (Focus on Cell Biology, (incl. Stem Cell Biology), Molecular Biology, Microbiology, Biochemistry or Biopharmacy)
Identifiers
urn:nbn:se:kth:diva-378079 (URN)10.1126/science.adx2678 (DOI)001643421200001 ()41066540 (PubMedID)2-s2.0-105025246161 (Scopus ID)
Note

QC 20260318

Available from: 2026-03-18 Created: 2026-03-18 Last updated: 2026-04-27Bibliographically approved
4. The blood proteome across health and disease using two affinity-based platforms
Open this publication in new window or tab >>The blood proteome across health and disease using two affinity-based platforms
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

The human blood proteome offers a powerful window into health and disease, allowing for precision medicine efforts, involving early detection of disease, stratification of patients, and monitoring of therapeutic regimes. Here, we report on the analysis of 28 diseases and 2,254individuals using two next generation blood profiling platforms, Proximity Extension Assay and SomaScan, with analysis of over 5,400 and 9,500 protein targets, respectively. The study enables cross-platform comparison of disease-associated profiles and predictive models to find patients with a particular disease. We also cover the longitudinal analysis of children through puberty and the dramatic changes in women during pregnancy. The study includes the analysis of a longevity cohort with healthy old-aged individuals, and we report on a revised model for biological age based on a selected subset of the circulating proteome. The data is visualized in a new version of the open access Blood Resource as part of the Human Protein Atlas allowing comparisons of the two analytical platforms to explore the proteome profiles across the blood from healthy individuals and patients with diseases.

National Category
Medical and Health Sciences
Research subject
Biotechnology
Identifiers
urn:nbn:se:kth:diva-380253 (URN)
Note

QC 20260508

Available from: 2026-04-27 Created: 2026-04-27 Last updated: 2026-05-08Bibliographically approved
5. HDAnalyzeR: streamlining data analysis for biomarker research
Open this publication in new window or tab >>HDAnalyzeR: streamlining data analysis for biomarker research
Show others...
2026 (English)In: Bioinformatics Advances, E-ISSN 2635-0041, Vol. 6, no 1, article id vbag020Article in journal (Refereed) Published
Abstract [en]

Motivation: Exploration of large-scale biological datasets remains a central challenge in computational biology. While many tools are available, they are often developed in isolation, leading to fragmented workflows, duplicated efforts, and limited reproducibility. There is a pressing need for flexible, standardized solutions that unify exploratory data analysis and biomarker discovery across diverse platforms.

Results: We present HDAnalyzeR, a user-friendly and extensible R package for the streamlined analysis of high-dimensional biological data. HDAnalyzeR provides modular, reproducible workflows that support a range of analyses, from quality control and dimensionality reduction to differential expression and enrichment analysis. The package features built-in visualization, metadata-aware modeling, and seamless integration with interactive apps and learning resources. We also present two case studies, where HDAnalyzeR dramatically reduced analysis time and code complexity while providing biologically meaningful insights, such as classification of blood cancer types with AUC = 1.0 and identification of thousands of solid tumor-associated genes. HDAnalyzeR is designed to support both beginner users and experienced bioinformaticians, promoting transparency, reproducibility, and publication-quality output.

Availability and implementation: HDAnalyzeR is freely available both as an open-source R package at https://github.com/kantonopoulos/HDAnalyzeR and a web application at https://hdanalyzer.serve.scilifelab.se.

Place, publisher, year, edition, pages
Oxford University Press (OUP), 2026
National Category
Bioinformatics and Computational Biology Software Engineering Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:kth:diva-377879 (URN)10.1093/bioadv/vbag020 (DOI)001695984800001 ()41732669 (PubMedID)2-s2.0-105030823868 (Scopus ID)
Note

QC 20260306

Available from: 2026-03-06 Created: 2026-03-06 Last updated: 2026-04-27Bibliographically approved

Open Access in DiVA

MariaBuenoAlvez-thesis(6962 kB)272 downloads
File information
File name FULLTEXT01.pdfFile size 6962 kBChecksum SHA-512
b24682279abfd8875f97785bb05cfe9988ce800451c0997ce3c1dd6a21213e909a307204a2fe13b49f08697bec55cb85dca071afed5d42e778f6ee58e7033092
Type fulltextMimetype application/pdf

Authority records

Bueno Álvez, María

Search in DiVA

By author/editor
Bueno Álvez, María
By organisation
Systems BiologyScience for Life Laboratory, SciLifeLab
Medical Biotechnology

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 2420 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf