kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Systematically identification of survival-associated eQTLs in a Japanese kidney cancer cohort
KTH, Centres, Science for Life Laboratory, SciLifeLab. KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Protein Science, Systems Biology.ORCID iD: 0009-0001-3893-682X
KTH, Centres, Science for Life Laboratory, SciLifeLab. KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Protein Science, Systems Biology.ORCID iD: 0000-0002-0955-6289
KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Protein Science, Systems Biology. KTH, Centres, Science for Life Laboratory, SciLifeLab.ORCID iD: 0000-0002-8301-9959
KTH, Centres, Science for Life Laboratory, SciLifeLab. KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Protein Science, Systems Biology.ORCID iD: 0000-0002-9248-3294
Show others and affiliations
2025 (English)In: PLOS Genetics, ISSN 1553-7390, E-ISSN 1553-7404, Vol. 21, no 7 July, article id e1011770Article in journal (Refereed) Published
Abstract [en]

Background Clear cell renal carcinoma (ccRCC) is the predominant form of kidney cancer, but the prognostic value of expression quantitative trait loci (eQTLs) remains underexplored, particularly in Asian populations. Objective We analyzed whole-exome sequencing and RNA sequencing data from 100 Japanese ccRCC patients to identify eQTLs. Multiple Cox proportional hazard models assessed survival associations, with validation in the Cancer Genome Atlas ccRCC cohort (n = 287). Results We identified 805 eGenes and 4,558 cis-eQTLs in the Japanese cohort. Survival analysis revealed a total of 9 eGenes significantly associated with overall survival (FDR < 0.05). Further exploratory analysis were performed using 158 eGenes and 711 eQTLs (p-value <0.05) as potential prognostic signals. Among these, 223 eQTLs regulating 54 eGenes showed consistent prognostic effects at both expression and genetic levels. Cross-population validation identified eight eQTLs regulating 11 eGenes with reproducible survival associations across ethnicities, including a missense mutation in ERV3–1 and regulatory variants near ANKRD20A7P. These variants demonstrated consistent allelic effects on both gene expression and patient survival in both cohorts.

Place, publisher, year, edition, pages
Public Library of Science (PLoS) , 2025. Vol. 21, no 7 July, article id e1011770
National Category
Cancer and Oncology
Identifiers
URN: urn:nbn:se:kth:diva-368942DOI: 10.1371/journal.pgen.1011770ISI: 001524169900006PubMedID: 40622919Scopus ID: 2-s2.0-105009893848OAI: oai:DiVA.org:kth-368942DiVA, id: diva2:1992651
Note

QC 20250828

Available from: 2025-08-28 Created: 2025-08-28 Last updated: 2026-03-30Bibliographically approved
In thesis
1. Genomic-based approaches for identifying risk loci and facilitating precision medicine in human diseases
Open this publication in new window or tab >>Genomic-based approaches for identifying risk loci and facilitating precision medicine in human diseases
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Genomics represents the first layer of the central dogma, where variations in the genome influence downstream biological processes and finally affect human phenomics. Although next-generation sequencing (NGS) technologies have generated massive amounts of genomic data, gaps remain in translating genomics into clinical and precision medicine applications.

The first part of this thesis (Papers I–II) focuses on developing high-performance computational platforms for clinical genomic and research applications. In Paper I, we developed GenRiskPro, a platform designed to enhance connectivity among key stakeholders in clinical genetics, including hospitals, research facilities, clinicians, and patients. It prioritizes reporting genetic risk variants for rare diseases and includes tools for detecting pharmacogenomic (PGx) and lifestyle-associated variants affecting complex traits. We also revealed population-specific genetic heterogeneity in the enrichment of pathogenic and low-to-rare-frequency risk variants, which frequently exhibit low penetrance in their phenomic presentations.

In Paper II, we developed OncoRisk, a comprehensive web server integrating multiple precision oncology knowledge bases and pan-cancer cohorts. The server comprises four major modules. It enables the query of key oncogenic terms such as mutations, gene fusions, diseases, and therapies, and supports analysis of individual tumor sequencing data to identify potentially significant mutations by mapping with known oncogenic resources. We also built functions for fast visualization and analysis of cancer cohort data and for querying of mutation frequencies for specific genes or variants across large-scale cancer sequencing cohorts.

The second part of this thesis (Papers III–IV) explores the genetic architecture of complex diseases using our in-house cohorts. In Paper III, we explored the survival effects of the expression quantitative trait loci (eQTLs). Firstly, we identified 805 eGenes and 4,558 cis-eQTLs in a Japanese kidney cancer cohort (n=100). Then, we validated these findings cross-ethnically using TCGA data (n=287) through comprehensive survival analyses across different allelic and covariate models, revealing regulatory variants with consistent effects on patient survival. Lastly, in Paper IV, we conducted an integrative genomic-transcriptomic study in a pediatric congenital heart disease (CHD) cohort. We have identified known pathogenic variants and found that rare missense variant burdens of CHD-associated genes were significantly enriched in CHD patients compared to controls. Transcriptomic analysis revealed shifts in oxidative phosphorylation and interferon signaling. Functional analysis of overlapping eGenes and differentially expressed genes (DEGs) highlighted involvement in small GTPase-mediated signaling and cytoskeleton organization, and integration of rare-variant burdens further identified high-confidence candidates.

In summary, this thesis demonstrates the complex impacts of rare and common variants on human health across both rare and complex disease contexts. By developing computational platforms to assist in identifying risk loci and leveraging genomic data to uncover novel potential drivers, this work aims to advance translational genetics and precision medicine.

Abstract [sv]

Genomik representerar det första lagret i den centrala dogmen, där variationer i genomet påverkar nedströms biologiska processer och slutligen formar human fenomik. Även om nästa generations sekvenseringsteknik (NGS) har genererat enorma mängder genomiska data, kvarstår betydande luckor i tillämpningen av genomik inom klinisk medicin och precisionsmedicin.

Den första delen av denna avhandling (Artikel I–II) fokuserar på utveckling av högpresterande plattformar för kliniska genomiska och forskningstillämpningar. I Artikel I utvecklade vi GenRiskPro, en plattform utformad för att förbättra samverkan mellan viktiga aktörer inom klinisk genetik, däribland sjukhus, forskningsanläggningar, kliniker och patienter. Plattformen prioriterar rapportering av genetiska riskvarianter för sällsynta sjukdomar och inkluderar verktyg för att identifiera farmakogenomiska (PGx) och livsstilsassocierade varianter som påverkar komplexa egenskaper. Vi påvisade även populationsspecifik genetisk heterogenitet i anrikningen av patogena varianter och varianter med låg till sällsynt frekvens, vilka ofta uppvisar låg penetrans i sina fenomiska manifestationer.

I Artikel II utvecklade vi OncoRisk, en omfattande webbserver som integrerar flera kunskapsbaser inom precisionsonkologi samt pan-cancerkohort-data. Servern består av fyra huvudmoduler. Den möjliggör sökning av centrala onkogena termer såsom mutationer, genfusioner, sjukdomar och terapier, samt stödjer analys av individuella tumörsekvenseringsdata för att identifiera potentiellt betydelsefulla mutationer genom kartläggning mot kända onkogena resurser. Vi utvecklade även funktioner för snabb visualisering och analys av cancerkohortsdata samt för sökning av mutationsfrekvenser för specifika gener eller varianter i storskaliga cancersekvenseringskohorter.

Den andra delen av avhandlingen (Artikel III–IV) utforskar den genetiska arkitekturen hos komplexa sjukdomar med hjälp av våra egna kohorter. I Artikel III undersökte vi överlevnadseffekterna av expressionskvantitativa traitloki (eQTL). Vi identifierade inledningsvis 805 eGener och 4 558 cis-eQTL i en japansk njurcancerkohort (n=100). Därefter validerade vi dessa fynd tväretniskt med hjälp av TCGA-data (n=287) genom omfattande överlevnadsanalyser över olika alleliska och kovariat-modeller, vilket påvisade regulatoriska varianter med konsekventa effekter på patientöverlevnad.

I Artikel IV genomförde vi en integrativ genomisk-transkriptomisk studie i en pediatrisk kohort med medfödd hjärtsjukdom (CHD). Vi identifierade kända patogena varianter och fann att bördan av sällsynta missense-varianter i CHD-associerade gener var signifikant anrikad hos CHD-patienter jämfört med kontroller. Transkriptomisk analys påvisade förändringar i oxidativ fosforylering och interferonsignalering. Funktionell analys av överlappande eGener och differentiellt uttryckta gener (DEG) belyste deras involvering i liten GTPas-medierad signalering och cytoskelettonisering, och integration av sällsynta variantbördor identifierade ytterligare högkonfidens-kandidater.

Sammanfattningsvis belyser denna avhandling de komplexa effekterna av sällsynta och vanliga varianter på människors hälsa inom både sällsynta och komplexa sjukdomskontexter. Genom att utveckla beräkningsplattformar för att underlätta identifiering av riskloki och utnyttja genomiska data för att upptäcka nya potentiella drivande faktorer, syftar detta arbete till att främja translationell genetik och precisionsmedicin.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2026. p. 92
Series
TRITA-CBH-FOU ; 2026:18
Keywords
Clinical Genomics, Translational Genetics, Genomic Architecture, Variant Interpretation, Precision Medicine, Precision Oncology, Web Server Development, eQTL Analysis, Rare and Complex Diseases, Multi-Omics, Bioinformatics, Systems Biology
National Category
Bioinformatics and Computational Biology Medical Genetics and Genomics Medical Bioinformatics and Systems Biology
Research subject
Biotechnology
Identifiers
urn:nbn:se:kth:diva-378880 (URN)978-91-8106-572-5 (ISBN)
Public defence
2026-04-23, F3 KTH Campus, via Zoom: https://kth-se.zoom.us/j/69920511035, Lindstedtvägen 26, Stockholm, 14:00 (English)
Opponent
Supervisors
Funder
Knut and Alice Wallenberg Foundation, 72110
Note

QC 2026-03-30

Available from: 2026-03-30 Created: 2026-03-30 Last updated: 2026-04-08Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Song, XiyaJin, HanLi, XiangyuYuan, MengYang, HongZhang, ChengMardinoglu, Adil

Search in DiVA

By author/editor
Song, XiyaJin, HanLi, XiangyuYuan, MengYang, HongZhang, ChengMardinoglu, Adil
By organisation
Science for Life Laboratory, SciLifeLabSystems Biology
In the same journal
PLOS Genetics
Cancer and Oncology

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 68 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf