Change search
ReferencesLink to record
Permanent link

Direct link
Complementing tissue characterization by integrating transcriptome profiling from the Human Protein Atlas and from the FANTOM5 consortium
KTH, School of Biotechnology (BIO), Proteomics and Nanobiotechnology. KTH, Centres, Science for Life Laboratory, SciLifeLab.
KTH, School of Biotechnology (BIO), Proteomics and Nanobiotechnology. KTH, Centres, Science for Life Laboratory, SciLifeLab.ORCID iD: 0000-0003-0198-7137
Show others and affiliations
2015 (English)In: Nucleic Acids Research, ISSN 0305-1048, E-ISSN 1362-4962, Vol. 43, no 14, 6787-6798 p.Article in journal (Refereed) Published
Abstract [en]

Understanding the normal state of human tissue transcriptome profiles is essential for recognizing tissue disease states and identifying disease markers. Recently, the Human Protein Atlas and the FANTOM5 consortium have each published extensive transcriptome data for human samples using Illumina-sequenced RNA-Seq and Heliscope-sequenced CAGE. Here, we report on the first large-scale complex tissue transcriptome comparison between full-length versus 5'-capped mRNA sequencing data. Overall gene expression correlation was high between the 22 corresponding tissues analyzed (R > 0.8). For genes ubiquitously expressed across all tissues, the two data sets showed high genome-wide correlation (91% agreement), with differences observed for a small number of individual genes indicating the need to update their gene models. Among the identified single-tissue enriched genes, up to 75% showed consensus of 7-fold enrichment in the same tissue in both methods, while another 17% exhibited multiple tissue enrichment and/or high expression variety in the other data set, likely dependent on the cell type proportions included in each tissue sample. Our results show that RNA-Seq and CAGE tissue transcriptome data sets are highly complementary for improving gene model annotations and highlight biological complexities within tissue transcriptomes. Furthermore, integration with image-based protein expression data is highly advantageous for understanding expression specificities for many genes.

Place, publisher, year, edition, pages
2015. Vol. 43, no 14, 6787-6798 p.
National Category
Biological Sciences
URN: urn:nbn:se:kth:diva-173983DOI: 10.1093/nar/gkv608ISI: 000360588200019PubMedID: 26117540OAI: diva2:858887
Knut and Alice Wallenberg Foundation

QC 20151005

Available from: 2015-10-05 Created: 2015-09-24 Last updated: 2015-10-05Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Hallström, Björn M.Fagerberg, LinnUhlén, Mathias
By organisation
Proteomics and NanobiotechnologyScience for Life Laboratory, SciLifeLab
In the same journal
Nucleic Acids Research
Biological Sciences

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 19 hits
ReferencesLink to record
Permanent link

Direct link