kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Interpreting protein abundance in Saccharomyces cerevisiae through relational learning
Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg 412 96, Sweden; Department of Computer Science and Engineering, Chalmers University of Technology, Rännvägen 6B, Gothenburg 412 96, Sweden., Rännvägen 6B.
Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg 412 96, Sweden.
KTH, School of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), Industrial Biotechnology. Department of Life Sciences, Chalmers University of Technology, Gothenburg 412 96, Sweden.ORCID iD: 0000-0002-0408-3515
Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg 412 96, Sweden; Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, United Kingdom; The Alan Turing Institute, London NW1 2DB, United Kingdom.
2024 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 40, no 2, article id btae050Article in journal (Refereed) Published
Abstract [en]

Motivation: Proteomic profiles reflect the functional readout of the physiological state of an organism. An increased understanding of what controls and defines protein abundances is of high scientific interest. Saccharomyces cerevisiae is a well-studied model organism, and there is a large amount of structured knowledge on yeast systems biology in databases such as the Saccharomyces Genome Database, and highly curated genome-scale metabolic models like Yeast8. These datasets, the result of decades of experiments, are abundant in information, and adhere to semantically meaningful ontologies. Results: By representing this knowledge in an expressive Datalog database we generated data descriptors using relational learning that, when combined with supervised machine learning, enables us to predict protein abundances in an explainable manner. We learnt predictive relationships between protein abundances, function and phenotype; such as a-amino acid accumulations and deviations in chronological lifespan. We further demonstrate the power of this methodology on the proteins His4 and Ilv2, connecting qualitative biological concepts to quantified abundances. Availability and implementation: All data and processing scripts are available at the following Github repository: https://github.com/ DanielBrunnsaker/ProtPredict.

Place, publisher, year, edition, pages
Oxford University Press , 2024. Vol. 40, no 2, article id btae050
National Category
Bioinformatics and Computational Biology
Identifiers
URN: urn:nbn:se:kth:diva-344019DOI: 10.1093/bioinformatics/btae050Scopus ID: 2-s2.0-85185180243OAI: oai:DiVA.org:kth-344019DiVA, id: diva2:1841389
Note

QC 20240229

Available from: 2024-02-28 Created: 2024-02-28 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Tiukova, Ievgeniia A.

Search in DiVA

By author/editor
Tiukova, Ievgeniia A.
By organisation
Industrial Biotechnology
In the same journal
Bioinformatics
Bioinformatics and Computational Biology

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 50 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf