kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A new machine learning method for cancer mutation analysis
Islamic Azad Univ, Dept Math, Qazvin Branch, Qazvin, Iran..ORCID iD: 0000-0002-8969-2706
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Computational Science and Technology (CST). KTH, Centres, Science for Life Laboratory, SciLifeLab.ORCID iD: 0000-0002-2741-0355
2022 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 18, no 10, article id e1010332Article in journal (Refereed) Published
Abstract [en]

It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis. Author summary It can be challenging to find mutations that cause cancer. One of the most trustworthy characteristics for identifying cancer-causing mutations is the recurrence of a mutation in patients. However, some uncommon and low-frequency mutations should also be explored as cancer-related mutations, particularly in the setting of networks. In this study, we suggested a unique approach to discover prospective driver genes and investigate the functionality of mutually exclusive genes in networks formed from mutation connections and gene-gene interactions. These networks have identified critical biological elements in the vital pathways, notably in those that experience infrequent mutations. In the first step, we established six enlightening topological features for each gene acting as a network node. For each gene, we computed the score for our predefined features. Then, we suggested the high-scoring genes with significant connections to cancer as potential targets for further research. In the second step, we constructed a network based on the relationships between the high-score genes to find the cancer-related modules. We used what we had learned in the first step about how the high-score potential driver genes interact physically, biologically, and in terms of how they work to build this network.

Place, publisher, year, edition, pages
Public Library of Science (PLoS) , 2022. Vol. 18, no 10, article id e1010332
National Category
Medical Genetics and Genomics
Identifiers
URN: urn:nbn:se:kth:diva-324533DOI: 10.1371/journal.pcbi.1010332ISI: 000924885500001PubMedID: 36251702Scopus ID: 2-s2.0-85140933352OAI: oai:DiVA.org:kth-324533DiVA, id: diva2:1741775
Note

QC 20230320

Available from: 2023-03-07 Created: 2023-03-07 Last updated: 2025-02-10Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Taheri, Golnaz

Search in DiVA

By author/editor
Habibi, MahnazTaheri, Golnaz
By organisation
Computational Science and Technology (CST)Science for Life Laboratory, SciLifeLab
In the same journal
PloS Computational Biology
Medical Genetics and Genomics

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 38 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf