kth.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (5 of 5) Show all publications
Habibi, M. & Taheri, G. (2022). A new machine learning method for cancer mutation analysis. PloS Computational Biology, 18(10), Article ID e1010332.
Open this publication in new window or tab >>A new machine learning method for cancer mutation analysis
2022 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 18, no 10, article id e1010332Article in journal (Refereed) Published
Abstract [en]

It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis. Author summary It can be challenging to find mutations that cause cancer. One of the most trustworthy characteristics for identifying cancer-causing mutations is the recurrence of a mutation in patients. However, some uncommon and low-frequency mutations should also be explored as cancer-related mutations, particularly in the setting of networks. In this study, we suggested a unique approach to discover prospective driver genes and investigate the functionality of mutually exclusive genes in networks formed from mutation connections and gene-gene interactions. These networks have identified critical biological elements in the vital pathways, notably in those that experience infrequent mutations. In the first step, we established six enlightening topological features for each gene acting as a network node. For each gene, we computed the score for our predefined features. Then, we suggested the high-scoring genes with significant connections to cancer as potential targets for further research. In the second step, we constructed a network based on the relationships between the high-score genes to find the cancer-related modules. We used what we had learned in the first step about how the high-score potential driver genes interact physically, biologically, and in terms of how they work to build this network.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2022
National Category
Medical Genetics and Genomics
Identifiers
urn:nbn:se:kth:diva-324533 (URN)10.1371/journal.pcbi.1010332 (DOI)000924885500001 ()36251702 (PubMedID)2-s2.0-85140933352 (Scopus ID)
Note

QC 20230320

Available from: 2023-03-07 Created: 2023-03-07 Last updated: 2025-02-10Bibliographically approved
Taheri, G. & Habibi, M. (2022). Comprehensive analysis of pathways in Coronavirus 2019 (COVID-19) using an unsupervised machine learning method. Applied Soft Computing, 128, Article ID 109510.
Open this publication in new window or tab >>Comprehensive analysis of pathways in Coronavirus 2019 (COVID-19) using an unsupervised machine learning method
2022 (English)In: Applied Soft Computing, ISSN 1568-4946, E-ISSN 1872-9681, Vol. 128, article id 109510Article in journal (Refereed) Published
Abstract [en]

The World Health Organization (WHO) introduced “Coronavirus disease 19” or “COVID-19” as a novel coronavirus in March 2020. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires the fast discovery of effective treatments to fight this worldwide crisis. Artificial intelligence and bioinformatics analysis pipelines can assist with finding biomarkers, explanations, and cures. Artificial intelligence and machine learning methods provide powerful infrastructures for interpreting and understanding the available data. On the other hand, pathway enrichment analysis, as a dominant tool, could help researchers discover potential key targets present in biological pathways of host cells that are targeted by SARS-CoV-2. In this work, we propose a two-stage machine learning approach for pathway analysis. During the first stage, four informative gene sets that can represent important COVID-19 related pathways are selected. These “representative genes” are associated with the COVID-19 pathology. Then, two distinctive networks were constructed for COVID-19 related signaling and disease pathways. In the second stage, the pathways of each network are ranked with respect to some unsupervised scorning method based on our defined informative features. Finally, we present a comprehensive analysis of the top important pathways in both networks. Materials and implementations are available at: https://github.com/MahnazHabibi/Pathway.

Place, publisher, year, edition, pages
Elsevier BV, 2022
Keywords
Coronavirus disease 2019, Machine learning, SARS-CoV-2, Unsupervised learning
National Category
Computer Sciences Infectious Medicine
Identifiers
urn:nbn:se:kth:diva-329054 (URN)10.1016/j.asoc.2022.109510 (DOI)000884754600013 ()35992221 (PubMedID)2-s2.0-85136590598 (Scopus ID)
Note

QC 20230614

Available from: 2023-06-14 Created: 2023-06-14 Last updated: 2023-06-14Bibliographically approved
Habibi, M., Taheri, G. & Aghdam, R. (2021). A SARS-CoV-2 (COVID-19) biological network to find targets for drug repurposing. Scientific Reports, 11(1), Article ID 9378.
Open this publication in new window or tab >>A SARS-CoV-2 (COVID-19) biological network to find targets for drug repurposing
2021 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 11, no 1, article id 9378Article in journal (Refereed) Published
Abstract [en]

The Coronavirus disease 2019 (COVID-19) caused by the SARS-CoV-2 virus needs a fast recognition of effective drugs to save lives. In the COVID-19 situation, finding targets for drug repurposing can be an effective way to present new fast treatments. We have designed a two-step solution to address this approach. In the first step, we identify essential proteins from virus targets or their associated modules in human cells as possible drug target candidates. For this purpose, we apply two different algorithms to detect some candidate sets of proteins with a minimum size that drive a significant disruption in the COVID-19 related biological networks. We evaluate the resulted candidate proteins sets with three groups of drugs namely Covid-Drug, Clinical-Drug, and All-Drug. The obtained candidate proteins sets approve 16 drugs out of 18 in the Covid-Drug, 273 drugs out of 328 in the Clinical-Drug, and a large number of drugs in the All-Drug. In the second step, we study COVID-19 associated proteins sets and recognize proteins that are essential to disease pathology. This analysis is performed using DAVID to show and compare essential proteins that are contributed between the COVID-19 comorbidities. Our results for shared proteins show significant enrichment for cardiovascular-related, hypertension, diabetes type 2, kidney-related and lung-related diseases.

Place, publisher, year, edition, pages
Springer Nature, 2021
National Category
Infectious Medicine
Identifiers
urn:nbn:se:kth:diva-297731 (URN)10.1038/s41598-021-88427-w (DOI)000656497500026 ()33931664 (PubMedID)2-s2.0-85105142757 (Scopus ID)
Note

QC 20210802

Correction in:  Scientific Reports, (2021), 11, 1, (9378), 10.1038/s41598-021-88427-w

Available from: 2021-08-02 Created: 2021-08-02 Last updated: 2022-09-15Bibliographically approved
Habibi, M. & Taheri, G. (2021). Topological network based drug repurposing for coronavirus 2019. PLOS ONE, 16(7), Article ID e0255270.
Open this publication in new window or tab >>Topological network based drug repurposing for coronavirus 2019
2021 (English)In: PLOS ONE, E-ISSN 1932-6203, Vol. 16, no 7, article id e0255270Article in journal (Refereed) Published
Abstract [en]

The COVID-19 pandemic caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has become the current health concern and threat to the entire world. Thus, the world needs the fast recognition of appropriate drugs to restrict the spread of this disease. The global effort started to identify the best drug compounds to treat COVID-19, but going through a series of clinical trials and our lack of information about the details of the virus's performance has slowed down the time to reach this goal. In this work, we try to select the subset of human proteins as candidate sets that can bind to approved drugs. Our method is based on the information on human-virus protein interaction and their effect on the biological processes of the host cells. We also define some informative topological and statistical features for proteins in the protein-protein interaction network. We evaluate our selected sets with two groups of drugs. The first group contains the experimental unapproved treatments for COVID-19, and we show that from 17 drugs in this group, 15 drugs are approved by our selected sets. The second group contains the external clinical trials for COVID-19, and we show that 85% of drugs in this group, target at least one protein of our selected sets. We also study COVID-19 associated protein sets and identify proteins that are essential to disease pathology. For this analysis, we use DAVID tools to show and compare disease-associated genes that are contributed between the COVID-19 comorbidities. Our results for shared genes show significant enrichment for cardiovascular-related, hypertension, diabetes type 2, kidney-related and lung-related diseases. In the last part of this work, we recommend 56 potential effective drugs for further research and investigation for COVID-19 treatment. Materials and implementations are available at: .https://github.com/ MahnazHabibi/Drug-repurposing.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2021
National Category
Infectious Medicine
Identifiers
urn:nbn:se:kth:diva-302054 (URN)10.1371/journal.pone.0255270 (DOI)000685248200073 ()34324563 (PubMedID)2-s2.0-85111483422 (Scopus ID)
Note

QC 20210916

Available from: 2021-09-16 Created: 2021-09-16 Last updated: 2022-06-25Bibliographically approved
Aghdam, R., Habibi, M. & Taheri, G. (2021). Using informative features in machine learning based method for COVID-19 drug repurposing. Journal of Cheminformatics, 13(1), Article ID 70.
Open this publication in new window or tab >>Using informative features in machine learning based method for COVID-19 drug repurposing
2021 (English)In: Journal of Cheminformatics, E-ISSN 1758-2946, Vol. 13, no 1, article id 70Article in journal (Refereed) Published
Abstract [en]

Coronavirus disease 2019 (COVID-19) is caused by a novel virus named Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). This virus induced a large number of deaths and millions of confirmed cases worldwide, creating a serious danger to public health. However, there are no specific therapies or drugs available for COVID-19 treatment. While new drug discovery is a long process, repurposing available drugs for COVID-19 can help recognize treatments with known clinical profiles. Computational drug repurposing methods can reduce the cost, time, and risk of drug toxicity. In this work, we build a graph as a COVID-19 related biological network. This network is related to virus targets or their associated biological processes. We select essential proteins in the constructed biological network that lead to a major disruption in the network. Our method from these essential proteins chooses 93 proteins related to COVID-19 pathology. Then, we propose multiple informative features based on drug-target and protein-protein interaction information. Through these informative features, we find five appropriate clusters of drugs that contain some candidates as potential COVID-19 treatments. To evaluate our results, we provide statistical and clinical evidence for our candidate drugs. From our proposed candidate drugs, 80% of them were studied in other studies and clinical trials.

Place, publisher, year, edition, pages
Springer Nature, 2021
Keywords
Coronavirus disease 2019, SARS-CoV-2, Protein-protein interaction, Clustering method
National Category
Medical and Health Sciences
Identifiers
urn:nbn:se:kth:diva-303049 (URN)10.1186/s13321-021-00553-9 (DOI)000698428500001 ()34544500 (PubMedID)2-s2.0-85115141169 (Scopus ID)
Note

QC 20211006

Available from: 2021-10-06 Created: 2021-10-06 Last updated: 2022-06-25Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-2741-0355

Search in DiVA

Show all publications