Machine Learning Based Classification of Microsatellite Variation: An Effective Approach for Phylogeographic Characterization of Olive Populations
KTH, School of Biotechnology (BIO), Gene Technology. Natl Inst Genet Engn & Biotechnol, Iran.
2015 (English). In: PLoS ONE, ISSN 1932-6203, E-ISSN 1932-6203, Vol. 10, no 11, e0143465. Article in journal (Refereed). Published. Text.
Abstract [en]

Finding efficient analytical techniques is increasingly becoming a bottleneck for making effective use of large biological data sets. Machine learning offers novel and powerful tools for advancing classification and modeling in molecular biology. However, these methods have been used less frequently with empirical population genetics data. In this study, we developed a new combined approach that applies machine learning algorithms to microsatellite marker data from our previous studies of olive populations. We investigated 267 olive accessions of various origins, comprising 21 reference cultivars, 132 local ecotypes, and 37 wild olive specimens from the Iranian plateau, together with 77 of the most represented Mediterranean varieties, using a carefully selected panel of 11 microsatellite markers. We organized the data into two experiments, '4-targeted' and '16-targeted'. A strategy combining several machine-based analyses (i.e. data cleaning, feature selection, and machine learning classification) was devised to identify the most informative loci and the most diagnostic alleles for representing the population and geography of each olive accession. These analyses revealed the microsatellite markers with the highest differentiating capacity and demonstrated the efficiency of our method in clustering olive accessions according to their regions of origin. A highlight of this study was the identification of the best combination of markers for differentiating populations via machine learning models, an approach that can be applied to distinguish among other biological populations.
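
The abstract does not name the specific algorithms used, so the following is only a minimal sketch of the general idea of ranking loci by how well they separate regions and then classifying an accession from the selected loci. Everything in it is an assumption made for illustration: the toy genotypes, the locus names (`locusA`, `locusB`), the region labels, the use of information gain as the feature-selection score, and the allele-sharing nearest-neighbour classifier. It is not the authors' pipeline.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data (invented for illustration): each accession has a
# region label and, per locus, a diploid genotype given as two allele sizes.
ACCESSIONS = [
    ("Iran",          {"locusA": (120, 124), "locusB": (200, 200)}),
    ("Iran",          {"locusA": (120, 120), "locusB": (200, 204)}),
    ("Iran",          {"locusA": (124, 120), "locusB": (204, 200)}),
    ("Mediterranean", {"locusA": (130, 134), "locusB": (200, 204)}),
    ("Mediterranean", {"locusA": (130, 130), "locusB": (200, 200)}),
    ("Mediterranean", {"locusA": (134, 130), "locusB": (204, 204)}),
]

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(accessions, locus):
    """Drop in label entropy after grouping accessions by the unordered
    genotype at one locus (assumed feature-selection score)."""
    labels = [region for region, _ in accessions]
    groups = defaultdict(list)
    for region, genotype in accessions:
        groups[tuple(sorted(genotype[locus]))].append(region)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def rank_loci(accessions):
    """Return locus names sorted from most to least informative."""
    loci = accessions[0][1].keys()
    return sorted(loci, key=lambda l: information_gain(accessions, l), reverse=True)

def distance(g1, g2, loci):
    """Allele-sharing distance: per locus, 2 minus the number of shared alleles."""
    d = 0
    for locus in loci:
        shared = sum((Counter(g1[locus]) & Counter(g2[locus])).values())
        d += 2 - shared
    return d

def classify(query, training, loci, k=3):
    """Predict the region by majority vote among the k nearest accessions."""
    neighbours = sorted(training, key=lambda acc: distance(query, acc[1], loci))[:k]
    return Counter(region for region, _ in neighbours).most_common(1)[0][0]

if __name__ == "__main__":
    ranked = rank_loci(ACCESSIONS)
    print("Loci ranked by information gain:", ranked)
    unknown = {"locusA": (120, 124), "locusB": (204, 204)}
    print("Predicted region:", classify(unknown, ACCESSIONS, ranked[:1]))
```

On this toy data, `locusA` separates the two hypothetical regions perfectly while `locusB` barely does, which is the kind of contrast a feature-selection step is meant to expose. A real analysis would use held-out validation and the full marker panel.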

Place, publisher, year, edition, pages
Public Library of Science, 2015. Vol. 10, no 11, e0143465
National Category
Genetics and Breeding
Identifiers
URN: urn:nbn:se:kth:diva-180513
DOI: 10.1371/journal.pone.0143465
ISI: 000365862600068
ScopusID: 2-s2.0-84955210919
OAI: oai:DiVA.org:kth-180513
DiVA: diva2:895469
Note

QC 20160119

Available from: 2016-01-19. Created: 2016-01-14. Last updated: 2016-01-19. Bibliographically approved.

By author/editor
Ardalan, Arman