Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/97201
Citations
Scopus Web of Science® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTorkzaban, B.-
dc.contributor.authorKayvanjoo, A.-
dc.contributor.authorArdalan, A.-
dc.contributor.authorMousavi, S.-
dc.contributor.authorMariotti, R.-
dc.contributor.authorBaldoni, L.-
dc.contributor.authorEbrahimie, E.-
dc.contributor.authorEbrahimi, M.-
dc.contributor.authorHosseini-Mazinani, M.-
dc.contributor.editorKalaitzis, P.-
dc.date.issued2015-
dc.identifier.citationPLoS One, 2015; 10(11):e0143465-1-e0143465-17-
dc.identifier.issn1932-6203-
dc.identifier.issn1932-6203-
dc.identifier.urihttp://hdl.handle.net/2440/97201-
dc.description.abstractFinding efficient analytical techniques is overwhelmingly turning into a bottleneck for the effectiveness of large biological data. Machine learning offers a novel and powerful tool to advance classification and modeling solutions in molecular biology. However, these methods have been less frequently used with empirical population genetics data. In this study, we developed a new combined approach of data analysis using microsatellite marker data from our previous studies of olive populations using machine learning algorithms. Herein, 267 olive accessions of various origins including 21 reference cultivars, 132 local ecotypes, and 37 wild olive specimens from the Iranian plateau, together with 77 of the most represented Mediterranean varieties were investigated using a finely selected panel of 11 microsatellite markers. We organized data in two '4-targeted' and '16-targeted' experiments. A strategy of assaying different machine based analyses (i.e. data cleaning, feature selection, and machine learning classification) was devised to identify the most informative loci and the most diagnostic alleles to represent the population and the geography of each olive accession. These analyses revealed microsatellite markers with the highest differentiating capacity and proved efficiency for our method of clustering olive accessions to reflect upon their regions of origin. A distinguished highlight of this study was the discovery of the best combination of markers for better differentiating of populations via machine learning models, which can be exploited to distinguish among other biological populations.-
dc.description.statementofresponsibilityBahareh Torkzaban, Amir Hossein Kayvanjoo, Arman Ardalan, Soraya Mousavi, Roberto Mariotti, Luciana Baldoni, Esmaeil Ebrahimie, Mansour Ebrahimi, Mehdi Hosseini-Mazinani-
dc.language.isoen-
dc.publisherPublic Library of Science-
dc.rights© 2015 Torkzaban et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited-
dc.source.urihttp://dx.doi.org/10.1371/journal.pone.0143465-
dc.subjectOlea-
dc.subjectDNA, Plant-
dc.subjectBayes Theorem-
dc.subjectReproducibility of Results-
dc.subjectComputational Biology-
dc.subjectMicrosatellite Repeats-
dc.subjectGenotype-
dc.subjectAlleles-
dc.subjectGenes, Plant-
dc.subjectGeography-
dc.subjectAlgorithms-
dc.subjectDecision Trees-
dc.subjectIran-
dc.subjectGenetic Variation-
dc.subjectPhylogeography-
dc.subjectMachine Learning-
dc.titleMachine learning based classification of microsatellite variation: an effective approach for phylogeographic characterization of olive populations-
dc.typeJournal article-
dc.identifier.doi10.1371/journal.pone.0143465-
pubs.publication-statusPublished-
dc.identifier.orcidEbrahimie, E. [0000-0002-4431-2861]-
Appears in Collections:Aurora harvest 3
Molecular and Biomedical Science publications

Files in This Item:
File Description SizeFormat 
hdl_97201.pdfPublished version1.17 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.