Lakizadeh, A.Agha-Golzadeh, P.Ebrahimi, M.Ebrahimie, E.Ebrahimi, M.2014-06-162014-06-162011Advanced Studies in Biology, 2011; 3(2):63-781313-9495http://hdl.handle.net/2440/83411There is a high demand for engineering thermostable enzymes in some industries; especially in paper industries to use environmental friendly enzymes instead of toxic chlorine chemicals. Hence, understanding protein attributes involved in enzyme thermostability is important. Herein, the most important protein features contributing to enzyme thermostability was searched by using data mining algorithms. Combination of attribute weighting and unsupervised clustering algorithms were used to explore protein attributes which play major roles in thermostability. The results showed that expectation maximization clustering with uncertainly and correlation attribute weighting algorithms can effectively (100%) classify thermo- and meso-stable proteins. Gln content and frequency of hydrophilic residues were the most important protein features selected by 70% of weighing methods. The findings of this research provide the required knowledge for engineering thermostable enzymes in laboratory.enCopyright status unknownEngineering thermostable enzymes; application of unsupervised clustering algorithmsJournal article002013810314491Ebrahimie, E. [0000-0002-4431-2861]