The Use of BEXA Family Algorithms in Bioinformatics Data Classification

Madara Gasparoviсa, Ludmila Aleksejeva, Valdis Gersons

Abstract


This article studies the possibilities of BEXA family classification algorithms – BEXA, FuzzyBexa and FuzzyBexa II in data, especially bioinformatics data, classification. Three different types of data sets have been used in the study – data sets often used in the literature, UCI data repository real life data sets and real bioinformatics data sets that have the specific character – a large number of attributes and a small number of records. For the comparison of classification results experiments have been carried out using all data sets and other classification algorithms. As a result, conclusions have been drawn and recommendations given about the use of each algorithm of BEXA family for classification of various real data, as well as an answer has been given to the question, whether the use of these algorithms is recommended for bioinformatics data.


Keywords:

BEXA; bioinformatics data; classification algorithms; UCI data

Full Text:

PDF

References


H. Theron, I. Cloete, BEXA: A Covering Algorithm for Learning Propositional Concept Descriptions, in Machine Learning, Vol. 24, Boston: Kluwer Academic Publishers, 1996, pp.5-40.

J. van Zyl, I.Cloete, FuzzConRi – A Fuzzy Conjunctive Rule Inducer, in Proc. Workshop on Advances in Inductive Rule Learning, ECML, 2004, pp.194-203.

J. van Zyl, I.Cloete, Simultaneous Concept Learning of Fuzzy Rules, in Proc. Workshop on Advances in Inductive Rule Learning, CCML, 2004, pp.194-203.

P. Clark. The CN2 Induction Algorithm / Clark P. and Niblett T. // Machine Learning. Vol. 3, 1989, pp. 261-283.

J. Hong. AQ15: Incremental Learning of Attribute-Based Descriptions from Examples the Method and User Guide. Report of the Intelligent Systems Group, UIUCDCS-F-86-949 Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, 1986.

J. Zyl. Fuzzy set covering as a new paradigm for the induction of fuzzy classification rules. – Mannheim: PhD thesis, 2007. p 263.

A. Frank, A. Asuncion, UCI Machine Learning Repository Irvine, CA: University of California, School of Information and Computer Science. 2010. [Online] Available: http://archive.ics.uci.edu/ml]. [Accessed: June 3, 2012].

M. Gasparoviča M., L. Aleksejeva. Feature Selection for Bioinformatics Data Sets – Is It Recommended? // Proceedings of the 5th International Conference on Applied Information and Communication Technologies (AICT2012), Latvia, Jelgava, 26.-27. April, 2012. - pp 325-335.

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I.H.Witten.: The WEKA Data Mining Software: An Update. SIGKDD Explorations. 11:1, 2009, pp. 10-18.

J. Alcalá-Fdez,, A. Fernandez, J. Luengo, J. Derrac, S. García, L. Sánchez,, F. Herrera: KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework. Journal of Multiple-Valued Logic and Soft Computing. 17:2-3, 2011, pp.55-287.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2012 Madara Gasparoviсa, Ludmila Aleksejeva, Valdis Gersons

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.