Advances in Smart Systems Research

Publisher Future Technology Publications
Vol. 6 No. 2 CIMA 2017 Workshop Papers from the KES2017 Conference
Journal ISSN 2050-8662
 
Article TitleStatistical Analysis and Optimization of Classification Methods of Big Data in Medicine
Primary AuthorGeorge Fourfouris, Computer Engineering and Informatics
Other Author(s) George Economou; Spiridon Likothanasis
Pages 41 - 54
Article ID k17is-223
Publication Date 05-Nov-17
Abstract

The process of big data classification has been extensively explored for the past decades, being it essential for Machine Learning. Likewise, performing statistical analysis and investigating the optimization factors of applied algorithms, while evaluating datasets based on recent experience, are important as well in order to specify the best case scenarios parameters that enhance their accuracy. In this paper search on and implementation of classification algorithms, such as the state-of-the-art k-Nearest Neighbors (k-NN), the alternative k-Nearest Neighbors on Feature Projections (k-NNFP) algorithm and Vote Feature Intervals (VFI), over big medical datasets from UCI Machine Learning Repository, is presented. These algorithms, moreover, are statistically analyzed on a standardized Arrhythmia Dataset [1] in order to extract the scenarios that enhance the precision execution for multiple k-fold cross validations and the number of nearest neighbors. Additionally, feature selection and extraction methods are implemented in order to exploit and establish best-case scenarios.

 View Paper
Remarks Papers presented at 7th International Workshop on Combinations of Intelligent Methods and Applications (CIMA) as part of 21st International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, 6-8 September 2017, Marseille, France