In Entropy (Basel, Switzerland)
Data stream mining techniques have recently received increasing research interest, especially in medical data classification. An unbalanced representation of the classification's targets in these data is a common challenge because classification techniques are biased toward the major class. Many methods have attempted to address this problem but have been exaggeratedly biased toward the minor class. In this work, we propose a method for balancing the presence of the minor class within the current window of the data stream while preserving the data's original majority as much as possible. The proposed method utilized similarity analysis for selecting specific instances from the previous window. This group of minor-class was then added to the current window's instances. Implementing the proposed method using the Siena dataset showed promising results compared to the Skew ensemble method and some other research methods.
Fatlawi Hayder K, Kiss Attila
2022-Nov-11
EEG, imbalanced data, machine learning, similarity analysis