ODBOT: Outlier detection-based oversampling technique for imbalanced datasets learning
dc.contributor.author | Ibrahim, Mohammed H. | |
dc.date.accessioned | 2024-02-23T13:55:51Z | |
dc.date.available | 2024-02-23T13:55:51Z | |
dc.date.issued | 2021 | |
dc.department | NEÜ | en_US |
dc.description.abstract | In many real-world problems, datasets are imbalanced: the majority classes contain far more samples than the minority classes. In general, machine learning and data mining classification algorithms perform poorly on imbalanced datasets. In recent years, various oversampling techniques have been developed in the literature to solve the class imbalance problem. Unfortunately, few of these oversampling techniques can be extended to account for the relationships between classes and to exploit the correlations between attributes. Moreover, in most cases, the existing oversampling techniques do not handle multi-class imbalanced datasets. To this end, this paper proposes a simple but effective outlier detection-based oversampling technique (ODBOT) to handle the multi-class imbalance problem. In the proposed ODBOT, outlier samples are detected by clustering within the minority class(es), and synthetic samples are then generated by taking these outlier samples into account. The proposed ODBOT generates efficient and consistent synthetic samples for the minority class(es) by carefully analyzing the dissimilarity relationships among the attribute values of all classes. Moreover, ODBOT can reduce the risk of overlap among different class regions and can build a better classification model. The performance of the proposed ODBOT is evaluated with extensive experiments on 60 commonly used imbalanced datasets and five classification algorithms. The experimental results show that the proposed ODBOT oversampling technique consistently outperformed other common and state-of-the-art techniques in terms of various evaluation criteria. | en_US |
dc.identifier.doi | 10.1007/s00521-021-06198-x | |
dc.identifier.endpage | 15806 | en_US |
dc.identifier.issn | 0941-0643 | |
dc.identifier.issn | 1433-3058 | |
dc.identifier.issue | 22 | en_US |
dc.identifier.scopus | 2-s2.0-85108334591 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.startpage | 15781 | en_US |
dc.identifier.uri | https://doi.org/10.1007/s00521-021-06198-x | |
dc.identifier.uri | https://hdl.handle.net/20.500.12452/10984 | |
dc.identifier.volume | 33 | en_US |
dc.identifier.wos | WOS:000664024000005 | en_US |
dc.identifier.wosquality | Q2 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer London Ltd | en_US |
dc.relation.ispartof | Neural Computing & Applications | en_US |
dc.relation.publicationcategory | Article - International Refereed Journal - Institutional Academic Staff | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Class Imbalance Dataset | en_US |
dc.subject | Data Preprocessing | en_US |
dc.subject | Outlier Detection | en_US |
dc.subject | Oversampling | en_US |
dc.title | ODBOT: Outlier detection-based oversampling technique for imbalanced datasets learning | en_US |
dc.type | Article | en_US |