ODBOT: Outlier detection-based oversampling technique for imbalanced datasets learning

dc.contributor.authorIbrahim, Mohammed H.
dc.date.accessioned2024-02-23T13:55:51Z
dc.date.available2024-02-23T13:55:51Z
dc.date.issued2021
dc.departmentNEÜen_US
dc.description.abstractIn many real-world problems, the datasets are imbalanced when the samples of majority classes are much greater than the samples of minority classes. In general, machine learning and data mining classification algorithms perform poorly on imbalanced datasets. In recent years, various oversampling techniques have been developed in the literature to solve the class imbalance problem. Unfortunately, few of the oversampling techniques can be spread to tackle the relationship between the classes and use the correlation between attributes. Moreover, in most cases, the existing oversampling techniques do not handle multi-class imbalanced datasets. To this end, in this paper, a simple but effective outlier detection-based oversampling technique (ODBOT) is proposed to handle the multi-class imbalance problem. In the proposed ODBOT, the outlier samples are detected by clustering within the minority class(es), and then, the synthetic samples are generated by consideration of these outlier samples. The proposed ODBOT generates very efficient and consistent synthetic samples for the minority class(es) by analyzing well the dissimilarity relationships among attribute values of all classes. Moreover, ODBOT can reduce the risk of the overlapping problem among different class regions and can build a better classification model. The performance of the proposed ODBOT is evaluated with extensive experiments using commonly used 60 imbalanced datasets and five classification algorithms. The experimental results show that the proposed ODBOT oversampling technique consistently outperformed the other common and state-of-the-art techniques in terms of various evaluation criteria.en_US
dc.identifier.doi10.1007/s00521-021-06198-x
dc.identifier.endpage15806en_US
dc.identifier.issn0941-0643
dc.identifier.issn1433-3058
dc.identifier.issue22en_US
dc.identifier.scopus2-s2.0-85108334591en_US
dc.identifier.scopusqualityQ1en_US
dc.identifier.startpage15781en_US
dc.identifier.urihttps://doi.org/10.1007/s00521-021-06198-x
dc.identifier.urihttps://hdl.handle.net/20.500.12452/10984
dc.identifier.volume33en_US
dc.identifier.wosWOS:000664024000005en_US
dc.identifier.wosqualityQ2en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherSpringer London Ltden_US
dc.relation.ispartofNeural Computing & Applicationsen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectClass Imbalance Dataseten_US
dc.subjectData Preprocessingen_US
dc.subjectOutlier Detectionen_US
dc.subjectOversamplingen_US
dc.titleODBOT: Outlier detection-based oversampling technique for imbalanced datasets learningen_US
dc.typeArticleen_US

Dosyalar