EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method
dc.contributor.author | Hacibeyoglu, Mehmet | |
dc.contributor.author | Ibrahim, Mohammed H. | |
dc.date.accessioned | 2024-02-23T14:00:03Z | |
dc.date.available | 2024-02-23T14:00:03Z | |
dc.date.issued | 2018 | |
dc.department | NEÜ | en_US |
dc.description.abstract | Discretization is an important data preprocessing technique used in data mining and knowledge discovery processes. The purpose of discretization is to transform or partition continuous values into discrete ones. In this manner, many data mining classification algorithms can be applied the discrete data more concisely and meaningfully than continuous ones, resulting in better performance. In this study, an improved version of the unsupervised equal frequency (EF) discretization method, EF_Unique, is proposed for enhancing the performance of discretization. The proposed EF_Unique discretization method is based on the unique values of the attribute to be discretized. In order to test the success of the proposed method, 17 benchmark datasets from the UCI repository and four data mining classification algorithms were used, namely Naive Bayes, C.45, k-nearest neighbor, and support vector machine. The experimental results of the proposed EF_Unique discretization method were compared with those obtained using well-known discretization methods; unsupervised equal width (EW), EF, and supervised entropy-based ID3 (EB-ID3). The results show that the proposed EF_Unique discretization method outperformed EW, EF, and EB-ID3 discretization methods in 43, 41, and 27 out of the 68 benchmark tests, respectively. | en_US |
dc.identifier.doi | 10.1007/s13369-018-3144-z | |
dc.identifier.endpage | 7704 | en_US |
dc.identifier.issn | 2193-567X | |
dc.identifier.issn | 2191-4281 | |
dc.identifier.issue | 12 | en_US |
dc.identifier.scopus | 2-s2.0-85056251979 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.startpage | 7695 | en_US |
dc.identifier.uri | https://doi.org/10.1007/s13369-018-3144-z | |
dc.identifier.uri | https://hdl.handle.net/20.500.12452/11436 | |
dc.identifier.volume | 43 | en_US |
dc.identifier.wos | WOS:000449936300064 | en_US |
dc.identifier.wosquality | Q3 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer Heidelberg | en_US |
dc.relation.ispartof | Arabian Journal For Science And Engineering | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Classification Algorithms | en_US |
dc.subject | Data Mining | en_US |
dc.subject | Supervised Discretization | en_US |
dc.subject | Unsupervised Discretization | en_US |
dc.title | EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method | en_US |
dc.type | Article | en_US |