Item Infomation


Title: An ensemble model approach for many-feature data clustering
Authors: Lê, Thị Cẩm Bình
Ngô, Thành Long
Phạm, Văn Nha
Phạm, Thế Long
Issue Date: 2021
Abstract: The ensemble is a popular machine learning technique based on the principle of divide and conquer. In data clustering, the ensemble aims to improve performance in terms of processing speed and clustering quality. Most existing ensemble methods face inherent complex challenges such as uncertainty, ambiguity, and overlap. Fuzzy clustering has recently been developed to handle data with many-feature, heterogeneity, uncertainty, and big data. In this paper, we propose an ensemble feature- reduction clustering model (EFRC) using advanced machine learning techniques. The EFRC model consists of three phases. First, the data is feature-reduced by a random projection. Then, the data is divided into subsets based on the likelihood of overlap and quantification of noise. Various clustering techniques are used to cluster the subset of data. Finally, the results of the clustering modules are consensus using the classification technique to produce the final clustering result. Several tests were performed on the benchmark datasets. The test results show the superior performance of the EFRC model compared to the previous models.
URI: http://huc.dspace.vn/handle/DHVH/15445
Appears in Collections:LĨNH VỰC THÔNG TIN - THƯ VIỆN
ABSTRACTS VIEWS

45

VIEWS & DOWNLOAD

12

Files in This Item:


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.