Nâng cao hiệu năng tính toán cho thuật toán phân cụm FCM

Improving the computing performance for FCM clustering algorithm

Tác giả:

Tóm tắt:

Trong xu hướng phát triển của ngành công nghiệp 4.0, xử lí dữ liệu lớn là vấn đề nhận được nhiều quan tâm của các nhà khoa học. Do khối lượng tính toán rất lớn và phức tạp, việc xử lí dữ liệu lớn thường được thực hiện song song trên các bộ xử lý đa lõi, đa luồng. Tuy nhiên, các thuật toán song song cũng cần phải được áp dụng và cải tiến nhằm phát huy sức mạnh của các bộ xử lý này. Với bộ xử lý đồ họa GPU (Graphics Processing Units), thuật toán phân cụm mờ FCM (Fuzzy C-Means [1]) đã được triển khai một cách hiệu quả, tuy rằng chưa giải quyết được vấn đề song song không đầy đủ NFPP (Not-Fully Parallelized Problem) tại bước tính toán các Trọng tâm cụm (Cluster Centers). Trong bài báo này, các tác giả đề xuất một giải pháp tăng hiệu năng tính toán của thuật toán FCM bằng việc rút gọn dữ liệu ở những bước song song không đầy đủ với mô hình kiến trúc thiết bị tính toán hợp nhất CUDA (Compute Unified Device Architecture [2]).

Từ khóa:

FCM, song song không đầy đủ, GPU, phân cụm dữ liệu.

Abstract:

In the current trend of 4.0 industry, big data has received considerable attention from scientists. Due to the extremely large and complex data computational volume, big data tasks are typically performed in parallel using the power of massive multi-threading and mutil-core processors. However, the parallel algorithms also need to be applied and improved to harness the power of these processors. With Graphics Processing Units (GPUs), the Fuzzy C-Means (FCM [1]) cluster algorithm has implemented effectively, but not solved the not-fully parallelized problem (NFPP) at the calculating new cluster centers step. In this paper, the authors propose a method to improve the computing performance of the FCM algorithm by reducting data in the not-fully parallelized steps with Compute Unified Device Architecture Model (CUDA [2]).

Keyword:

FCM, not-fully parallelized, GPU, data clustering.

Số TC:

SỐ 54 (04/2018)

Số TT:

Trang:

Main menu

Nâng cao hiệu năng tính toán cho thuật toán phân cụm FCM

Tiêu điểm

Sự kiện

Tạp chí KHCN Hàng hải

Liên kết