DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

Semantic analysis based text clustering by the fusion of bisecting k-means and UPGMA algorithm


Article Information

Title: Semantic analysis based text clustering by the fusion of bisecting k-means and UPGMA algorithm

Authors: G. Loshma, Nagaratna P. Hedge

Journal: ARPN Journal of Engineering and Applied Sciences

HEC Recognition History
Category From To
Y 2023-07-01 2024-09-30
Y 2022-07-01 2023-06-30
Y 2021-07-01 2022-06-30
X 2020-07-01 2021-06-30

Publisher: Khyber Medical College, Peshawar

Country: Pakistan

Year: 2016

Volume: 11

Issue: 3

Language: English

Categories

Abstract

Owing to the fastest data growth, this era can be claimed as the era of zettabytes. An effective mechanism is the need of this hour, to manage all the available data efficiently. Clustering is a technique to group relevant documents together. This work takes the semantics into account and clusters the document with the hybrid of bisecting k-means and UPGMA algorithm. The semantic analysis is made possible by the inclusion of Wordnet, which is a lexical database. The outcome of this algorithm is more accurate, as the clusters are meaningful. The performance of the proposed algorithm is evaluated with respect to precision, recall, F-measure, accuracy and misclassification rate. The experimental results of the proposed work are satisfactory.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...