Comparative Analysis for Alignment Based Document Clustering

Veeraman, T.; Nedunchelian, R.
June 2014
Australian Journal of Basic & Applied Sciences;Jun2014, Vol. 8 Issue 9, p22
Academic Journal
Background: Document Clustering is a technique that organizes a large quantity of unordered text Document into small number of meaning full and coherent cluster. Clustering approach facilitates the presentation of search result in more compact form and enables thematic browsing result set. Objective: The main problem of existing web search result based on poor vector representation of snippets. The Data units returned from the underlying database are normally encoded into the result page dynamically for human browsing which essential for many application such as internet comparison, shopping, and also be extracted out and assigned meaningful labels. Result: We present a clustering approach such K-Means, Weighted K-Means and Enhanced K-Means Algorithm. This method is capable of handling a variety of clustering approach based on Alignment Algorithm. Conclusion: Our Experimental result shows that the precision and result are achieved to improve the performance of clustering system is highly effective.


Related Articles

  • Performance Analysis of Clustering using Partitioning and Hierarchical Clustering Techniques. Punitha, S. C.; Thangaiah, P. Ranjith Jeba; Punithavalli, M. // International Journal of Database Theory & Application;Dec2014, Vol. 7 Issue 6, p233 

    Text clustering is the method of combining text or documents which are similar and dissimilar to one another. In several text tasks, this text mining is used such as extraction of information and concept/entity, summarization of documents, modeling of relation with entity,...

  • An integration of fuzzy association rules and WordNet for document clustering. Chen, Chun-Ling; Tseng, Frank S. C.; Liang, Tyne // Knowledge & Information Systems;Sep2011, Vol. 28 Issue 3, p687 

    With the rapid growth of text documents, document clustering technique is emerging for efficient document retrieval and better document browsing. Recently, some methods had been proposed to resolve the problems of high dimensionality, scalability, accuracy, and meaningful cluster labels by using...

  • Profiling Web Users Preferences with Text Mining. Bonifácio Costa, Pedro; Oliveira, Sancho; Nunes, Luís // CISTI (Iberian Conference on Information Systems & Technologies ;2013, Vol. 2, p73 

    In this paper we propose a new approach to clustering Web-based content that could be leveraged by users' preferences. These preferences may imply grouping or dividing the initial clusters so that the resulting clusters represent users' profiles. This approach could be applied to recommend...

  • Determining the Number and Sites for New Substations in Terms of Perspective Development of Urban Power Distribution Networks. Karpenko, A. P.; Kuzmina, I. A. // Science & Education of Bauman MSTU / Nauka i Obrazovanie of Baum;dec2014, Issue 12, p798 

    This article continues a cycle of A. P Karpenko's and I. A Kuzmina's papers regarding the problem of perspective development of distribution power supply network (the magazine "Science and education" No. 05, May 2014 [http://technomag.bmstu.ru/doc/709781.html], No. 10, October 2014...

  • An Efficient Text Clustering Approach using Biased Affinity Propagation. Sharma, Isha; Motwani, Mahak // International Journal of Computer Applications;Jun2014, Vol. 96, p1 

    Based on an effective clustering algorithm Seeds affinity propagation- in this paper an efficient clustering approach is presented which uses one dimension for the group of the words representing the similar area of interest with that we have also considered the uneven weighting of each...

  • Web Mining: Opinion and Feedback Analysis for Educational Institutions. Prakash Verma, Jai; Patel, Bankim; Patel, Atul // International Journal of Computer Applications;Dec2013, Vol. 84, p17 

    Big amount of data available in the forms of reviews, opinions, feedbacks, remarks, comments, observations, clarifications, and explanations that require a robust mechanism to store, retrieve, analyze, and management. In this paper, we are proposing system that provides review or summary of...

  • Text Mining Research Based on Intelligent Computing in Information Retrieval System. Yong Li // Telkomnika;Dec2015, Vol. 13 Issue 4, p1384 

    With the popularity and rapid development of the Internet, web text information has rapidly grown as well. To address the key problem of text mining, text clustering is investigated in this study. The shuffled frog leaping algorithm as a new type of swarm intelligence optimization algorithm can...

  • A Robust k-Means Type Algorithm for Soft Subspace Clustering and Its Application to Text Clustering. Tiantian Yang; Jun Wang // Journal of Software (1796217X);Aug2014, Vol. 9 Issue 8, p2120 

    Soft subspace clustering are effective clustering techniques for high dimensional datasets. Although several soft subspace clustering algorithms have been developed in recently years, its robustness should be further improved. In this work, a novel soft subspace clustering algorithm RSSKM are...

  • BIBLIOMETRIC METHODS FOR DETECTING AND ANALYSING EMERGING RESEARCH TOPICS. Glänzel, Wolfgang // El Profesional de la Información;mar/abr2012, Vol. 21 Issue 2, p194 

    This study gives an overview of the process of clustering scientific disciplines using hybrid methods, detecting and labelling emerging topics and analysing the results using bibliometrics methods. The hybrid clustering techniques are based on biblographic coupling and text-mining and 'core...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics