CAViz, an Interactive Graphical Tool for Image Mining

Pham, Nguyen-Khang; Morin, Annie; Gros, Patrick
December 2008
Journal of Computing & Information Technology;Dec2008, Vol. 16 Issue 4, p295
Academic Journal
We propose an interactive graphical tool, CAViz, which allows us to display and to extract knowledge from the results of a Correspondence Analysis CA on images. CA is a descriptive technique designed to analyze simple two-way and multi-way tables containing some measure of correspondence between the rows and columns. CA is very often used in Textual Data Analysis (TDA) where the contingency table crosses words and documents. In image mining, the first step is to define "visual" words in images (similar to words in texts). These words are constructed from local descriptors (SIFT, Scale Invariant Feature Transform) in images. Our tool CAViz is interactive, and it helps the user interpretating the results and the graphs of CA. An application to the Caltech4 base illustrates the interest of CAViz in image mining.


Related Articles

  • A Primer for Understanding and Applying Data Mining. Thuraisingham, Bhavani // IT Professional;Jan/Feb2000, Vol. 2 Issue 1, p28 

    Focuses on the application of data mining to business systems. Definition of data mining; Uses of data mining in promotions, marketing and sales; Technologies integrated into data mining; Planning a data mining project. INSET: Data Mining Outcomes and Techniques..

  • Approximate Nearest Neighbour Using Data Mining. Verma, Deepika; Kakkar, Namita // International Journal of Advanced Research in Computer Science;Jul/Aug2013, Vol. 4 Issue 9, p178 

    Data mining may be viewed as the extraction of the hidden predictive information from large databases, is a powerful new technology with great potential to analyze important information in the data warehouse. Nearest neighbor search (NNS), also known as proximity search, similarity search or...

  • Has Large-Scale Named-Entity Network Analysis Been Resting on a Flawed Assumption? Fegley, Brent D.; Torvik, Vetle I. // PLoS ONE;Jul2013, Vol. 8 Issue 7, p1 

    The assumption that a name uniquely identifies an entity introduces two types of errors: splitting treats one entity as two or more (because of name variants); lumping treats multiple entities as if they were one (because of shared names). Here we investigate the extent to which splitting and...

  • Estimating Illumination Distribution to Generate Realistic Shadows in Augmented Reality. Changkyoung Eem; Iksu Kim; Yeongseok Jung; Hyunki Hong // KSII Transactions on Internet & Information Systems;Jun2015, Vol. 9 Issue 6, p2289 

    Mobile devices are becoming powerful enough to realize augmented reality (AR) application. This paper introduces two AR methods to estimate an environmental illumination distribution of a scene. In the first method, we extract the lighting direction and intensity from input images captured with...

  • Finding Meaning in the Measures. Richard, Ron F.; Nunley, Nancy; Ball, Nigel // Sleep Review;Jul/Aug2007, Vol. 8 Issue 5, p40 

    The article looks at the points to consider when mining data to improve compliance and patient outcomes in sleep centers. The authors assert that data management has always been important in the field of sleep medicine and plays a vital role in the initial diagnosis and treatment of patients....

  • Novelty as a Measure of Interestingness in Knowledge Discovery. Bhatnagar, Vasudha; Al-Hegami, Ahmed Sultan; Kumar, Naveen // International Journal of Information Technology;2006, Vol. 2 Issue 1, p36 

    Rule Discovery is an important technique for mining knowledge from large databases. Use of objective measures for discovering interesting rules leads to another data mining problem, although of reduced complexity. Data mining researchers have studied subjective measures of interestingness to...

  • An Empirical Study of the Applications of Web Mining Techniques in Health Care. Kumar, Varun; Ahmed, MD. Ezaz // International Journal of Advanced Computer Science & Application;Oct2011, Vol. 2 Issue 10, p91 

    Few years ago, the information flow in health care field was relatively simple and the application of technology was limited. However, as we progress into a more integrated world where technology has become an integral part of the business processes, the process of transfer of information has...

  • A parallel clustering method combined information bottleneck theory and centroid-based clustering. Sun, Zhanquan; Fox, Geoffrey; Gu, Weidong; Li, Zhao // Journal of Supercomputing;Jul2014, Vol. 69 Issue 1, p452 

    Clustering is an important research topic of data mining. Information bottleneck theory-based clustering method is suitable for dealing with complicated clustering problems because that its information loss metric can measure arbitrary statistical relationships between samples. It has been...

  • A Tools-Based Approach to Teaching Data Mining Methods. Jafar, Musa J. // Journal of Information Technology Education;2010, Vol. 9, pIIP1 

    Data mining is an emerging field of study in Information Systems programs. Although the course content has been streamlined, the underlying technology is still in a state of flux. The purpose of this paper is to describe how we utilized Microsoft Excel's data mining add-ins as a front-end to...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics