TITLE

A WEBPAGE CLASSIFICATION ALGORITHM CONCERNING WEBPAGE DESIGN CHARACTERISTICS

AUTHOR(S)
Shih-Ting Yang
PUB. DATE
March 2012
SOURCE
International Journal of Electronic Business Management;2012, Vol. 10 Issue 1, p73
SOURCE TYPE
Academic Journal
DOC. TYPE
Article
ABSTRACT
Owing to the booming growth of Internet technology, the number of web documents has significantly increased over the Internet. If the webpage can be effectively managed, the knowledge demanders (i.e., Internet users) can efficiently absorb and use the knowledge documents; it has become the core topic in this information explosion era. Webpage classification technology with high accuracy can improve the efficiency for Internet users to search required knowledge and to save lots of knowledge-searching time. Differing from previous researches, this paper explores webpage design characteristics for webpage classification. That is, concerning complexity of webpage structure, this paper analyzes the webpage design characteristics including tag attributes and tag-region layout to develop an algorithm for webpage classification. Therefore, based on webpage design characteristic analysis, the text contained in specific tag-regions can be identified. Also, the keywords extracted from each tag-region are weighted according tag attributes and tag-region locations; then, the categories of the target webpage can be determined. Furthermore, based on the hyperlink tag, the similar webpage with higher correlations can be collected to re-determine target webpage categories. In addition to the webpage classification algorithm, a web-based webpage classification system is developed to demonstrate feasibility of the proposed model. The attempt of this research is to analyze and use the characteristics of webpage design for webpage classification technology to improve the effectiveness of classification.
ACCESSION #
73344407

Tags: INTERNET;  WEBSITES;  INTERNET users;  INTERNET searching;  HYPERLINKS;  TAGS (Metadata)

 

Related Articles

  • Increase Visibility via Social Bookmarking.  // PR News;4/25/2011, Vol. 67 Issue 17, p2 

    The article discusses the use of social bookmarking to organize online experience and increase efficiency. Social tagging enables an Internet user to bookmark a Web page and file it under a selected tag for future reference. According to the article, the most popular bookmarking Web sites...

  • SOCIAL SEARCH WITH SHARED KEYWORDS. Ueno, Taiki; Yasumura, Michiaki // Proceedings of the IADIS International Conference on WWW/Interne;Jan2009, p440 

    Current web search is typically an individual task where users need to have good domain knowledge and search literacy in order to get useful information easily and efficiently. Similar to how tags are shared in social tagging, in social search keywords of web users with similar interests are...

  • SOCIAL SEARCH WITH SHARED KEYWORDS. Ueno, Taiki; Yasumura, Michiaki // Proceedings of the IADIS International Conference on WWW/Interne;Nov2009, p440 

    Current web search is typically an individual task where users need to have good domain knowledge and search literacy in order to get useful information easily and efficiently. Similar to how tags are shared in social tagging, in social search keywords of web users with similar interests are...

  • Hashing Out Your Twitter Space. Roberts, Tanya // Bar Bulletin of the Maryland State Bar Association;Mar2012, Vol. 29 Issue 3, p15 

    The article provides the fundamentals of using hashtags in Twitter website. Twitter is reportedly a micro-blogging site where users can post quick status updates with 140 characters or less including hyperlinks, and links to photos. represented by a # symbol followed by a keyword, it is stated...

  • Incorporating web browsing activities into anchor texts for web search. Bo Zhou; Yiqun Liu; Min Zhang; Yijiang Jin; Shaoping Ma // Information Retrieval;Jun2011, Vol. 14 Issue 3, p290 

    nchor texts complement Web page content and have been used extensively in commercial Web search engines. Existing methods for anchor text weighting rely on the hyperlink information which is created by page content editors. Since anchor texts are created to help user browse the Web, browsing...

  • Looking Beyond Search. Scott, David M. // EContent;May2005, Vol. 28 Issue 5, p48 

    The article explores the browsing features of Internet services. One of the easiest ways to help Internet users find additional information is to create hyperlinks within a search result that point directly to a list of articles whose cited reference lists include at least one of the sources...

  • The accelerating growth of online tagging systems. Wu, L. // European Physical Journal B -- Condensed Matter;Sep2011, Vol. 83 Issue 2, p283 

    Research on the growth of online tagging systems not only is interesting in its own right, but also yields insights for website management and semantic web analysis. Traditional models that describing the growth of online systems can be divided between linear and nonlinear versions. Linear...

  • SELLING THE BIG PICTURE. Carr, Austin // Fast Company;Sep2011, Issue 158, p47 

    The article offers information on a form of web advertising, called photo tagging, by U.S. startup Pixazza that allows web publishers to turn their images into links where users can buy whatever is pictured.

  • Content-based and collaborative techniques for tag recommendation: an empirical evaluation. Lops, Pasquale; de Gemmis, Marco; Semeraro, Giovanni; Musto, Cataldo; Narducci, Fedelucio // Journal of Intelligent Information Systems;Feb2013, Vol. 40 Issue 1, p41 

    The rapid growth of the so-called Web 2.0 has changed the surfers' behavior. A new democratic vision emerged, in which users can actively contribute to the evolution of the Web by producing new content or enriching the existing one with user generated metadata. In this context the use of tags,...

Share

Read the Article

Courtesy of VIRGINIA BEACH PUBLIC LIBRARY AND SYSTEM

Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics