Search results

Items from 1 to 13 out of 13 results

chapter

Classification algorithms for relation prediction

C Boden, T Hafele, A Loser

2011 IEEE 27th International Conference on Data Engineering Workshops > 46 - 52

2011 IEEE International Conference on Data Engineering Workshops (ICDEW 2011)

Knowledge discovery from the Web is a cyclic process. In this paper we focus on the important part of transforming unstructured information from Web pages into structured relations. Relation extraction systems capture information from natural language text on Web pages, called Web text. However, extraction is quite costly and time consuming. Worse, many Web pages may not contain a textual representation...

chapter

Identification of malicious web pages for crawling based on network-related attributes of web server

G Hattori, K Matsumoto, C Ono, Y Takishima

2010 4th International Universal Communication Symposium > 355 - 361

2010 4th International Universal Communication Symposium (IUCS 2010)

In this paper, we propose an algorithm for identifying malicious Web pages for crawlers, which collect Web pages for a later task that detects malicious Web pages based on their content. Recently, some organizations have had to crawl Web pages automatically for later checking by humans. However, since manually checking Web pages is an expensive task, the total cost would be enormous...

chapter

Dynamically Constructing a Global Schema for Web Entities

Xiuxing Xu, Qingzhong Li, Yongquan Dong, Yanhui Ding

2010 Seventh Web Information Systems and Applications Conference > 127 - 131

2010 7th Web Information Systems and Applications Conference (WISA 2010). Workshop on Semantic Web and Ontology (SWON2010). Workshop on Electronic Government Technology and Application (EGTA 2010)

With the rapid development of the Internet, popular entities have more and more instances on the Web. It is observed that, on one hand, for the same Web entity, different Web entity instances often contain different attributes, and for the same attribute, different Web entity instances often use different labels; on the other, new Web entity instances which contain new attributes and labels are appearing...

chapter

The study in ranking method for web entity

Peng Li

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1587 - 1591

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Along with the rapid development of information retrieval and Web technology, Web entity retrieval has become a popular new way of getting specific information, such as looking for a book or a movie. As in document retrieval, a query generally returns too many results, so ranking is still a necessary step in the entity retrieval process. This paper will focus on the ranking...

chapter

A Web page classification algorithm and its application in E-government system

Boyi Xu, Jing Wang, Hongming Cai

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1767 - 1771

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

With the widespread use of Internet applications, more and more enterprises build Web sites and provide business information through Web pages. Web page classification can be used to assign enterprise Web pages to one or more predefined business categories. For the purpose of Internet-based enterprise administration in an E-government system, algorithms and application related to web page classification...

chapter

Enhancing Web Page Classification via Local Co-training

Youtian Du, Xiaohong Guan, Zhongmin Cai

2010 20th International Conference on Pattern Recognition > 2905 - 2908

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training (LCT). The proposed algorithm employs a set of local models with vector outputs to model the relations among examples in a local region on each view, and iteratively refines the dominant local models (i.e. the local models related to the unlabeled examples chosen for enriching the training set) using...

chapter

An n-Gram Based Approach to Multi-Labeled Web Page Genre Classification

J.E. Mason, M. Shepherd, J. Duffy, V. Keselj, et al.

2010 43rd Hawaii International Conference on System Sciences > 1 - 10

2010 43rd Hawaii International Conference on System Sciences (HICSS-43)

The extraordinary growth in both the size and popularity of the World Wide Web has created a growing interest not only in identifying Web page genres, but also in using these genres to classify Web pages. The hypothesis of this research is that an n-gram representation of a Web page can be used effectively to automatically classify that Web page by genre, even when the Web page belongs to more than...

chapter

Topic Distributions over Links on Web

Jie Tang, Jing Zhang, J.X. Yu, Zi Yang, et al.

2009 Ninth IEEE International Conference on Data Mining > 1010 - 1015

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

It is well known that Web users create links with different intentions. However, a key question, which is not well studied, is how to categorize the links and how to quantify the strength of the influence of a Web page on another if there is a link between the two linked Web pages. In this paper, we focus on the problem of link semantics analysis, and propose a novel supervised learning approach to...

chapter

Random forest classifier for multi-category classification of web pages

Win Thanda Aung, Khin Hay Mar Saw Hla

2009 IEEE Asia-Pacific Services Computing Conference (APSCC) > 372 - 376

2009 IEEE Asia-Pacific Services Computing Conference (APSCC 2009)

Web page classification is the automated assignment of a predefined subject category to a document. Automatic Web page classification is one of the most essential techniques for Web mining, given that the Web is a huge repository of varied information, including images, videos, etc., and Web pages need to be categorized to satisfy user needs. The classification of Web pages into each category...

chapter

Hierarchical Classification of Business Information on the Web Using Incremental Learning

Yi Wang, Zhiguo Gong, Jingzhi Guo

2009 IEEE International Conference on e-Business Engineering > 303 - 309

2009 IEEE International Conference on e-Business Engineering (ICEBE 2009)

The explosive growth of the Web makes it hard to organize and manage Web information automatically. Therefore, online learning methods such as incremental learning are gradually becoming effective instruments in practical applications. In our experiments, traditional incremental learning shows some flaws in the iterative process. To overcome the drawback caused by using only support vector to represent the whole former...

chapter

A novel Voting Algorithm of multi-class SVM for web page classification

P. Thamrongrat, L. Preechaveerakul, W. Wettayaprasit

2009 2nd IEEE International Conference on Computer Science and Information Technology > 327 - 331

2009 2nd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2009)

The increasing number of Web pages reduces the effectiveness of document retrieval in matching users' needs. Classifying Web pages is one solution to this problem. This paper proposes the VAMSVM_WPC model, a novel voting algorithm for classifying Web pages that uses a multi-class SVM method. First, feature is generated from text and...

chapter

CUCS: A Web Page Classification Algorithm for Large Training Set

Jing Wang, Hongming Cai, Boyi Xu, Lihong Jiang

2008 IFIP International Conference on Network and Parallel Computing > 440 - 445

2008 IFIP International Conference on Network and Parallel Computing

This paper presents a new Web page classification algorithm, CUCS (Combined UC and SVM), for large training sets. CUCS combines the advantages of SVM (Support Vector Machine) and UC (Unsupervised Clustering), achieving high precision and fast speed. In the training stage, CUCS obtains clustering centers, which include positive example centers and negative ones, by means of UC. Then CUCS prunes training...

chapter

Object image retrieval by exploiting online knowledge resources

Gang Wang, D. Forsyth

2008 IEEE Conference on Computer Vision and Pattern Recognition > 1 - 8

2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We describe a method to retrieve images found on Web pages with specified object class labels, using an analysis of text around the image and of image appearance. Our method determines whether an object is both described in text and appears in an image using a discriminative image model and a generative text model. Our models are learnt by exploiting established online knowledge resources (Wikipedia...

Filter options

Data set:
ieee
Keywords:
SUPPORT VECTOR MACHINES
INTERNET
WEB PAGES
TRAINING

Content availability

Available (12)
None (1)

Keywords

CLASSIFICATION ALGORITHMS (8)
PATTERN CLASSIFICATION (6)
DATA MINING (4)
INFORMATION RETRIEVAL (4)
SUPPORT VECTOR MACHINE (4)
WEB PAGE CLASSIFICATION (4)
WEB SITES (4)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
ALGORITHM DESIGN AND ANALYSIS (2)
CLASSIFICATION (2)
CLUSTERING ALGORITHM (2)
CLUSTERING ALGORITHMS (2)
DOCUMENT RETRIEVAL (2)
HTML (2)
MACHINE LEARNING (2)
PATTERN CLUSTERING (2)
RADIO FREQUENCY (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
SVM (2)
TEXT ANALYSIS (2)
UNSUPERVISED LEARNING (2)
WEB MINING (2)
WEB PAGE CLASSIFICATION ALGORITHM (2)
AUTOMATIC WEB PAGE CLASSIFICATION (1)
AUTOMATICALLY CHECKING SYSTEM (1)
BUSINESS CATEGORY (1)
BUSINESS DATA PROCESSING (1)
BUSINESS INFORMATION CLASSIFICATION (1)
CHEMISTRY (1)
CLASSIFICATION TREE ANALYSIS (1)
COMPANIES (1)
COMPONENT (1)
COMPUTATIONAL MODELING (1)
CONSTRUCTION INDUSTRY (1)
CONTEXT (1)
CORA DATASETS (1)
CORRELATION (1)
CRAWLERS (1)
CUCS (1)
DECISION TREE CLASSIFIER (1)
DECISION TREES (1)
DIRECTORY NAME (1)
DISCRIMINATIVE IMAGE MODEL (1)
DISTANCE FUNCTION CLASSIFICATION MODEL (1)
DISTANCE MEASUREMENT (1)
DOCUMENT PREDEFINED SUBJECT CATEGORY (1)
DOMAIN NAME (1)
E-GOVERNMENT SYSTEM (1)
ELECTRONIC GOVERNMENT (1)
ELECTRONIC PUBLISHING (1)
ENCODING (1)
ENCYCLOPEDIAS (1)
ENTITY RETRIEVAL (1)
ERROR ANALYSIS (1)
FEATURE (1)
FEATURE EXTRACTION (1)
FEATURE SELECTION (1)
FEATURE SELECTION METHODS (1)
GENERATIVE TEXT MODEL (1)
GLOBAL SCHEMA (1)
GOVERNMENT DATA PROCESSING (1)
GRAPH THEORY (1)
HUMAN CLASSIFIERS (1)
IMAGE RETRIEVAL (1)
INCREMENTAL LEARNING (1)
INDEXES (1)
INFORMATION EXTRACTION (1)
INFORMATION EXTRACTION PROCESS (1)
INFORMATION FILTERING (1)
INFORMATION FILTERS (1)
INFORMATION INTEGRATION (1)
INFORMATION SERVICES (1)
INTERNET-BASED ENTERPRISE ADMINISTRATION SYSTEM (1)
IP ADDRESS (1)
IP NETWORKS (1)
ITERATIVE METHODS (1)
ITERATIVE PROCESS (1)
KNOWLEDGE DISCOVERY (1)
LARGE TRAINING SET (1)
LINK ANALYSIS (1)
LINK SEMANTIC ANALYSIS (1)
LINK SEMANTICS ANALYSIS (1)
LINK-LABELED GRAPH (1)
LINK-WEIGHTED GRAPH (1)
LOCAL CO-TRAINING (1)
MACHINE LEARNING CLASSIFIERS (1)
MALICIOUS WEB PAGE IDENTIFICATION (1)
MULTI-CATEGORY WEB PAGE CLASSIFICATION (1)
MULTI-LABELED WEB PAGE GENRE CLASSIFICATION (1)
MULTICATEGORY CLASSIFICATION (1)
MULTICATEGORY WEB PAGE CLASSIFICATION (1)
MULTICLASS SUPPORT VECTOR MACHINE (1)
MULTILABELED DATA SET (1)
MULTIVIEW SEMI-SUPERVISED LEARNING ALGORITHM (1)
N-GRAM BASED APPROACH (1)
NATURAL LANGUAGE TEXT (1)

INFONA - science communication portal
