Search results

Items from 1 to 6 out of 6 results

chapter

Classifying Web Pages Using Information Extraction Patterns Preliminary Results and Findings

Lay-Ki Soon, Sang Ho Lee

2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems > 195 - 202

Sixth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS 2010)

Web page classification plays an essential role in facilitating more efficient information retrieval and information processing. Conventionally, web text documents are represented by term frequency matrix for classification purpose. However, considering the limitations of representing documents using terms or keywords, we propose to represent web pages using information extraction patterns that are...

chapter

An event ontology construction approach to web crime mining

Li Cunhua, Hu Yun, Zhong Zhaoman

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2441 - 2445

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Along with the rapid popularity of the Internet, crime information on the web is becoming increasingly rampant, and the majority of them are in the form of text. Because a lot of crime information in documents is described through events, event-based semantic technology can be used to study the patterns and trends of web-oriented crimes. In our research project on cyber crime mining, we construct...

chapter

Comparison of Attribute Selection Methods for Web Texts Categorization

Rizauddin Saian, Ku Ruhana Ku-Mahamud

2010 Second International Conference on Computer and Network Technology > 115 - 118

2010 Second International Conference on Computer and Network Technology (ICCNT 2010)

This paper presents a study on the performance of attribute selection methods to be used with Ant-Miner algorithm for web text categorization. The new generated data set by each attribute selection method was classified with Ant-Miner to see the performance in terms of predictive accuracy and the number of rules generated. The results of classification were also compared to C4.5 algorithm.

article

Cross-Domain Learning from Multiple Sources: A Consensus Regularization Perspective

Fuzhen Zhuang, Ping Luo, Hui Xiong, Yuhong Xiong, more

IEEE Transactions on Knowledge and Data Engineering > 2010 > 22 > 12 > 1664 - 1678

Classification across different domains studies how to adapt a learning model from one domain to another domain which shares similar data characteristics. While there are a number of existing works along this line, many of them are only focused on learning from a single source domain to a target domain. In particular, a remaining challenge is how to apply the knowledge learned from multiple source...

chapter

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Shiliang Sun

2008 IEEE International Conference on Data Mining Workshops > 731 - 735

2008 IEEE International Conference on Data Mining Workshops

For multi-view learning, existing methods usually exploit originally provided features for classifier training, which ignore the latent correlation between different views. In this paper, semantic features integrating information from multiple views are extracted for pattern representation. Canonical correlation analysis is used to learn the representation of semantic spaces where semantic features...

chapter

Topic generation for web document summarization

Heng-Yao Hsu, Chun-Wei Tsai, Ming-Chao Chiang, Chu-Sing Yang

2008 IEEE International Conference on Systems, Man and Cybernetics > 3702 - 3707

2008 IEEE International Conference on Systems, Man and Cybernetics (SMC 2008)

Over the past decade, more and more users of the Internet rely on the search engines to help them find the information they need. However, the information they find depends, to a large extent, on the ranking mechanism of the search engines they use. Not surprisingly, it, in general, consists of a large amount of information that is completely irrelevant. To help users of the Internet find the information...

Filter options

Keywords:
DATA MINING
TEXT ANALYSIS
WEB PAGES

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

ACCURACY (3)
CLASSIFICATION (3)
CLASSIFICATION ALGORITHMS (3)
INTERNET (3)
WEB MINING (3)
INFORMATION RETRIEVAL (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
PATTERN CLASSIFICATION (2)
ACTIVE LEARNING (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANT-MINER ALGORITHM (1)
ATTRIBUTE EXTRACTION (1)
ATTRIBUTE SELECTION (1)
ATTRIBUTE SELECTION METHODS (1)
BAYESIAN METHODS (1)
C4.5 ALGORITHM (1)
CHINESE TEXT CLASSIFICATION (1)
CHINESE WEB PAGE (1)
CLASSIFICATION RULES (1)
CLASSIFIER TRAINING (1)
CLUSTERING ALGORITHMS (1)
CO-TESTING (1)
CO-TRAINING (1)
COMPUTER CRIME (1)
COMPUTER SCIENCE (1)
CONSENSUS REGULARIZATION LEARNING (1)
CONSENSUS REGULARIZATION. (1)
CORRELATION (1)
CRIME INFORMATION (1)
CROSS-DOMAIN LEARNING (1)
CROSS-DOMAIN LEARNING PERFORMANCE (1)
CYBER CRIME (1)
CYBER CRIME MINING (1)
DATA CHARACTERISTICS (1)
DATA DISTRIBUTION (1)
DECISION TREE (1)
DIGITAL SIGNAL PROCESSING (1)
DISTRIBUTED ALGORITHM (1)
DISTRIBUTED ALGORITHMS (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTION SCORING (1)
DOCUMENT HANDLING (1)
EVENT (1)
EVENT ONTOLOGY (1)
EVENT ONTOLOGY CONSTRUCTION (1)
EVENT-BASED SEMANTIC TECHNOLOGY (1)
FEATURE EXTRACTION (1)
GENETICS (1)
GOOGLE (1)
IMAGE CLASSIFICATION (1)
INDEXING (1)
INFORMATION EXTRACTION (1)
INFORMATION EXTRACTION PATTERNS (1)
INFORMATION GAIN (1)
INFORMATION PROCESSING (1)
INTERNET USERS (1)
LEARNING MODEL (1)
LOCAL CLASSIFIER (1)
LOGISTICS (1)
MACHINE LEARNING (1)
MATRIX ALGEBRA (1)
MULTI-VIEW LEARNING (1)
MULTIMEDIA COMPUTING (1)
MULTIPLE SOURCE DOMAIN (1)
MULTIPLE SOURCE DOMAINS (1)
MULTIVIEW SEMI-SUPERVISED (1)
ONTOLOGIES (1)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (1)
PATTERN REPRESENTATION (1)
PERFORMANCE GAIN (1)
PREDICTION CONSENSUS (1)
PRIVACY (1)
PROBABILITY DENSITY FUNCTION (1)
RANKING MECHANISM (1)
SEARCH ENGINES (1)
SEARCH METHODS (1)
SEMANTIC FEATURES (1)
SINGLE SOURCE DOMAIN (1)
SUPPORT VECTOR MACHINE (1)
SUPPORT VECTOR MACHINES (1)
SVM CLASSIFICATION (1)
TERM FREQUENCY MATRIX (1)
TEXT CLASSIFICATION (1)
TOPIC GENERATION (1)
TRAINING (1)
TV (1)
USER QUERY (1)
WEB CLASSIFICATION (1)
WEB CRIME MINING (1)
WEB DOCUMENT SUMMARIZATION (1)
WEB PAGE CLASSIFICATION (1)
WEB SITES (1)
WEB TEXT DOCUMENTS (1)
WEB TEXTS CATEGORIZATION (1)
WEB-ORIENTED CRIME (1)
more

INFONA - science communication portal

Search results

Classifying Web Pages Using Information Extraction Patterns Preliminary Results and Findings

An event ontology construction approach to web crime mining

Comparison of Attribute Selection Methods for Web Texts Categorization

Cross-Domain Learning from Multiple Sources: A Consensus Regularization Perspective

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Topic generation for web document summarization

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options