Search results

Items from 1 to 6 out of 6 results

chapter

An n-Gram Based Approach to Multi-Labeled Web Page Genre Classification

J.E. Mason, M. Shepherd, J. Duffy, V. Keselj, more

2010 43rd Hawaii International Conference on System Sciences > 1 - 10

2010 43rd Hawaii International Conference on System Sciences (HICSS-43)

The extraordinary growth in both the size and popularity of the World Wide Web has created a growing interest not only in identifying Web page genres, but also in using these genres to classify Web pages. The hypothesis of this research is that an n-gram representation of a Web page can be used effectively to automatically classify that Web page by genre, even when the Web page belongs to more than...

chapter

Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning

Jing Qiu, Lejian Liao, Peng Li

2009 International Conference on Intelligent Networking and Collaborative Systems > 219 - 225

2009 International Conference on Intelligent Networking and Collaborative Systems

Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. Supervised learning requires a large amount of labeled data, but the data labeling process can be expensive and time consuming. In this paper we present a kernel-based weakly supervised machine learning algorithm...

chapter

Obfuscated malicious javascript detection using classification techniques

P. Likarish, Eunjin Jung, Insoon Jo

2009 4th International Conference on Malicious and Unwanted Software (MALWARE) > 47 - 54

2009 4th International Conference on Malicious and Unwanted Software (MALWARE 2009)

As the World Wide Web expands and more users join, it becomes an increasingly attractive means of distributing malware. Malicious javascript frequently serves as the initial infection vector for malware. We train several classifiers to detect malicious javascript and evaluate their performance. We propose features focused on detecting obfuscation, a common technique to bypass traditional malware detectors...

chapter

Text classification in the Turkish marketing domain for context sensitive ad distribution

Melih Engin, T. Can

2009 24th International Symposium on Computer and Information Sciences > 105 - 110

2009 24th International Symposium on Computer and Information Sciences (ISCIS)

In this paper, we construct and compare several feature extraction approaches in order to find a better solution for classification of Turkish Web documents in the marketing domain. We produce our feature extraction techniques using characteristics of the Turkish language, structures of Web documents and online content in the marketing domain. We form datasets in different feature spaces and we apply...

chapter

Extracting Relations from Chinese Web Documents Using Kernel Methods

Jing Qiu, Lejian Liao

2009 Eighth IEEE/ACIS International Conference on Computer and Information Science > 352 - 356

2009 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS)

Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. In this paper, we present an automated system which could extract instances of an arbitrary given binary relation from Chinese Web documents in domain of football games. Different syntactic sources are combined by...

chapter

Sentiment Classification for Chinese Reviews Using Machine Learning Methods Based on String Kernel

Changli Zhang, Wanli Zuo, Tao Peng, Fengling He

2008 Third International Conference on Convergence and Hybrid Information Technology > 2 > 909 - 914

2008 Third International Conference on Convergence and Hybrid Information Technology (ICCIT)

Sentiment classification aims at mining reviews of people for a certain event's topic or product by automatic classifying the reviews into positive or negative opinions. With the fast developing of World Wide Web applications, sentiment classification would have huge opportunity to help people automatic analysis of customers' opinions from the web information. Automatic opinion mining will benefit...

Filter options

Data set:
ieee
Keywords:
SUPPORT VECTOR MACHINES
INTERNET
MACHINE LEARNING
WORLD WIDE WEB

Publication date

Set your own date range

Keywords

CLASSIFICATION ALGORITHMS (4)
DATA MINING (4)
FEATURE EXTRACTION (4)
KERNEL (4)
DOCUMENT HANDLING (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
ACCURACY (2)
INFORMATION RETRIEVAL (2)
KERNEL METHOD (2)
RELATION EXTRACTION (2)
TRAINING (2)
APPRAISAL (1)
ARBITRARY BINARY RELATION EXTRACTION (1)
ARTIFICIAL INTELLIGENCE (1)
AUTOMATIC OPINION MINING (1)
BOOTSTRAPPING (1)
CHINESE NEWS WEB DOCUMENTS (1)
CHINESE REVIEWS (1)
CHINESE WEB DOCUMENT (1)
CLASSIFICATION (1)
CLASSIFICATION TECHNIQUES (1)
COMPOSITE KERNELS (1)
COMPUTER GAMES (1)
COMPUTER SCIENCE (1)
CONTEXT SENSITIVE AD DISTRIBUTION (1)
CONVOLUTION (1)
DATA LABELING PROCESS (1)
DECISION MAKING (1)
DISTANCE FUNCTION CLASSIFICATION MODEL (1)
FEATURE EXTRACTION TECHNIQUES (1)
FOOTBALL GAME (1)
FOOTBALL GAMES (1)
GAME THEORY (1)
GAMES (1)
HTML (1)
HUMAN CLASSIFIERS (1)
INFORMATION FILTERS (1)
INVASIVE SOFTWARE (1)
KERNEL-BASED WEAKLY SUPERVISED MACHINE LEARNING ALGORITHM (1)
KNOWLEDGE ACQUISITION (1)
KNOWLEDGE ENGINEERING (1)
LEARNING SYSTEMS (1)
LINEAR KERNEL CLASSIFIERS (1)
MACHINE LEARNING CLASSIFIERS (1)
MALWARE (1)
MALWARE DETECTORS (1)
MARKETING DATA PROCESSING (1)
MULTI-LABELED WEB PAGE GENRE CLASSIFICATION (1)
MULTILABELED DATA SET (1)
N-GRAM BASED APPROACH (1)
NATURAL LANGUAGE PROCESSING (1)
OBFUSCATED MALICIOUS JAVASCRIPT DETECTION (1)
OPERATING SYSTEM KERNELS (1)
OPINION MINING (1)
PATTERN CLASSIFICATION (1)
PRINCIPAL COMPONENT ANALYSIS (1)
SEMANTIC ORIENTATION (1)
SENTIMENT CLASSIFICATION (1)
SOFTWARE (1)
STRING KERNEL (1)
SUPPORT VECTOR MACHINE (1)
SVM CLASSIFIER (1)
SYNTACTIC KERNELS (1)
SYNTACTIC SOURCE (1)
TEXT ANALYSIS (1)
TEXT CATEGORIZATION (1)
TEXT CLASSIFICATION (1)
TURKISH LANGUAGE (1)
TURKISH MARKETING DOMAIN (1)
TURKISH WEB DOCUMENTS (1)
WEAKLY SUPERVISED (1)
WEB PAGE (1)
WEB PAGE CORPUS (1)
WEB PAGES (1)
WEB SITES (1)
more

INFONA - science communication portal

Search results

An n-Gram Based Approach to Multi-Labeled Web Page Genre Classification

Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning

Obfuscated malicious javascript detection using classification techniques

Text classification in the Turkish marketing domain for context sensitive ad distribution

Extracting Relations from Chinese Web Documents Using Kernel Methods

Sentiment Classification for Chinese Reviews Using Machine Learning Methods Based on String Kernel

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options