Search results

Items from 1 to 6 out of 6 results

chapter

Gender Classification with Data Independent Features in Multiple Languages

Tim Isbister, Lisa Kaati, Katie Cohen

2017 European Intelligence and Security Informatics Conference (EISIC) > 54 - 60

Gender classification is a well-researched problem, and state-of-the-art implementations achieve an accuracy of over 85%. However, most previous work has focused on gender classification of texts written in the English language, and in many cases, the results cannot be transferred to different datasets since the features used to train the machine learning models are dependent on the data. In this...

chapter

A Model for Automatically Detecting and Blocking Pornographic Websites

Thuy-An Dinh, Tan-Binh Ngo, Duc-Lung Vu

2015 Seventh International Conference on Knowledge and Systems Engineering (KSE) > 244 - 249

Preventing juveniles from accessing pornographic web pages remains a problem in Vietnam. The existing tools fail to block these Vietnamese sites automatically and rely only on manually configured blacklists and whitelists. Vietnamese and English differ in both syntax and semantics; therefore, applying methods designed for English to Vietnamese will be much less effective...

chapter

The web software mining based on vector space model

Feijie Wang, Zhongying Bai

2009 First International Conference on Future Information Networks > 275 - 279

2009 First International Conference on Future Information Networks. ICFIN 2009

The article designs a Web software mining system, discusses the techniques used in the system, and proposes solutions to issues in the system. A Web crawler has been designed and implemented according to the features of the World Wide Web. Based on the information presented by Web pages, the article improves the feature selection method and the keyword weighting algorithm using Web text mining techniques...

chapter

Web Pages Classification and Clustering by Means of Genetic Algorithms: A Variable Size Page Representing Approach

Z. Hossaini, A.M. Rahmani, S. Setayeshi

2008 International Conference on Computational Intelligence for Modelling Control & Automation > 436 - 440

2008 International Conference on Computational Intelligence for Modelling Control & Automation (CIMCA 2008)

Arranging masses of data into related groups helps us make better decisions about them. Clustering and classification are two efficient methods of grouping huge volumes of data. Most clustering and classification methods that address Web page grouping problems use fixed-size vectors in their learning algorithms. In the real world of the WWW, this assumption is not reliable. In this paper...

chapter

Extracting Subjective and Objective Evaluative Expressions from the Web

T. Nakagawa, T. Kawada, K. Inui, S. Kurohashi

2008 Second International Symposium on Universal Communication > 251 - 258

There are various opinions on the Web, and analyzing them is an important task. Although many previous studies focused on analyzing subjective evaluative expressions, objective evaluative expressions, which describe positive or negative facts, are also informative. In this paper, we study the extraction and classification of subjective and objective evaluative expressions in Japanese Web documents...

chapter

Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia

J.-H. Oh, D. Kawahara, K. Uchimoto, J. Kazama, et al.

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 1 > 322 - 328

We present a novel method for discovering missing cross-language links between English and Japanese Wikipedia articles. We collect candidates of missing cross-language links -- a pair of English and Japanese Wikipedia articles, which could be connected by cross-language links. Then we select the correct cross-language links among the candidates by using a classifier trained with various types of features...

INFONA - science communication portal