Search results

Items from 1 to 5 out of 5 results

chapter

Web-based keyword adapted Language Modeling for Keyword Spotting

Wenzhu Shen, Ji Wu, Wei Li

2010 7th International Symposium on Chinese Spoken Language Processing > 251 - 255

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

Language Model (LM) constitutes one of the key components in Keyword Spotting (KWS). The rapid development of the World Wide Web (WWW) makes it an extremely large and valuable data source for LM training, but it is not optimal to use the raw transcripts from WWW due to the mismatch of content between the web corpus

chapter

Using a Semi-automatic Keyword Dictionary for Improving Violent Web Site Filtering

R. Guermazi, M. Hammami, A. Ben Hamadou

2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System > 337 - 344

2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System (SITIS)

system called "WebAngels filter" which uses textual and structural content-based analysis. These analysis are based on a violent keyword dictionary. We focus our attention on the keyword dictionary preparation, and we demonstrate that a semi-automatic keyword dictionary can be used to improve the filtering efficiency of

chapter

Experimental studies on pornographic web filtering techniques

C. Chantrapornchai, C. Promsombat, T. Charuenrutsatien, K. Suttirut

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 109 - 112

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

In this work, we compare various text-based pornographic Web filtering techniques. The techniques include blacklist and keyword blocking. The technique called SV is modified to extract a representative feature vector. Each test Web pagepsilas feature is extracted and gathered as a vector. The vector is then summarized

article

Linking Documents to Encyclopedic Knowledge

A. Csomai, R. Mihalcea

IEEE Intelligent Systems > 2008 > 23 > 5 > 34 - 41

important words or phrases in the text to other pages, thereby letting users quickly access additional information. An automatic text-annotation system combines keyword extraction and word-sense disambiguation to identify relevant links to Wikipedia pages.

chapter

Crawl Topical Vietnamese Web Pages Using Genetic Algorithm

Nguyen Quoc Nhan, Vu Tuan Son, Huynh Thi Thanh Binh, Tran Duc Khanh

2010 Second International Conference on Knowledge and Systems Engineering > 217 - 223

2010 Second International Conference on Knowledge and Systems Engineering (KSE)

performance. Apart from estimating the best path to follow, our system also expands its initial keywords by using genetic algorithm during the crawling process. To crawl Vietnamese web pages, we apply a hybrid word segmentation approach which consists of combining automata and part of speech tagging techniques for the Vietnamese

Filter options

Keywords:
WEB SITES
TEXT ANALYSIS
WEB PAGES

Publication date

Set your own date range

Publication type

book (4)
article (1)

Keywords

INTERNET (4)
FILTERING (2)
INFORMATION RETRIEVAL (2)
SEARCH ENGINES (2)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ARRAYS (1)
AUTOMATA (1)
AUTOMATA THEORY (1)
AUTOMATIC TEXT-ANNOTATION SYSTEM (1)
BIOLOGICAL CELLS (1)
BLACKLIST (1)
BUSINESS (1)
CLASSIFICATION ALGORITHMS (1)
COMPUTERS IN EDUCATION (1)
CRAWL TOPICAL VIETNAMESE WEB PAGES (1)
CRAWLERS (1)
DATA MODELS (1)
DATA SELECTION (1)
DATA-MINING (1)
DETECTION ERROR TRADEOFF CURVE (1)
DICTIONARIES (1)
DISTANCE MEASUREMENT (1)
EDUCATION (1)
ENCYCLOPAEDIAS (1)
ENCYCLOPEDIC KNOWLEDGE (1)
FOCUSED CRAWLER (1)
GENETIC ALGORITHM (1)
GENETIC ALGORITHMS (1)
GUIDELINES (1)
HARMFUL WEB PAGES (1)
HTML (1)
HYBRID WORD SEGMENTATION (1)
IMPROPER WEB FILTERING (1)
INDUSTRIAL PLANTS (1)
INFORMATION FILTERING (1)
INFORMATION FILTERS (1)
INTERNET BROWSING (1)
JOINING PROCESSES (1)
KEYWORD (1)
KEYWORD BLOCKING (1)
KEYWORD EXTRACTION (1)
KEYWORD SPECIFIC CORPUS (1)
KEYWORD SPOTTING (1)
MANUALS (1)
MIXTURE LANGUAGE MODEL (1)
N-GRAMS (1)
NATURAL LANGUAGE PROCESSING (1)
ONLINE REPOSITORY (1)
PORNOGRAPHIC WEB FILTERING (1)
PORNOGRAPHY (1)
PREDEFINED KEYWORD LIST (1)
PROXY SERVER (1)
QUERY PROCESSING (1)
RACISM (1)
REPRESENTATIVE FEATURE VECTOR (1)
SEARCH ENGINE QUERY (1)
SEMIAUTOMATIC KEYWORD DICTIONARY (1)
SERVERS (1)
SIMILARITY VECTOR (1)
SPEECH TAGGING (1)
STRUCTURAL CONTENT-BASED ANALYSIS (1)
TERMINOLOGY (1)
TEXT ANNOTATION (1)
TEXT CATEGORIZATION (1)
TEXTUAL CONTENT-BASED ANALYSIS (1)
TRAINING (1)
TRAINING DATA (1)
TWO-STEP DATA SELECTION METHOD (1)
VIETNAMESE TEXT CLASSIFIER (1)
VIETNAMESE WEBSITES (1)
VIETNAMESE WORD SEGMENTATION (1)
VIOLENCE (1)
VIOLENT WEB CONTENT DETECTION (1)
VIOLENT WEB CONTENT FILTERING SYSTEM (1)
VIOLENT WEB SITE FILTERING (1)
VIOLENT WEB SITES FILTERING (1)
WEB BASED KEYWORD ADAPTED LANGUAGE MODELING (1)
WEB CLASSIFICATION (1)
WEB CLASSIFICATION AND CATEGORIZATION (1)
WEB CORPUS (1)
WEB PAGE (1)
WEB TEXTUAL AND STRUCTURAL CONTENT (1)
WEBANGELS FILTER (1)
WIKIPEDIA (1)
WORD-SENSE DISAMBIGUATION (1)
WORLD WIDE WEB (1)
more

INFONA - science communication portal

Search results

Web-based keyword adapted Language Modeling for Keyword Spotting

Using a Semi-automatic Keyword Dictionary for Improving Violent Web Site Filtering

Experimental studies on pornographic web filtering techniques

Linking Documents to Encyclopedic Knowledge

Crawl Topical Vietnamese Web Pages Using Genetic Algorithm

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options