Search results

Items from 1 to 15 out of 15 results

chapter

Web-based keyword adapted Language Modeling for Keyword Spotting

Wenzhu Shen, Ji Wu, Wei Li

2010 7th International Symposium on Chinese Spoken Language Processing > 251 - 255

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

Language Model (LM) constitutes one of the key components in Keyword Spotting (KWS). The rapid development of the World Wide Web (WWW) makes it an extremely large and valuable data source for LM training, but it is not optimal to use the raw transcripts from WWW due to the mismatch of content between the web corpus

chapter

Using a Semi-automatic Keyword Dictionary for Improving Violent Web Site Filtering

R. Guermazi, M. Hammami, A. Ben Hamadou

2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System > 337 - 344

2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System (SITIS)

system called "WebAngels filter" which uses textual and structural content-based analysis. These analyses are based on a violent keyword dictionary. We focus our attention on the keyword dictionary preparation, and we demonstrate that a semi-automatic keyword dictionary can be used to improve the filtering efficiency of

chapter

Extracting and Clustering Related Keywords based on History of Query Frequency

T. Onoda, T. Yumoto, K. Sumiya

2008 Second International Symposium on Universal Communication > 162 - 166

2008 Second International Symposium on Universal Communication

Query-recommendation systems based on inputted queries have become widespread. These services are effective if users cannot input relevant queries. However, the conventional systems do not take into consideration the relevance between recommended queries. This paper proposes a method of obtaining related queries and clustering them by using the history of query frequencies in query logs. We define...

chapter

Experimental studies on pornographic web filtering techniques

C. Chantrapornchai, C. Promsombat, T. Charuenrutsatien, K. Suttirut

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology > 1 > 109 - 112

2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

In this work, we compare various text-based pornographic Web filtering techniques. The techniques include blacklist and keyword blocking. The technique called SV is modified to extract a representative feature vector. Each test Web page's feature is extracted and gathered as a vector. The vector is then summarized

chapter

A multi-language search scheme using a multithread processing for Yahoo Image search

A. Tungkasthan, S. Intarasema, W. Premchaisawadi

2009 Eighth International Symposium on Natural Language Processing > 30 - 34

2009 Eighth International Symposium on Natural Language Processing. SNLP 2009

designed and implemented to resolve the problem of cross-language queries and image retrieval processes. It can greatly reduce the time and effort needed for the search. The experiments on diverse queries on Yahoo image search have shown that the proposed scheme can improve the image results for non-English keyword

chapter

Automatic Extraction of Useful Facet Hierarchies from Text Databases

W. Dakka, P.G. Ipeirotis

2008 IEEE 24th International Conference on Data Engineering > 466 - 475

2008 IEEE 24th International Conference on Data Engineering (ICDE '08)

to keyword searching. Thus far, the identification of the facets was either a manual procedure, or relied on a priori knowledge of the facets that can potentially appear in the underlying collection. In this paper, we present an unsupervised technique for automatic extraction of facets useful for browsing text databases

article

Linking Documents to Encyclopedic Knowledge

A. Csomai, R. Mihalcea

IEEE Intelligent Systems > 2008 > 23 > 5 > 34 - 41

important words or phrases in the text to other pages, thereby letting users quickly access additional information. An automatic text-annotation system combines keyword extraction and word-sense disambiguation to identify relevant links to Wikipedia pages.

chapter

Crawl Topical Vietnamese Web Pages Using Genetic Algorithm

Nguyen Quoc Nhan, Vu Tuan Son, Huynh Thi Thanh Binh, Tran Duc Khanh

2010 Second International Conference on Knowledge and Systems Engineering > 217 - 223

2010 Second International Conference on Knowledge and Systems Engineering (KSE)

performance. Apart from estimating the best path to follow, our system also expands its initial keywords by using a genetic algorithm during the crawling process. To crawl Vietnamese web pages, we apply a hybrid word segmentation approach which combines automata and part-of-speech tagging techniques for the Vietnamese

chapter

Narratives: A visualization to track narrative events as they develop

D. Fisher, A. Hoff, G. Robertson, M. Hurst

2008 IEEE Symposium on Visual Analytics Science and Technology > 115 - 122

2008 IEEE Symposium on Visual Analytics Science and Technology (VAST)

their historical and social context by understanding how the major topics associated with them have changed over time. Users can relate articles through time by examining the topical keywords that summarize a specific news event. By tracking the attention to a news article in the form of references in social media (such as

chapter

Tag Suggestion Method Based on Association Pattern and Bigram Approach

Hyunwoo Kim, Kangpyo Lee, Hyopil Shin, Hyoung-Joo Kim

2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing > 63 - 68

2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing (SNPD)

videos, we can only use a title. If there are tags (significant keywords of that multimedia), we can use tag information to search. A tag is a keyword of a text, blog post, or multimedia item. Users already recognize the value and importance of tags, but only a few users are using them. It might be annoying to add tags

chapter

Simple linguistic processing effect on multi-label emotion classification

Ye Wu, F. Ren

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

events. A huge resource of text-based emotion can now be found on the World Wide Web. This paper reports a study investigating the effectiveness of using SVM (Support Vector Machine) on linguistic features considering emotion keywords and negative words, and classifies a collection of blog post sentences tagged

chapter

Building a semantic model of a textual document for efficient search and retrieval

E. Nyamsuren, Ho-Jin Choi

2009 11th International Conference on Advanced Communication Technology > 1 > 298 - 302

2009 11th International Conference on Advanced Communication Technology

This paper describes a new approach to enhancing textual document search and retrieval. The approach tries to take advantage of structured query languages in search and retrieval. For this purpose, a semantic model of the document is created. The semantic model is an ontology-like structured semantic annotation of the document that can support structured querying. This paper discusses...

chapter

A Method of Semantic Dictionary Construction from On-line Encyclopedia Classifications

Yun Li, Fang Tian, F. Ren, S. Kuroiwa, et al.

2007 International Conference on Natural Language Processing and Knowledge Engineering > 82 - 89

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

This paper introduces a method of constructing a semantic dictionary automatically from the keywords and classification relations of the web encyclopedia Chinese Wikipedia. Semantic units, which are affixes (core/modifier) shared between many phrased-keywords, are selected using a statistical method and string affix matching

chapter

Context-based term identification and extraction for ontology construction

Hui-Ngo Goh, Ching-Chieh Kiu

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering (NLPKE-2010) > 1 - 7

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

with its topic-specific keywords. A hierarchical relationship of super-topics and sub-topics is defined by a taxonomy, while Wikipedia is used to provide context and background knowledge for the topics defined in the taxonomy to guide the term identification and extraction. The experimental results have shown the

chapter

Scene Extraction for Video Clips Based on the Relation of Text, Pointing Region and Temporal Duration of User Comments

S. Wakamiya, D. Kitayama, K. Sumiya

2009 20th International Workshop on Database and Expert Systems Application > 289 - 294

2009 20th International Workshop on Database and Expert Systems Application. DEXA 2009

degree of relevancy for the user than is currently available with conventional methods, for example, using matching keywords. We describe here our method and the relation between the scenes and discuss a prototype system.

Filter keywords: WEB SITES, TEXT ANALYSIS

INFONA - science communication portal
