Search results

Items from 1 to 8 out of 8 results

chapter

Keyword Extraction Using PageRank on Synonym Networks

Zhengyang Liu, Jianyi Liu, Wenbin Yao, Cong Wang

2010 International Conference on E-Product E-Service and E-Entertainment > 1 - 4

2010 International Conference on E-Product E-Service and E-Entertainment (ICEEE 2010)

Keyword extraction is an important application in the area of information technology. Automatic keyword extraction can help people know what is the article primarily talking about without reading the long passage carefully. This paper mainly introduced a keyword extraction algorithm using pagerank on Synonym. Firstly

chapter

Information Retrieval in Multilingual Environment

S.M. Chaware, S. Rao

2009 Second International Conference on Emerging Trends in Engineering&Technology > 648 - 652

2009 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET 2009)

multilingual information where backend will be English database and front-end uses local languages like Hindi, Marathi or Gujrathi. Our system provides an interface to enter a keyword in local language, the keyword will be parsed, query will be formed and display the result in local language. We had developed an efficient

chapter

Intelligent information mining from veterinary clinical records and open source repository

P. Tangtulyangkul, T.S. Hocking, Chun Che Fung

TENCON 2009 - 2009 IEEE Region 10 Conference > 1 - 6

TENCON 2009. 2009 IEEE Region 10 Conference

utilizes text-mining, Web service technologies and domain knowledge, in order to extract keywords, to retrieve related records from an external source, and to filter the extracted keywords list. This study meets a practical challenge encountered at the School of Veterinary and Biomedical Sciences at Murdoch University. The

chapter

Implementation of Web Crawler

P. Gupta, K. Johari

2009 Second International Conference on Emerging Trends in Engineering&Technology > 838 - 843

2009 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET 2009)

agent that targets a particular topic and visits and gathers only relevant web pages. In this dissertation I had worked on design and working of web crawler that can be used for copyright infringement. We will take one seed URL as input and search with a keyword, the searching result is based on keyword and it will fetch

chapter

Discovery of Maximally Frequent Tag Tree Patterns with Height-Constrained Variables from Semistructured Web Documents

Y. Suzuki, T. Miyahara, T. Shoudai, T. Uchida, more

International Workshop on Challenges in Web Information Retrieval and Integration > 104 - 112

Proceedings. International Workshop on Challenges in Web Information Retrieval and Integration

structured patterns in semistructured Web documents. A tag tree pattern is an edge labeled tree with ordered children and structured variables. An edge label of a tag tree pattern is a tag or a keyword in Web documents, or a wildcard for any string. Each variable, which matches any subtree, represents a field of a Web document

chapter

Aggregation of Information Resources on the Invisible Web

Gang Li, Guangzeng Kou

2009 Second International Workshop on Knowledge Discovery and Data Mining > 773 - 776

2009 Second International Workshop on Knowledge Discovery and Data Mining. WKDD 2009

There are huge numbers of valuable information resources resided on Invisible Web. However, it is hard to use for us. In this paper we propose a system called NewsReaper that is capable of making Invisible Web to be visible, especially the huge number of real-time information, which update frequently and are time-sensitive. NewsReaper makes use of information extraction, text classification, full...

chapter

Mining Multilingual Texts using Growing Hierarchical Self-Organizing Maps

Hsin-Chang Yang, Ding-Wen Chen, Chung-Hong Lee

2007 International Conference on Machine Learning and Cybernetics > 4 > 2263 - 2268

Sixth International Conference on Machine Learning Cybernetics

use a set of parallel corpora to train the map and apply a discovering process to identify the semantic groups and hierarchical structures of keywords for these languages. The discovered knowledge can then be applied to tasks such as multilingual information retrieval and automatic multilingual thesaurus construction.

chapter

Mining Unstructured Web Pages to Enhance Web Information Retrieval

Chung-Hong Lee

First International Conference on Innovative Computing, Information and Control - Volume I (ICICIC'6) > 2 > 429 - 432

First International Conference on Innovative Computing, Information and Control

automatically constructs a navigational structure for the WWW to help information finding. A self-organizing map is constructed to train the Web pages and obtain two feature maps, which reveal the relationships among Web pages and thematic keywords respectively. We then use these maps to develop a structure that may assist the

Filter options

Content availability:
None
Keywords:
INFORMATION RETRIEVAL
DATA MINING

Publication date

Set your own date range

Keywords

INTERNET (6)
ALGORITHM DESIGN AND ANALYSIS (3)
DATABASES (3)
WEB SITES (3)
DOCUMENT HANDLING (2)
INFORMATION SERVICES (2)
SEARCH ENGINES (2)
SELF-ORGANISING FEATURE MAPS (2)
WEB INFORMATION RETRIEVAL (2)
WEB PAGES (2)
AUTOMATA (1)
AUTOMATIC KEYWORD EXTRACTION (1)
AUTOMATIC MULTILINGUAL THESAURUS CONSTRUCTION (1)
BOYER-MOORE ALGORITHM (1)
BREADTH-FIRST SEARCH (1)
CAMPUS RECRUITMENT INFORMATION (1)
CANCER (1)
CHARACTERISTIC TREE STRUCTURED PATTERNS (1)
CLASSIFICATION (1)
CLINICAL DATABASE (1)
COMPUTATIONAL LINGUISTICS (1)
COPYRIGHT INFRINGEMENT (1)
CRAWLERS (1)
DOCUMENT CONTENT (1)
DOCUMENT SEARCHING (1)
DOMAIN KNOWLEDGE (1)
EDGE LABELED TREE (1)
ENCODING (1)
ENGLISH DATABASE (1)
FINITE AUTOMATA (1)
FINITE AUTOMATA ALGORITHM (1)
FREQUENT PATTERN DISCOVERY (1)
FULL TEXT INDEX (1)
GENERAL-PURPOSE SEARCH ENGINES (1)
GROWING HIERARCHICAL SELF-ORGANIZING MAP (1)
GROWING HIERARCHICAL SELF-ORGANIZING MAPS (1)
GUJRATHI (1)
HEIGHT-CONSTRAINED VARIABLES (1)
HINDI (1)
HTML (1)
HYPERMEDIA MARKUP LANGUAGES (1)
INDIAN LANGUAGE (1)
INFORMATION EXTRACTION (1)
INFORMATION FILTERING (1)
INFORMATION NAVIGATION (1)
INFORMATION RESOURCE AGGREGATION (1)
INFORMATION RESOURCES (1)
INFORMATION RESOURCES AGGREGATION (1)
INFORMATION STORAGE (1)
INFORMATION TECHNOLOGY (1)
INTELLIGENT INFORMATION MINING (1)
INTERESTING RELATIONSHIP DISCOVERY (1)
INVISIBLE WEB (1)
JOINING PROCESSES (1)
KEYBOARDS (1)
KEYWORD (1)
KEYWORD EXTRACTION (1)
KEYWORDS-INVISIBLE WEB (1)
KNUTT-MORRI-PRATT ALGORITHM (1)
LANGUAGE DIVERSITY (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LOCAL LANGUAGE RETRIEVAL (1)
LOCAL LANGUAGE STORAGE (1)
MACHINE LEARNING APPROACH (1)
MARATHI (1)
MAXIMALLY FREQUENT TAG TREE PATTERNS (1)
MEDICAL INFORMATION SYSTEMS (1)
MULTILINGUAL ENVIRONMENT (1)
MULTILINGUAL INFORMATION PROCESSING (1)
MULTILINGUAL INFORMATION RETRIEVAL (1)
MULTILINGUAL TEXT DOCUMENT CLUSTERING (1)
MULTILINGUAL TEXT MINING (1)
NATURAL LANGUAGE INTERFACES (1)
NETWORK THEORY (GRAPHS) (1)
NEWSREAPER (1)
OPEN SOURCE REPOSITORY (1)
ORGANIZATIONS (1)
PAGERANK (1)
PATTERN CLUSTERING (1)
PATTERN MATCHING (1)
PATTERN RECOGNITION ALGORITHMS (1)
PREDICTION ALGORITHMS (1)
PROTOTYPES (1)
QUERY (1)
QUERY FORMATION (1)
QUERY FORMULATION (1)
QUERY PROCESSING (1)
REAL-TIME INFORMATION (1)
RECRUITMENT (1)
RSS TECHNOLOGIES (1)
SELF-ORGANIZING FEATURE MAP CONSTRUCTION (1)
SEMISTRUCTURED WEB DOCUMENTS (1)
STRUCTURED VARIABLES (1)
SYNONYM CO-OCCURRENCE NETWORK (1)
TEXT ANALYSIS (1)
TEXT CLASSIFICATION (1)
TEXT-MINING (1)
THESAURI (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options