This paper presents a new keyword extraction algorithm for Chinese news Web pages that uses lexical chains and word co-occurrence combined with frequency features, cohesion features, and correlation features. A lexical chain is a sequence of semantically related words that reflects the cohesive structure of a text, and is the
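The snippet above combines frequency and co-occurrence features to score keywords. A minimal sketch of that idea follows; the sample text, the sentence-level co-occurrence window, and the weights `alpha`/`beta` are hypothetical illustrations, not the paper's actual features or formulas.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy text; the paper works on Chinese news pages.
text = "chains link words. words form chains. chains help extraction."
sentences = [s.split() for s in text.strip(".").split(". ")]

# Frequency feature: raw term frequency across the whole text.
tf = Counter(w for s in sentences for w in s)

# Co-occurrence feature: how often two words share a sentence.
cooc = Counter()
for s in sentences:
    for a, b in combinations(sorted(set(s)), 2):
        cooc[(a, b)] += 1

def score(word, alpha=1.0, beta=0.5):
    # Linear mix of the two features (weights are hypothetical).
    co = sum(c for pair, c in cooc.items() if word in pair)
    return alpha * tf[word] + beta * co

ranked = sorted(tf, key=score, reverse=True)
```

A word that is both frequent and well connected (here, "chains") rises to the top of the ranking.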
quality of information retrieval. The contributions of our research are twofold. First, the existing ranking algorithms of search engines are classified, and we extend the expression of queries by “keyword and ”, instead of keywords only. Second, a new ranking algorithm based on user feedback and semantic tags is
In order to improve the search results of Web pages and enhance Web crawling, this paper proposes Web page clustering based on search keywords, which first employs the matching degree between Web pages and search keywords to decide the order in which result pages are shown. Then
In this work, we compare various text-based pornographic Web filtering techniques, including blacklist and keyword blocking. The technique called SV is modified to extract a representative feature vector. Each test Web page's features are extracted and gathered as a vector. The vector is then summarized
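Keyword blocking, the simplest of the techniques being compared, can be sketched as follows; the blocklist and the hit threshold are hypothetical placeholders, not the paper's data.

```python
# Hypothetical blocklist; a real filter would use a curated list.
BLOCKLIST = {"badword1", "badword2"}

def should_block(page_text: str, threshold: int = 1) -> bool:
    """Flag the page if it contains at least `threshold` blocklisted words."""
    words = page_text.lower().split()
    hits = sum(1 for w in words if w in BLOCKLIST)
    return hits >= threshold
```

Blacklist blocking works the same way at the URL level: the page is rejected when its host appears in a list of known bad domains.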
This paper explores a unique way in which the thinking algorithm adds an extra logical substrate to a Web search query using artificial intelligence. Rather than relying on keyword search alone, the algorithm tries to assess the user's motives behind entering a query. The algorithm tries to find the reasons as
In this paper, the current classification is reclassified through K-means based on feedback from Web usage mining, in order to improve the accuracy of news recommendation and the convergence of classification. This approach can extract the most relevant keywords and eliminate the disturbance of multi-vocal
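The reclassification step can be illustrated with a toy K-means over keyword-count vectors. This is a generic sketch, not the paper's pipeline: the vocabulary, documents, and squared-Euclidean distance are assumptions for illustration.

```python
import random

# Hypothetical keyword vocabulary and news snippets.
VOCAB = ["goal", "match", "election", "vote"]
docs = ["goal match goal", "match goal", "election vote", "vote election vote"]

def tf_vector(text):
    """Represent a document by its keyword counts."""
    words = text.lower().split()
    return [words.count(t) for t in VOCAB]

def kmeans(points, k, iters=20, seed=0):
    """Plain K-means with squared Euclidean distance."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center.
        for j, p in enumerate(points):
            labels[j] = min(
                range(k),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centers[i])),
            )
        # Update step: each center becomes the mean of its cluster.
        for i in range(k):
            members = [points[j] for j in range(len(points)) if labels[j] == i]
            if members:
                centers[i] = [sum(d) / len(members) for d in zip(*members)]
    return labels

labels = kmeans([tf_vector(d) for d in docs], k=2)
```

The two sports snippets land in one cluster and the two politics snippets in the other; in the paper's setting, usage-mining feedback would adjust the keywords before re-clustering.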
Domain-specific search focuses on one area of knowledge. Applying broad-based ranking algorithms to vertical search domains is not desirable, because the broad-based ranking model builds upon data from multiple domains existing on the web. Vertical search engines attempt to use a focused crawler that indexes only web pages relevant to a predefined topic. With the Ranking Adaptation Model, one can adapt an...
of HTML pages, and the proposed algorithms are applied. A complete evaluation indicates the effectiveness of our technique. The experimental results show improved precision and recall for the proposed algorithms with respect to keyword-based search. The algorithms are implemented in Java, and its
Current classification techniques use word matching and clustering to classify webpages. These techniques take an ad hoc approach of checking and matching all the keywords in a webpage for classification. These methods are efficient but not without problems. In general, they suffer from the following
Web page classification plays an essential role in facilitating more efficient information retrieval and information processing. Conventionally, web text documents are represented by a term-frequency matrix for classification purposes. However, considering the limitations of representing documents using terms or keywords
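The conventional term-frequency representation mentioned above can be sketched in a few lines; the document collection here is hypothetical.

```python
# Hypothetical document collection.
docs = [
    "web page classification with keywords",
    "keywords help web search",
]

# Vocabulary: every distinct term in the collection, in a fixed order.
vocab = sorted({w for d in docs for w in d.split()})

# Term-frequency matrix: tf[i][j] = count of vocab[j] in docs[i].
tf = [[d.split().count(t) for t in vocab] for d in docs]
```

Each row is then fed to a classifier. The limitation the snippet alludes to is that this representation treats terms as independent symbols, so synonyms and polysemous words are not handled.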
rely on indexing web pages, so the information obtained by the tourist is still unsatisfactory because it only shows web pages whose keywords appear in the article. A support system that recognizes tourism places on web pages is required to produce a better information presentation. In this study, the recognition
done on a set of data is chosen to form the basis, as is done with keywords. If the base data is chosen arbitrarily, the method is automatic, whereas if some 'knowledge' or 'background' informs the choice, it is adaptive. Statistical features of the images are extracted from the pixel map of the image. The extracted features are
the websites into their most appropriate category. Several parameters, such as the weight applied to each feature and the keywords used to classify the websites, were tuned to yield better results. The experimental evaluation revealed that the implemented method provides very high accuracy. In particular, we obtained an
Traditional automatic classifiers often make misclassifications. Folksonomy, a new manual classification scheme based on the tagging efforts of users with freely chosen keywords, can effectively resolve this problem. Even though the scalability of folksonomy is much higher than that of other manual classification schemes, the
Traditional information retrieval (IR) methods use keyword matching to filter documents, but often retrieve unrelated Web pages. In order to effectively classify Web pages, we present a Web page categorization algorithm named WebPSC (Web page similarity categorization). This algorithm uses latent semantic
obtain the latent semantic structure of the original term-document matrix, solving the problem of polysemous and synonymous keywords. LS-SVM is an effective method for learning classification knowledge from massive data, especially when labeled examples are costly to obtain. We adopt a novel method of Web page
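The latent-semantic step described above is conventionally computed with a truncated SVD of the term-document matrix: terms with similar usage patterns collapse to nearby points in the low-rank space. The toy matrix below is an illustration only, and the LS-SVM classifier itself is not reproduced.

```python
import numpy as np

# Toy term-document count matrix (rows = terms, columns = documents).
# "car" and "automobile" are synonyms with identical usage patterns.
A = np.array([
    [2, 1, 0],   # car
    [2, 1, 0],   # automobile
    [0, 0, 2],   # election
], dtype=float)

# Truncated SVD: keep only the top-k singular directions.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
k = 2
term_vecs = U[:, :k] * s[:k]   # term coordinates in the latent space
```

In this space the two synonyms have identical coordinates while the unrelated term is orthogonal to them, which is what lets the latent-semantic representation sidestep exact keyword matching.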