Online advertisements (ads) have taken over the web; nowadays most websites contain some form of ads. While ads produce revenue for site operators and businesses, they have become more intrusive and dangerous than ever. Ads consume extra bandwidth, show inappropriate content, and spread malware such as adware and ransomware. Although there are many products to block ads, also known as ad blockers,...
Web crawlers have been misused for malicious purposes such as downloading server data without permission from the website administrator. In this paper, based on the observation that normal users and malicious crawlers exhibit different short-term and long-term download behaviors, we develop a new anti-crawler mechanism called PathMarker to detect and constrain persistent distributed crawlers...
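The abstract above only names the behavioral idea, not the algorithm; a minimal sketch of one ingredient, short-term rate analysis over a sliding window, might look like the following (class and parameter names are hypothetical, and PathMarker itself additionally embeds markers in URL paths):

```python
from collections import deque

class RateBasedDetector:
    """Toy illustration of short-term behavior analysis: flag clients
    whose request rate within a sliding window exceeds a threshold a
    human browser would plausibly stay under."""

    def __init__(self, window_seconds=10.0, max_requests=20):
        self.window = window_seconds
        self.max_requests = max_requests
        self.history = {}  # client_id -> deque of request timestamps

    def observe(self, client_id, timestamp):
        q = self.history.setdefault(client_id, deque())
        q.append(timestamp)
        # Drop requests that have fallen outside the sliding window.
        while q and timestamp - q[0] > self.window:
            q.popleft()
        return len(q) > self.max_requests  # True -> likely a crawler
```

A long-term counterpart would aggregate the same statistics over hours or days per account, which is where persistent distributed crawlers become distinguishable from bursts of normal use.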
With the rise of Internet technology and the development of mobile applications, more and more data and information surround us. However, it is not always easy to find the information people need. Therefore, a good recommendation system is required to deliver useful or interesting information. To provide useful information to users, a good classification of data is needed for recommendation...
In order to manage and organize information on the web, we propose a novel web page classification strategy integrating a topic model and SVM. We use the topic model to harness the implicit information on web pages for feature extraction. The accuracy of the strategy is 84.15%, which is 2.23% higher than the traditional classification strategy based on CHI.
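The CHI baseline mentioned above commonly refers to chi-square feature selection, which scores how strongly a term's presence depends on a class. Assuming the standard 2x2 contingency-table form (the abstract does not spell out the variant used), a minimal sketch:

```python
def chi_square(a, b, c, d):
    """CHI score for a term/class pair from a 2x2 contingency table:
    a = docs in class containing the term, b = other docs containing it,
    c = docs in class without the term,  d = other docs without it.
    Higher scores mean the term is more informative for the class."""
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0
```

A term occurring independently of the class scores 0, while a term appearing only inside the class scores highest; feature selection keeps the top-scoring terms per class.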
Thanks to the proliferation of the Internet, a great deal of data is produced by both websites and personal users. Documents must be classified by their content so that the necessary information can be reached quickly and correctly from the produced data. One of the biggest difficulties in document classification systems is detecting the attributes that best represent the classes. In this research,...
Web spam pollutes search engine results and decreases the usefulness of search engines. Web spam can be classified according to the methods used to raise a web page's ranking by subverting the algorithms search engines use to rank results. The main types are content spam, link spam, and cloaking spam. There has been little or no work on automatically classifying web spam...
Support Vector Machine (SVM) is a powerful classifier used widely in textual and web classification. It tries to find a hyperplane that separates positive and negative data while maximizing the margin. SVM is a kernel-based classifier, and the choice of kernel is critical. In this paper we propose an implicit-links-based Gaussian kernel that uses an implicit-links-based distance. This kernel helps...
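The construction the abstract describes, a Gaussian kernel parameterized by a custom distance, has the general form k(x, y) = exp(-d(x, y)^2 / (2*sigma^2)). The paper's implicit-links-based distance is not given in the abstract, so the sketch below plugs in an ordinary Euclidean distance purely as a stand-in:

```python
import math

def gaussian_kernel(dist, sigma=1.0):
    """Build a Gaussian (RBF) kernel from an arbitrary distance function:
    k(x, y) = exp(-dist(x, y)**2 / (2 * sigma**2))."""
    def k(x, y):
        return math.exp(-dist(x, y) ** 2 / (2.0 * sigma ** 2))
    return k

# Stand-in distance; the paper's distance is derived from implicit
# links between pages, which the abstract does not define.
def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

k = gaussian_kernel(euclidean, sigma=1.0)
```

Any symmetric distance can be swapped in; whether the resulting kernel is positive semi-definite (as SVM training assumes) depends on the distance chosen, which is one reason the kernel choice is described as critical.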
With the rapid development of the Internet, people's demand for Internet retrieval is gradually increasing. A meta-search engine differs from a general search engine: it combines the results of multiple search engines and returns them to the user. To meet the needs of different users, however, we need to classify the results returned by the meta-search engine. Therefore, this article discusses...
Classification and extraction of web content find applications in the semantic web, searching, and information extraction. The first part of the paper deals with the problem of classifying web pages according to their content. It then presents a methodology to classify web pages hierarchically, in order to achieve topic-wise modeling of websites, using a multi-label tree classifier, a variant of classification where...
Traditionally in Web crawling, the required features are extracted from the whole content of HTML pages. However, the position of a word within the HTML tags indicates its importance in the web page. This research proposes two ideas concerning the feature selection stage for HTML web pages. The first idea reduces the features by simply extracting them from the important tags in an HTML...
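A small sketch of the first idea, restricting term extraction to "important" tags and weighting terms by the tag they appear in; the specific tag set and weights below are hypothetical, not taken from the paper:

```python
from html.parser import HTMLParser

# Hypothetical weights: terms inside prominent tags count more.
TAG_WEIGHTS = {"title": 3, "h1": 2, "b": 2}

class TagTermExtractor(HTMLParser):
    """Collect term counts only from 'important' HTML tags, weighting
    each occurrence by its enclosing tag."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open important tags
        self.counts = {}  # term -> weighted count

    def handle_starttag(self, tag, attrs):
        if tag in TAG_WEIGHTS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()

    def handle_data(self, data):
        if not self.stack:
            return  # text outside important tags is ignored entirely
        weight = TAG_WEIGHTS[self.stack[-1]]
        for word in data.lower().split():
            self.counts[word] = self.counts.get(word, 0) + weight
```

Because body text outside the chosen tags is skipped, the feature space shrinks while the surviving terms carry positional importance.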
The explosive growth of the number of webpages on the Web has brought up some problems in the search process. One of these problems is that general-purpose search engines often return too many irrelevant results when users are searching for specific information on a given topic. Another problem is the massive increase in the number of pages to be indexed by Web search systems. In this research, two steps...
Classification of web content is an interesting and widely pursued field of research in machine learning. Web classification can be done in various ways based upon the criteria chosen. Subjective classification involves classifying web pages based upon the subject to which they belong (say history, economics, politics, etc.). Another way of classifying web pages could be based upon...
The motivation behind this work is that predicting a web user's browsing behavior while surfing the Internet reduces the user's browsing access time, avoids visits to unnecessary pages, and eases network traffic. This research work introduces parallel Support Vector Machines for web page prediction. The web contains an enormous amount of data, and web data increases exponentially, but the training...
A huge amount of user request data is generated in web logs. Predicting users' future requests based on previously visited pages is important for web page recommendation, latency reduction, online advertising, etc. These applications must trade off prediction accuracy against modelling complexity. We propose a Web Navigation Prediction Framework for Webpage Recommendation (WNPWR) which creates and generates...
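To make the accuracy/complexity trade-off concrete, the simplest common baseline for next-request prediction from web-log sessions is a first-order Markov model; the sketch below is that baseline, not the WNPWR framework itself:

```python
from collections import Counter, defaultdict

class FirstOrderPredictor:
    """First-order Markov next-page predictor trained on web-log
    sessions: predict the page that most often followed the current
    page in the training data."""

    def __init__(self):
        self.next_counts = defaultdict(Counter)

    def train(self, sessions):
        for session in sessions:
            for cur, nxt in zip(session, session[1:]):
                self.next_counts[cur][nxt] += 1

    def predict(self, page):
        counts = self.next_counts.get(page)
        if not counts:
            return None  # page never seen as a predecessor
        return counts.most_common(1)[0][0]
```

Higher-order models condition on longer page histories and usually predict better, at the cost of state space and training data, which is exactly the compromise the abstract refers to.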
Currently, there are many e-commerce websites on the Internet. These e-commerce websites can be categorized into many types, one of which is C2C (Customer to Customer) websites such as eBay and Amazon. The main objective of a C2C website is an online marketplace where everyone can buy or sell anything at any time. Since there are a lot of products on e-commerce websites and each...
This paper proposes an event data extraction method that extracts business event data, such as coupons, tickets, and sales campaigns, from the homepages or blogs of shops and pushes them to users. Users no longer need to browse their favorite shops' homepages one by one. The method supports comprehensive and effective event data collection. The proposed method consists of two tasks: web page...
These days, the Internet is developing at an exponential rate and can cover just about any data required. Nonetheless, the immense number of web pages makes it more difficult for a user to effectively discover the target data. Therefore, an efficient method for classifying this huge amount of data is essential if web pages are to be exploited to their full potential. In the domain of automatic...
Web spam is one of the recent problems of search engines because it severely reduces the quality of search results. Web spam has an economic impact because spammers obtain a large amount of free advertising for their data or sites on search engines, and thereby an increase in web traffic. In this paper we have implemented a spam detection system based on an SVM classifier that combines new link features with content...
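The abstract is cut off before it names the features, so the sketch below only illustrates the general pattern of combining content-based and link-based features into one vector for an SVM; every feature and the spam-word list are illustrative assumptions, not the paper's features:

```python
import re

# Illustrative spam-word list, not from the paper.
SPAM_WORDS = {"free", "win", "cheap", "bonus"}

def content_features(text):
    """Two toy content features: word count and spam-word ratio."""
    words = re.findall(r"[a-z]+", text.lower())
    n = len(words) or 1
    return [len(words), sum(w in SPAM_WORDS for w in words) / n]

def link_features(out_links, in_links):
    """Toy link features: out-degree, in-degree, and their ratio."""
    return [out_links, in_links, out_links / (in_links + 1)]

def feature_vector(text, out_links, in_links):
    """Concatenate content and link features into one SVM input row."""
    return content_features(text) + link_features(out_links, in_links)
```

The resulting rows would then be fed to any SVM implementation; concatenation is the simplest way to let one classifier weigh both evidence types jointly.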
The information on the World Wide Web is dynamic and growing rapidly. Existing topic-based search engines are not adequate to retrieve the information users require, so there is a need to develop genre-based search engines. First, web genres have to be identified in order to develop genre-based search engines. Presently, there exist a few genre corpora which include web genres like articles, online...
Domain-specific search focuses on one area of knowledge. Applying broad-based ranking algorithms to vertical search domains is not desirable, since the broad-based ranking model builds upon data from multiple domains existing on the web. Vertical search engines attempt to use a focused crawler that indexes only web pages relevant to a predefined topic. With a Ranking Adaptation Model, one can adapt an...