The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The increasing volume of spam has become a serious threat not only to the Internet, but also to the society. However, it's a great challenge to discover the spam from the Internet effectively and efficiently. Content-based filtering is one of the mainstream methods to solve the problem. This paper proposed a content based spam topic detection strategy through keyword extraction. In particular, spam...
With the rapid growth of e-commerce, there has been millions of products in a large ecommerce site where customer unable to effectively choose the products they are exposed to. To overcome the product overload problem, a variety of recommendation methods have been developed. Collaborative filtering (CF) is the most successful recommendation method. However, the CF method has two well-known limitations,...
Web based applications are increasing at an enormous speed and consequently its users are also increasing at an exponential speed. The evolutionary changes in technology have made it possible to capture the users' essence and interactions with web applications through web server log file as web usage. The web usage Mining (WUM) is the process of discovering hidden patterns from the web usage. Due...
In the past decade the massive growth of the Internet brought huge changes in the way humans live their daily life; however, the biggest concern with rapid growth of digital information is how to efficiently manage and filter unwanted data. In this paper, we propose a method for managing RSS feeds from various news websites. A Web service was developed to provide filtered news items extracted from...
Nowadays, the emphasis on Web 2.0 is specially focused on user generated content, data sharing and collaboration activities. Protocols like RSS (Really Simple Syndication) allow users to get structured web information in a simple way, display changes in summary form and stay updated about news headlines of interest. In the e-Learning domain, RSS feeds meet demand for didactic activities from learners...
In recent years, the blog has become the most typical social media for citizens to share their opinions. In addition, a large number of blogs reflect current social trends or major issues. Especially, more than thousand articles and more than 10,000 responding messages (comments) are registered on a well-known blog in a day. It is hard to search and explore useful messages on blogs since most blog...
With more and more reviews on the Web, browsing through a mass of the related reviews becomes a heavy work. How to effectively analyzing and organizing these reviews attracts more attention. This paper pursues on the analysis of product reviews. It focuses on the product features that customer commented on and also whether their opinions are positive or negative. Different from the traditional method,...
In social news services, selecting valuable and credible news content is one of the most important issues. In traditional journalism, a small number of people called editors selected news that they considered worthwhile. Recently, several services have utilized reader voting to find news that is popular or credible but most of them are prone to abuse. In this paper, we present a collaborative news...
We introduce a novel set of social network analysis based algorithms for mining the Web, blogs, and online forums to identify trends and find the people launching these new trends. These algorithms have been implemented in Condor, a software system for predictive search and analysis of the Web and especially social networks. Algorithms include the temporal computation of network centrality measures,...
This paper presents a quantitative study of the use of the Wikipedia system by its users (both readers and editors), with special focus on the identification of time and kind-of-use patterns, characterization of traffic and workload, and comparative analysis of different language editions. The basis of the study is the filtering and analysis of a large sample of the requests directed to the Wikimedia...
Web 2.0-based IPTV is a new Internet protocol television (IPTV) infrastructure that allows users to participate in content creation and consumption through Web-based communities that are formed based on user interests. However, there are some limitations in making users actively participate in creating and utilizing communities. First, users need to explicitly create and manage their communities....
Web page de-duplication module is an important part of search engine system, which can improve its performance and quality with filtering the Web pages downloaded by crawler system of search engine and eliminating the duplicated Web pages. This paper from the source of duplicated Web pages - reshipment proposes a Web page de-duplication method that the information including original Web sites and...
The current active information service system focuses too much on the study of personalized modeling, and there is less research on artificial participation, automatic information analysis and automatic information update. According to the requirement of active information service to topical Web, the article integrate agent technology, information filtering technology, crawling technology on Heritrix...
Internet has brought about the problem of "information overload". How to liberate the journalists from the heavy network news filtering works is a significant challenge. In this paper a new information filtering method based on S-K (social learning and knowledge base) model is brought forward through the research of the existing information filtering technology. This approach can lead the...
The technologies for telecommunication, GPS, and mobile GIS have rapidly progressed in the last few years. They will allow us to access to the Internet at any time and any place. A user needs to acquire information easily and correctly through a web browser on a cell-phone or a PDA. Our purpose is to build a system providing blog pages written about a facility for a user who is standing in front of...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.