Terrorist activity is no longer confined to the real world; as technology has developed, it has spread to the cyber world. Terrorist activity in the cyber world is called cyber terrorism. One method for detecting cyber terrorism is to apply data mining algorithms to the textual content of terrorism-related web pages. Web mining is a technology for extracting information from the web. By using web mining, cyber...
In this paper, we propose a framework for answering opinion-type questions. The data source is the set of web pages returned by a search engine. Using a Bayes classifier, the main text of each page is classified at sentence level into three categories: positive review, negative review and neutral review. The K-means method is used to cluster the sentences of positive review and negative review respectively...
An approach supervised by ontology is proposed for Web information extraction after analyzing two types of methods based on wrapper and concept model. Using concepts and taxonomy relation between concepts provided by ontology, this method can locate the wanted information blocks in Web page quickly by judging if adjacent sub-trees which are included in HTML Tree are isomorphic. Furthermore, combining...
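The adjacent sub-tree comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes an HTML tree is given as nested `(tag, children)` tuples and treats two sub-trees as isomorphic when their tag structure matches in order; the abstract does not state the paper's exact criterion, and the names `signature` and `repeated_sibling_runs` are hypothetical.

```python
def signature(node):
    """Return a hashable description of a sub-tree's tag structure."""
    tag, children = node
    return (tag, tuple(signature(child) for child in children))


def repeated_sibling_runs(node, min_run=2):
    """Find runs of adjacent children that share the same structure.

    Returns a list of (parent_tag, start, end) with `end` exclusive.
    Assumption: "isomorphic" means identical ordered tag structure.
    """
    tag, children = node
    runs = []
    i = 0
    while i < len(children):
        j = i + 1
        # Extend the run while the next sibling has the same structure.
        while j < len(children) and signature(children[j]) == signature(children[i]):
            j += 1
        if j - i >= min_run:
            runs.append((tag, i, j))
        i = j
    # Recurse so that repeated blocks deeper in the tree are also found.
    for child in children:
        runs.extend(repeated_sibling_runs(child, min_run))
    return runs
```

Runs of structurally identical siblings are typical of record lists (for example, product rows), which is why they are plausible candidates for information blocks.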
The Semantic Web has made possible the use of the Internet to extract useful content, a task that could necessitate an infrastructure across the Web. With Hadoop, a free implementation of the MapReduce programming paradigm created by Google, we can treat these data reliably over hundreds of servers. This article describes how the Apriori algorithm was adapted to MapReduce in the search for relations...
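As a rough illustration of how Apriori support counting can be phrased as map and reduce steps, the sketch below runs both phases in a single Python process. It is not the article's Hadoop implementation; the function names, the in-memory transaction list, and the `min_support` parameter are assumptions made for the example.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(transaction, k, frequent_prev=None):
    """Emit (candidate, 1) for each k-item candidate in one transaction."""
    for candidate in combinations(sorted(set(transaction)), k):
        # Apriori pruning: every (k-1)-subset must already be frequent.
        if frequent_prev is not None and any(
            sub not in frequent_prev for sub in combinations(candidate, k - 1)
        ):
            continue
        yield candidate, 1


def reduce_phase(pairs, min_support):
    """Sum the counts per candidate and keep those meeting min_support."""
    counts = defaultdict(int)
    for candidate, n in pairs:
        counts[candidate] += n
    return {c: n for c, n in counts.items() if n >= min_support}


def apriori(transactions, min_support):
    """Run one map/reduce round per itemset size until nothing is frequent."""
    result, frequent_prev, k = {}, None, 1
    while True:
        pairs = (p for t in transactions for p in map_phase(t, k, frequent_prev))
        frequent = reduce_phase(pairs, min_support)
        if not frequent:
            return result
        result.update(frequent)
        frequent_prev = set(frequent)
        k += 1
```

In a real MapReduce job, the frequent itemsets from the previous round would have to be distributed to every mapper; here they are simply passed as an argument.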
At present, the scale and diversity of Web information are immense. Acquiring Web information by relying on search engines alone is increasingly unable to meet user needs, so Web information extraction (WebIE) technology is attracting wide attention. In this paper, a framework for an agent-based distributed multi-slot WebIE system is proposed. It includes user agent, mediator agent, wrapper agent,...
Collaborative tagging has emerged as a useful means to organize and share resources on the Web. Recommender systems have utilized tags to identify similar resources and generate personalized recommendations. In this paper, we analyze social and behavioral aspects of a tag-based recommender system which suggests similar Web pages based on the similarity of their tags. Tagging behavior and...
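One simple way to suggest similar pages from their tags is to compare tag sets with Jaccard similarity, sketched below. The abstract does not say which similarity measure the system uses, so Jaccard is an assumption here, as are the names `jaccard` and `similar_pages` and the dictionary layout of the input.

```python
def jaccard(tags_a, tags_b):
    """Jaccard similarity of two tag collections (0.0 if both are empty)."""
    a, b = set(tags_a), set(tags_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similar_pages(target, tags_by_page, top_n=3):
    """Rank the other pages by tag similarity to `target`.

    `tags_by_page` maps a page identifier to its collection of tags.
    Ties are broken by page identifier so the ordering is deterministic.
    """
    scored = [
        (page, jaccard(tags_by_page[target], tags))
        for page, tags in tags_by_page.items()
        if page != target
    ]
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:top_n]
```

Jaccard ignores how often each tag was applied; a weighted measure such as cosine similarity over tag counts would be a natural alternative when that information is available.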
Community-driven Question Answering services are gaining increasing attention with tens of millions of users and hundreds of millions of posts in recent years. Due to its size, there is a need for users to be able to search these large question answer archives and retrieve high quality content. Research work shows that user reputation modeling makes a contribution when incorporated with relevance...
Social tagging is a process in which many users add metadata to shared content. Over the past few years, the popularity of social tagging has grown on the Web. In this paper we investigated the use of social tags for Web page classification: adding new Web pages to an existing Web directory. A Web directory is a general human-edited directory of Web pages. It classifies a collection of pages...
Search engine technology plays an important role in web information retrieval. However, with the explosion of information on the Internet, traditional search techniques cannot provide satisfactory results because of problems such as the huge number of result Web pages and unintuitive ranking. Therefore, the reorganization and post-processing of Web search results have been extensively studied to help users effectively...
In order to identify a user's personal preferences in navigating the Web or to recommend collected Web information, it is very useful to analyze the user's Web-browsing behavior. However, it is difficult to determine which Web-browsing behaviors are influential in predicting a user's interest because each individual has his/her own habits and personal manner of surfing the Web and locating documents...
The current difficulty of information queries on the Web is the mismatch between short, vague user queries and documents that contain a great deal of redundancy and noise. After analyzing the advantages and disadvantages of relevance feedback and pseudo feedback, a new query method based on first-item-clicked feedback was put forward. A measure of first-item-clicked feedback is presented. The experiment...
As computers and computer networks become more sophisticated, a vast amount of information and knowledge has been accumulated and circulated on the Web. They provide people with options regarding their daily lives and are starting to have a strong influence on governmental policies and business management. However, a crucial problem is that information on the Web is not necessarily credible. It is...
We postulate that, due to linguistic and cultural differences, the Web structure and the content of Web documents differ across languages. In this work, we design experiments to study the characteristics of the Chinese Web, and compare them to those of the English Web. We also examine whether these differences in Web characteristics, if identified, have an impact on the effectiveness...