A huge number of entities and their relationships are posted on the Web. These entities and their relationship networks support many activities. In this paper, we focus on the task of extracting academic entity networks from homepages. Homepages usually contain many entities, such as persons, conferences/journals, and organizations, and their relationships. However, homepages don't follow a unified layout...
The International Organization of Supreme Audit Institutions (INTOSAI) recognizes that disclosure has become crucial to the success of the work of Supreme Audit Institutions (SAIs). This article aims to analyze whether the Spanish SAIs have implemented new information technology, and more specifically the Internet, as a means to improve both the transparency of their actions as interaction...
This paper focuses on using fuzzy-theory detection to record how interested a user is in the contents displayed on a web page, and on using data mining technology to find out their relationship as well as the rules for categorizing websites online into related communities. Then, proper adaptations of the web contents will be made in accordance with the most suitable positions of advertisements not...
Lone wolf terrorists pose a large threat to modern society. The current ability to identify and stop this kind of terrorist before they commit an act of terror is limited, since they are very hard to detect using traditional methods. However, these individuals often make use of the Internet to spread their beliefs and opinions, and to obtain information and knowledge to plan an attack. Therefore, there is...
This research gathered web network top-level domain (TLD) interlinkage among Muslim Middle East and North African Nations (MMENANs) in December 2010 and in April 2011, constituting before and after measures with respect to the 2011 Muslim Middle-East (MMENA) uprisings between these time points. This constitutes a naturalistic field experiment, with the uprisings occurring before April serving as...
Companies that are present at multiple online locations may have difficulty staffing them adequately with helpdesk services while using human resources efficiently. In this paper, a technological architecture is presented that allows a team of people to staff several online locations simultaneously, giving customers/users an indication that a human is present to attend to them. We also present details...
In this study, we address the problem of searching for experts on an arbitrary topic on the Web. In particular, we propose two methods that analyze the expertise of information senders of Web pages: 1) a method that computes an expertise score based on the hit count from a search engine (hit count method), and 2) a method that computes an expertise score based on the number of documents that are attributed to an...
Web-scale relation extraction is crucial to building Web people search engines. Previous extraction models, such as Snowball, focus only on single-type extraction, while real applications always require as many relation types as possible. In this paper, we propose a novel Web-scale relation extraction framework, Multi-Type Snowball (MultiSnowball). MultiSnowball aims at extracting multiple...
By moving from its original host-centric architecture to a new information-centric organization, the Internet will be able to offer new services and applications to end users, allowing for example the on-demand composition of a new service from those already available online. However, this requires the development of a Network of Information to offer users the possibility to annotate, discover and...
This paper aims to investigate the additional information value provided by user-created social tags and author-provided metadata as well as their effectiveness in facilitating Web clustering and discovery. We collected a data set of Web pages that includes both social tags from the del.icio.us website and author-provided metadata crawled from the internet. Based on this data set, we first checked...
Finding information about people using search engines is one of the most common activities on the Web. However, search engines usually return a long list of Web pages, which may be relevant to many namesakes, especially given the explosive growth of Web data. To address the challenge caused by name ambiguity in Web people search, this paper proposes a novel graph-based framework, GRAPE (abbr. a graph-based...
To meet the needs of monitoring and recording telephone calls and to strengthen supervision over safety-production for the control centers of organizations, a Web-based VoIP recording management system has been developed. This system realizes several management modules, including management of devices, channels, call records, recorded files, system settings, and the system auxiliary functions for user...
The objective of this paper is to propose new Web metrics based on a page's usability and effectiveness for users of different Web domains, assessed through its available utilities and its visual appearance. Earlier metrics, which concentrated on the number of visits, cost per visit, and clicks-to-download ratio, are not sufficient to fully assess the Web quality and the impact of the page features on...
Both classification and ranking strategies have been reported to perform well in mining named entity (NE) translations from the snippets returned by a Web search engine. Taking the most challenging issue of the organization name and its translation as an example, this paper conducts a contrastive study of the two strategies under the SVM framework. We empirically show that the method of translation ranking...
Event relationship extraction is a new research domain that has attracted more and more attention. This is because the relationships among different events can provide a lot of important information in special fields such as national defense, crime-solving, or anti-terrorism. But there are innumerable event description Web pages on the Internet and the relationships among them are perplexing, so...
Information about individuals on publicly available Web sites stands as a valuable, yet unorganized, data source. Turning such an enormous data source into a "database" is highly desirable as it has the potential to lead to novel ways of using the available information to the largest extent. In this paper, we present PopulusLog, a novel Web data mining system. PopulusLog is a pioneering example...
Based on granular computing theory, this paper proposes a new granularity model of Web structure, defines a new concept called Webpage granularity, and presents some associated factors that affect the organizational structure of a website. Finally, an example is given to calculate and verify the model.
One promising application of natural language processing (NLP) research is in the area of information extraction (IE). In this paper, we present the workflow of our IE system for the extraction of semantically rich information from unstructured or semi-structured Chinese web pages. A knowledge engineering approach and an automatic training approach are used to extract patterns and build a knowledge repository...
Uncertainty over future oil prices, concerns over energy security, and increasing agreement over an urgent need to reduce global greenhouse gas emissions to avoid unacceptable temperature increases and climate change have combined to generate strong interest in low emission energy supply options, including increased efficiency of energy use and renewable energy technologies. These energy options,...
This study investigates how undergraduate and graduate students search the web for consumer health information. The 32 participants were asked to find answers to four health-related topics. Data was collected through pre- and post-search questionnaires, a think-aloud protocol, and transaction logs. The results presented focus on the search process, both as a whole and by question, and on user satisfaction.