The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
World Wide Web is undoubtedly a human creation. It was facilitated by a number of major inventions. Its continuous development results largely from innovative engineering efforts. The Web viewed as essentially a service provided on the Internet offers many different functions. We discuss information providing functionality offered as “Web search”. Recent trends involve devising schemes to make search...
A prototype system of electronic map interactive Web site from which data information of multi-dimension and multi-layer about the map information from 17 century to 19 century of China can be displayed and obtained has been studied, designed and practiced based on the techniques of Ajax and plug-in components ZoomMap. The functions of ZoomMap are expanded combining with an asynchronous communication...
Malicious web pages are an emerging security concern on the Internet due to their popularity and their potential serious impacts. Detecting and analyzing them is very costly because of their qualities and complexities. There has been some research approaches carried out in order to detect them. The approaches can be classified into two main groups based on their used analysis features: static feature...
Pharming attacks - a sophisticated version of phishing attacks - aim to steal users' credentials by redirecting them to a fraudulent website using DNS-based techniques. Pharming attacks can be performed at the client-side or into the Internet, using complex and well designed techniques that make the attack often imperceptible to the user. With the deployment of broadband connections for Internet access,...
Demographic information plays an important role in gaining valuable insights about a web-site's user-base and is used extensively to target online advertisements and promotions. This paper investigates machine-learning approaches for predicting the demographic attributes of web-sites using information derived from their content and their hyper linked structure and not relying on any information directly...
Mobile web browsing signifies accessing the content on web pages using a mobile device. It is common for Internet search engines to use keyword searching in which rank is assigned to each page based on several features. But it is an arduous task for a user to inscribe a keyword in such a delicate small mobile screen. A challenging research goal is the development of advanced web-based applications...
This paper presents World Wide Web Consortium's (W3C) Geolocation Application Program Interface (API), which is an interface to retrieve geographic location information of a client-side device. Currently Geolocation API is used as implementation in Web browsers, and new coming HTML5 standard. There are few methods how Geolocation API is determining location which are mentioned in this article, and...
Finding knowledge on the Web has long been a hot research issue. Today the Web has become a popular medium for publishing news and opinion articles, which are important carriers of human knowledge, especially of social knowledge. Developing techniques of automatically collecting and analysing these articles on a large scale is thus desirable. In this paper we propose techniques for searching for events...
Nowadays, more than 2 billion people around the world have access to the Internet regularly, and the Internet is the most important platform for information, work and entertainment with more than 150 million active websites. However, these websites are accessible only through devices equipped with a screen and a Graphical User Interface (GUI). Furthermore, these devices need a network connection to...
This paper studies the problem of extracting data from large numbers of semi-structured web pages. The fact that many websites have enormous pages generated dynamically from a underlying structured source like a database makes it feasible to induct a common template for similar web pages and then extract data accordingly. Previous work on this problem has limited practical utility because of either...
With the advent of Web 2.0, an increasing number of web sites has started offering their data over the web in standard formats and exposed their functionality as APIs. A new type of applications has taken advantage of the new data and services available by mixing them, in order to generate new applications fast and efficiently, getting its name from its own architectural style: mashups. A set of applications...
The use of Ajax techniques of websites brings web application with more interactive and dynamic interfaces without refreshing the page. This paper presents Ajax framework, jQuery use on the client and generation of simple response data on the server through implementation of medicinal plants web apps. Also this paper describes the features of data format in asynchronous communication. So we easily...
We present DESP, an automatic data extractor on Deep Web pages for book domain, which can extract data items and label attributes at the same time. The case of DESP is to extract books' information such as title, author, price and publisher from result pages returned from bookstore web sites. Although DESP is for a specific domain, the method used by DESP is highly adaptive and can suit other domains...
Information hiding technology is a hot spot in information security, and is applied in the fields of digital multimedia copyright protection and secret communication. According to the analysis of the characteristics of browser in parsing HTML of the web page and the little capacity available for information hided in web page, a new efficient web page information hiding method with tag attributes has...
The information on the Internet has been grown exponentially, the Internet users are overwhelmed by these information. How to automatically extract useful information from the relevant pages, so as to provide a convenient and rapid information query platform for the users, is an important issue. In this paper, based on simple tree matching algorithm, we present a Web data extraction method based on...
Internet is a huge source of information. Search engines have indexed much of this information and are able to extract the relevant webpages that are related to a given query. However, once the search engine retrieves a set of webpages, the user has to read the webpages in order to find the relevant information. This is a time consuming task because webpages often mix information related to different...
Accurate reference metadata extraction becomes an intriguing task to researchers who want to collect data of scientific publications. In this paper, we introduce an approach to extracting the reference metadata based on regular expressions. A prototype system named “Goldrusher” is created which automatically extracts data from the website of Association for Computing Machinery (ACM). The experimental...
Mining of association rules is an important research topic in web usage mining. The purpose of this paper is to research how to dig interesting association rules effectively from the Web logs after been preprocessed. Firstly, using the FP-growth algorithm for processing the web log records, obtaining a set of frequent access patterns, then using the combination of browse interestingness and site topology...
There are two shortages when the method of classification based on association rules is applied to classify the web documents: one is that the method process the web document as a plain text, ignoring the HTML tags information of the web page; another is that either item of the association rules is only the word in the web page, without considering the weight of the word, or it quantifies the weight...
The modern Web architecture basically follows the Representational State Transfer (REST) style. This style offers the architectural properties necessary to implement the Internet-scale Web. However, most authentication and delegation technologies that rely on session state actually deviate from the REST style. It must be noted, however, that the diversity of these technologies is imperative for the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.