The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Recent library digitization projects attempt to provide large collections of printed material from varying sources in a searchable format. The scanned documents are typically processed using Optical Character Recognition (OCR), which typically introduces errors in the text. This paper proposes a technique for correction of OCR degraded text that is independent of character-level OCR errors, and hence...
Non-negative matrix factorization is an important method helpful in the analysis of high dimensional datasets. It has a number of applications including pattern recognition, data clustering, information retrieval or computer security. One its significant drawback lies in its computational complexity. In this paper, we introduce a new method allowing fast approximate transformation from input space...
An evaluation study performed in Arabic language on the five web search engines Araby, Ayna, Google, MSN and Yahoo aimed to compare how good these search engines can satisfy the information needs of native Arab users on the internet in their mother tongue. The top ten search results for fifty randomly selected search queries and the descriptions of these results in the search results list were evaluated...
A novel solution is proposed to an important problem of learning real querying preferences and intentions from users who need to retrieve interesting information from a database but are not in a position to specify their information needs and/or intentions using a query language due to lack of knowledge and/or experience. A solution is proposed that is based on the presentation to the user of consecutive...
The extraction of temporal information from text documents is becoming increasingly important in many applications such as natural language processing, information retrieval, question answering, etc. Indeed, the temporal dimension plays a key role on most of these systems, promoting better performance. Our goal is the definition of a temporal document representation, incorporating the time dimension...
In this paper, we describe an alternative method of the recognition of human irises with the usage of Non-Negative Matrix Factorization. The proposed method has been implemented on graphic processor unit (GPU) which makes the method usable in the real world due to short computation time.
The fundamental model for Web navigation has not changed much since the beginning of the development of Hypertext and Web search engines. Current browsing allows users to search by formulating queries, entering known URLs, and by navigation by following links embedded in webpages. Considerable research has focused on navigation mechanisms to improve the effectiveness of the process of finding relevant...
The biclustering problem consists in simultaneously clustering rows and columns of a data matrix. The aim of this paper is to empirically assess the performance of cooperative coevolution as an alternative approach for coping with the task of discovering good and sizeable biclusters. For this purpose, two cooperative coevolutionary algorithms, one configured with genetic algorithms (GAs) and another...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.