The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Top-k document retrieval, which returns highly relevant documents relative to a query, is an essential task for many applications. One of the promising index frameworks is built by FM-index and wavelet tree for supporting efficient top-k document retrieval. The index, however, has difficulty on handling document frequency (DF) at search time because indexed terms are all substrings of a document collection...
Over the last few years, indoor localization has been a very dynamic research area that has drawn great attention. Many methods have been proposed for indoor positioning as well as navigation services. A big number of them were based on Radio frequency (RF) technology and Radio Signal Strength Indicator (RSSI) for their simplicity of use. The main issues of the studies conducted in this field are...
The rising of the modern Internet brought with it heap opportunities for attackers to gain illegal benefit from spreading spam mail. Spam is irrelevant or inappropriate messages sent on the Internet to a large number of recipients. Many researchers use a large number of classification method in machine learning to filter spam messages. But, there is still limited research which evaluate the use of...
Multibiometrics provides high recognition accuracy and population coverage by combining different biometric sources. However, some multibiometrics may obtain smaller-than-expected improvement of recognition accuracy if the combined biometric sources are dependent in terms of a false acceptance by mistakenly perceiving biometric features from two different persons as being from the same person. In...
The next generation intelligent devices need to understand and evolve with the user. Towards this goal, we present a User Graph generation framework that models user's level of interest and knowledge across a set of categories. The user graph is built through an unsupervised and semi-supervised topic modeling process, using latent semantic analysis technology. The self-evolving framework utilizes...
Semantic analysis often uses a pipeline of Natural Language Processing (NLP) tools such as part-of-speech (POS) tagging. Brill tagging is a classic rule-based algorithm for POS tagging within NLP. However, implementation of the tagger is inherently slow on conventional Von Neumann architectures. In this paper, we accelerate the second stage of Brill tagging on the Micron Automata Processor, a new...
Due to the growing number of unlabeled documents, it is becoming important to develop unsupervised methods capable of automatically extracting information. Topic models and neural networks represent two such methods, and parameter approximation algorithms are typically employed to estimate the parameters because it is not possible precisely to compute the parameters when using these methods. One of...
This paper presents a formal analysis of multiple popular approximate counting schemes that employ the conservative update policy, such as CU-Sketch and Minimal Increment Spectral Bloom Filters, under a unified framework. It is also shown that when applied to items picked from a skewed distribution, such as Zipf-like functions, the analysis follows very closely empirical results obtained through simulations...
The aim of clustering is to discover the clusters based on the similarity features of objects. The present algorithm of visual access tendency (VAT) can access an exact number of clusters by its VAT image. The VAT image displays the squared shaped dark blocks along the diagonal; number of cluster information is accessed by counting the number of obtaining square blocks. Other extended versions are...
An underwater target classifier can be trained only with the available limited instances of different ship and submarine emanations but in complex real world conditions, the classifier may encounter corrupted versions of the trained instances as well as novel occurrences such as targets belonging to an entirely different class. Most of the state-of-the art underwater target classifiers assign observed...
A Rough Set (RS) based dataset reduction method using SWARM optimization algorithm and a cluster validation function is proposed. In the proposed approach, the user specifies the classification quality required in advance, and the method then finds the attribute reducts and perform attribute discretization to satisfy the desired quality of classification. While many other solutions are possible, the...
In this study the usability of routinely measured meteorological parameters to estimate the global solar radiation is investigated. The proposed models are in the form of polynomials. The parameters such as ratio of duration of sunshine to maximum sunshine hours, mean temperature and mean relative humidity are used. Different combinations of these parameter sets have been used in proposing the monthly...
In spite of wide use of projection-based features in handwritten character recognition of several languages, its implementation was somewhat scanty in Bangla handwritten character recognition. This paper introduces the usage of projection profile features in recognizing handwritten Bangla basic characters. Alongside it also demonstrates a qualitative and quantitative analysis to visualize the effect...
Improving branch prediction accuracy is essential in enabling high-performance processors to find more concurrency and to improve energy efficiency by reducing wrong path instruction execution, a paramount concern in today's power-constrained computing landscape. Branch prediction traditionally considers past branch outcomes as a linear, continuous bit stream through which it searches for patterns...
In the last decade we have witnessed a huge increase of interest in data stream learning algorithms. A stream is an ordered sequence of data records. It is characterized by properties such as the potentially infinite and rapid flow of instances. However, a property that is common to various application domains and is frequently disregarded is the very high fluctuating data rates. In domains with fluctuating...
We proposed and evaluated an estimation method for the forced selection Japanese Diagnostic Rhyme Test (DRT). The proposed measure takes into account the forced selection manner of the DRT from a pair of rhyming words. The objective distance measure used here was based on the Articulation index Band Correlation (ABC), which showed favorable results for the English Modified Rhyme Test (MRT). The correlation...
This paper presents an effective re-ranking method that uses learning-to-rank paradigms to improve the accuracy of landmark-based audio fingerprinting (AFP) for audio music retrieval. The re-ranking mechanism is invoked whenever the returned ranking from an AFP system does not have a high enough confidence measure. We propose that use of new features for re-ranking, and employ the popular learning-to-rank...
Automatic personal identi����cation has become an important issue in several applications, such as physical buildings and information systems. Nowadays, biometric techniques are an important and effective solution for automatic personal identi����cation. One of the most popular biometric systems is based on the hand due to its ease of use. Hand has several modalities to be extracted, among them, Finger-Knuckle-Print...
Segmentation of multi-panel figure caption into subfigure captions and text labels is a problem that arises in a number of applications particularly in estimating the total number of panels in the multi-panel figure. The results of the existing methods for solving this problem are unsatisfactory and need improvement. Moreover, these methods are computationally expensive. In this paper, we propose...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.