The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The entropy measurement function is a central element of decision forest induction. The Shannon entropy and other generalized entropies such as the R?nyi and Tsallis entropy are designed to fulfill the Khinchin-Shannon axioms. Whereas these axioms are appropriate for physical systems, they do not necessarily model well the artificial system of decision forest induction. In this paper, we show that...
In course of a breaking news event, such as natural calamity, political uproar etc., a massive crowd sourced data is generated over social media which makes social media platforms an important source of information in such scenarios. The value of the information being propagated via social media is being increasingly realised by the news organisations and the journalists. Better tools and methodologies...
In this paper, we present a feasibility study for using a single Microsoft Kinect sensor to assess the quality of rehabilitation exercises. Unlike competing studies that have focused on the validation of the accuracy of Kinect motion sensing data at the level of joint positions, joint angles, and displacement of joints, we take a rule based approach. The advantage of our approach is that it provides...
Learning in non-stationary environments is not an easy task and requires a distinctive approach. The learning model must not only have the ability to continuously learn, but also the ability to acquired new concepts and forget the old ones. Additionally, given the significant importance that social networks gained as information networks, there is an ever-growing interest in the extraction of complex...
Retrieval engines provide results according to user request. Nevertheless, reaching satisfaction can not be guaranteed with simple retrieval step. Therefore, it is necessary to communicate this dissatisfaction to the system through relevance feedback techniques. Indeed, with the growing number of image collections and by applying approximate nearest neighbor (ANN) algorithms to resolve the curse of...
The paper proposes CONSIFT descriptors which are the rotation-variant modification of SIFT (primarily for affine-invariant keypoints). CONSIFT of a keypoint K is its SIFT computed relatively to the orientation defined by the location of another keypoint L (and concatenated with similarly computed SIFT for keypoint L relatively to the location of K). It is additionally recommended that K and L are...
In this paper we focus on the evaluation of the deformable part model (DPM) proposed by Felzenszwalb et al. [IO] in the context of vision-based people detection in heavy machines applications. The proposed system uses a single fisheye camera to provide a wide field-of-view (FOV) at low cost. However, the fisheye optical distortions present several difficulties for image processing and object recognition...
Can we learn from the unknown? Logical data sets of the ternary kind are often found in information systems. They contain unknown as well as true/false values. An unknown value may represent a missing entry (lost or indeterminable) or something with meaning, like a "Don't Know" response in a questionnaire. In this paper we introduce an effectively- and efficiently-superior algorithm for...
Discretization of streaming data has received surprisingly little attention. This might be because streaming data require incremental discretization with cut points that may vary over time and this is perceived as undesirable. We argue, to the contrary, that it can be desirable for a discretization to evolve in synchronization with an evolving data stream, even when the learner assumes that attribute...
Recent advances in distance function learning have demonstrated that learning a good distance metric can greatly improve the performance in a wide variety of tasks in data mining and web search. A major problem in such scenarios is the limited labeled knowledge available for learning the user intentions. Furthermore, distances are inherently local, where a single global distance function may not capture...
Food-related photos have become increasingly very popular, due to social networks, food recommendation and dietary assessment systems. Reliable annotation is essential in those systems, but user-contributed tags are often non-informative and inconsistent, and unconstrained automatic food recognition still has relatively low accuracy. Most works focus on exploiting only the visual content while ignoring...
In this paper, we propose the idea of hybrid cooperative co-evolution (hCC). In CC, multiple instances of the same evolutionary algorithm work in parallel, each optimizes a different subset of the problem in hand. In recent years, different approaches have been introduced to divide the problem variables into separate groups based on the property of separability. The idea is that when dependent variables...
In this paper we propose a strategy to use messages posted in a blogging platform for real-time sensing of traffic-related information. Specifically, we use the data that appear in a blog, in Portuguese language, which is managed by a Brazilian daily newspaper on its online edition. We propose a framework based on two modules to infer the location and traffic condition from unstructured, non georeferenced...
Ambient Assisted Living (AAL) is considered as the main technological solution that will enable the aged and people in recovery to maintain their independence and a consequent high quality of life for a longer period of time than would otherwise be the case. This goal is achieved by monitoring human's activities and deploying the appropriate collection of services to set environmental features and...
This paper addresses multi-objective problems with fuzzy data which are expressed by means of triangular fuzzy numbers. In our previous work, we have proposed a fuzzy Pareto approach for ranking the generated triangular-valued functions. Then, since the classical multi-objective optimization methods can only use crisp values, we have applied a defuzzification process. In this paper, we propose a fuzzy...
In this paper, we addressed the Named Entity Recognition (NER) problem for morphologically rich languages by employing a semi-supervised learning approach based on neural networks. We adopted a fast unsupervised method for learning continuous vector representations of words, and used these representations along with language independent features to develop a NER system. We evaluated our system for...
This paper proposes a robust minutiae based fingerprint image hashing technique. The idea is to incorporate the orientation and descriptor in the minutiae of fingerprint images using SIFT-Harris feature points. A recent shape context based perceptual hashing method has been compared against the proposed technique. Experimentally, the proposed technique has been shown to deliver better robustness against...
Given an undirected graph G, a specific node r, and capacity on the nodes, the maximum bounded r-tree problem consists of finding a tree of G rooted at r containing as many nodes as possible with respect to the node capacities. This NP-hard optimization problem has been recently considered in the context of peer-to-peer networks. In this work, we study the associated polytope, in the space of edge...
Creating sufficiently large annotated resources for supervised machine learning, and doing so for every problem and every domain, is prohibitively expensive. Techniques that leverage large amounts of unlabeled data, which are often readily available, may decrease the amount of data that needs to be annotated to obtain a certain level of performance, as well as improve performance when large annotated...
As an important branch of biomedical information extraction, Protein-Protein Interaction extraction (PPIe) from biomedical literatures has been widely researched, and machine learning methods have achieved great success for this task. However, the word feature generally adopted in the existing methods suffers badly from vocabulary gap and data sparseness, weakening the classification performance....
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.