The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper describes an extension of an intelligent acoustic event detection system, which is able to recognize sounds of dangerous events such as breaking glass or gunshot sounds in urban environment from commonly used noise monitoring stations. We propose to extend the system the way that it would not only detect the gunshots, but it would identify a suspects gun/pistol type as well. Such extension...
Fixed structure Mathematical Morphology (MM) operators have been used to detect QRS complexes in the ECG. These schemes are limited by the arbitrary setting of threshold values. Our study aims at extracting QRS complex fiducial points using MM with an adaptive structuring element, on a beat-to-beat basis. The structuring element is updated based on the characteristics of the previously detected QRS...
Current trends in patient care emphasize the overall quality of life as a goal for treatment outcome, therefore various ways of continuous patient monitoring have become increasingly popular in healthcare. Since an average human spends one third of his life asleep, it is apparent that the quality of sleep has an important impact on the overall life quality. The aim of the study was to investigate...
We developed a new, robust and efficient heart beat detector in multimodal data using an ECG signal, and one of the pulsatile signals such as blood pressure (BP), if present. To calculate the detection functions, simple and fast integer-multiplier sampling-frequency adjustable digital filters were developed. Using the morphological smoothing, the ECG and pulsatile-signal detection functions, and the...
A framework that can be used for assessing the suitability of different feature vectors in the task of determining the age similarity between a pair of faces is introduced. This framework involves the use of a dataset containing images displaying compounded types of variation along with the use of an ideal dataset, containing pairs of age-separated face images captured under identical imaging conditions...
Template protection technology can protect the confidentiality of a biometric template by certain conversion. We focus on the key-binding approach for template protection. This approach generates a secure template (or a conversion template) from joint data of a user's specific key with a user's template, and the key can be correctly extracted from the secure template only when a queried biometric...
The voice is most prominent & primary mode of communication among the human beings. With this speech human can communicate with machine, thus this technique is used in education, military and medical sectors. Though this is not the new area, from last few decades researchers are working on the improvement of accuracy in voice recognition system. The design of that system concerns major issues...
Brain Computer Interface (BCI) systems are the devices which are proposed to help the disabled, people who are incapable of making motor response to communicate with computer using brain signal. The aim of BCI is to interpret brain activity into digital form which acts as a command for a computer. One key challenge in current BCI research is how to extract features of random time-varying EEG signals...
In this paper, we propose a novel approach for detecting the text present in videos and scene images based on the Multiscale Weber's Local Descriptor (MWLD). Given an input video, the shots are identified and the key frames are extracted based on their spatio-temporal relationship. From each key frame, we detect the local region information using WLD with different radius and neighborhood relationship...
This paper proposes a novel no-reference Perception-based Image Quality Evaluator (PIQUE) for real-world imagery. A majority of the existing methods for blind image quality assessment rely on opinion-based supervised learning for quality score prediction. Unlike these methods, we propose an opinion unaware methodology that attempts to quantify distortion without the need for any training data. Our...
The best form of identifying a person for criminal investigation is from the fingerprint. Identifying suspects based on latent fingerprint is a procedure that is extremely important to forensics and law enforcement agencies. The small number of minutiae and the noise characteristic of latents make it extremely difficult to automatically match latents to their mated full prints that are stored in law...
Speech recognition and speaker recognition have wide range of applications in security systems and smart home designs. In this paper we discuss a method by which text dependent speaker recognition can be used to control gear shifting in light motor vehicles which could be helpful for people who lost one hand in accidents to drive cars. Speaker recognition involves two processes namely feature extraction...
Latent Low-Rank Representation (Lat LRR) has the empirical capability of identifying "salient" features. However, the reason behind this feature extraction effect is still not understood. Its optimization leads to non-unique solutions and has high computational complexity, limiting its potential in practice. We show that Lat LRR learns a transformation matrix which suppresses the most significant...
We propose a framework to perform multimodal registration of multiple images. In retinal imaging, this alignment enables the physician to correlate the features across modalities, which can help formulate a diagnosis. The images appear very different and there are few reliable modality-invariant features. We base our registration on the salient line structures extracted with a tensor-voting approach...
We approach the challenging problem of generating highlights from sports broadcasts utilizing audio information only. A language-independent, multi-stage classification approach is employed for detection of key acoustic events which then act as a platform for summarization of highlight scenes. Objective results and human experience indicate that our system is highly efficient.
Brain Computer Interface (BCI) aims at providing an alternate means of communication and control to people with severe cognitive or sensory-motor disabilities. These systems are based on the single trial recognition of different mental states or tasks from the brain activity. This paper discusses the major components involved in developing a Brain Computer Interface system which includes the modality...
Image edge detection is sensitive to noise which is contained by natural images so that it affects the quality of the image segmentation. In order to remove noise and improve edge detection accuracy, then improving the quality of image segmentation, a novel image segmentation algorithm via neighborhood the principal component analysis and Laplace operator is proposed. The feature vectors of each pixel...
In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single channel audio. Our approach is based on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the...
Audio fingerprinting is widely used for audio identification, indexing, searching, navigation, monitoring and other monetization purposes, as well as support to other areas such as watermarking, music information retrieval and video identification. Because the ease of distorting intentionally or unintentionally an audio signal, the robustness and accuracy are very important characteristics in audio...
In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.