Recommender systems are best known for their applications on e-commerce sites, and most are static models. Classical personalized recommender algorithms include the collaborative filtering method applied at Amazon and the matrix factorization algorithm from Netflix. In this article, we aim to combine a traditional model with a behavior pattern extraction method. We use desensitized mobile transaction...
This paper presents a new statistical model for describing real textured images. Our model is based on the observation that the Scale-Invariant Feature Transform (SIFT) descriptors extracted from a given image can be properly modeled by the Gamma distribution. The maximum-likelihood algorithm was used to estimate the two parameters of the Gamma distribution. The efficiency of the proposed approach was...
Over the years, speech recognition has taken over the market. Speech input can be used in various domains, such as automatic reading and entering data into a system. Speech recognition can minimize the use of text and other types of input while also minimizing the calculation needed for the process. A decade ago, speech recognition was difficult to use in any system, but with elevation in...
Biometric applications are now widely employed for security needs. They provide a higher level of security for physical or logical access. The electrocardiogram (ECG) signal can be exploited as a biological tool for human identification, given that it is robust to falsification and more universal. This research introduces a new method that exploits this property of the ECG. Five...
In this paper, we investigate methods to improve the recognition performance of low-resource languages with limited training data by borrowing subspace parameters from a high-resource language in the subspace Gaussian mixture model (SGMM) framework. As a first step, only the state-specific vectors are updated using the low-resource language, while retaining all the globally shared parameters from the high-resource...
Aligning two representations of the same domain with different expressiveness is a crucial topic in today's semantic web and big data research. OWL ontologies and Entity Relation Diagrams are the most widespread representations, and their alignment allows for semantic data access via an ontology interface and for ontology storing techniques. The term "alignment" encompasses three different processes:...
We propose a language based on relational algebra extended by intervals for detecting high-level surveillance events from a video stream. The operators we introduce for describing temporal constraints are based on the well-known Allen's interval relationships. The semantics of our language are clearly defined and we illustrate its usefulness by expressing typical events in it and showing the promising...
This work proposes a system that perceives hand signs and gestures via a computer vision system and extracts a sufficient number of images from it. After applying image processing and extracting the features of the images, the system uses an algorithm to recognize the hand signs and gestures. In the process of recognizing the hand signs, the Artificial Neural Network (ANN) is being trained with some specific...
One of the major challenges in human emotion recognition is the extraction of features containing maximum prosodic information. The accuracy of the entire emotion detection system ultimately relies on the efficiency of the selected feature. When it comes to identifying emotions from voice, ambiguity in detection can never be completely avoided, for several reasons. Exclusion of redundant information to...
The aim of this work is to automatically detect and analyse emotions from digital videos and images. Initially, the images are extracted from pre-recorded videos, from which the faces are cropped automatically. The training dataset is formed with a minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images into their Intrinsic...
Building natural scene statistic models is a potentially transformative development for a wide variety of visual applications, ranging from the design of faithful image and video quality models to the development of perceptually optimized image enhancing techniques. Most predominant statistical models of natural images only characterize the univariate distributions of divisively normalized bandpass...
This paper designs a software system for the smartphone platform. The purpose of this system is to provide a reasonable method for evaluating the English accent of non-native speakers, based on phoneme recognition and fluency assessment, using a Hidden Markov Model (HMM). In addition, this paper uses a neural network algorithm to combine objective scoring and experts' scoring to increase the accuracy...
The safety of miners is of interest to all countries. In the event of a coal mine disaster, how to locate the miners remains the biggest and most urgent issue. The aim of this study is to propose a precise positioning method for underground mine environments. In this paper, a layered two-step Hidden Markov Model is proposed to simulate human walking in underground mine environments and an improved...
In this paper, a novel sparse representation over learned and exemplar dictionaries is explored to estimate the speech information of stressed speech. Stressed speech contains both speech and stress information. The presence of stress information induces acoustic variability, which degrades the performance of speech recognition systems. In this work, the acoustic variabilities...
Acoustic features have been used considerably for speaker identification and recognition. A few of these features have also been used by researchers to recognize emotions in speech effectively. Here, an attempt is first made to characterize human speech emotions with acoustic features such as speech rate, formant frequencies, amplitude and energy. Further, a reduced acoustic feature set based...
There has been confusion about the American Midland dialect for a long time. Since 1968, researchers have been looking for an answer to the question of whether it exists or not. Starting with Bailey, who was unsuccessful in identifying the Midland dialect based on vocabulary only, as vocabulary varies within the same community, and ending with Johnson, who proved that the Midland region is a separate...
A smile is not only a visual expression. When it occurs together with speech, it also alters the acoustic realization of that speech. Being able to synthesize speech altered by the expression of a smile can hence be an important contributor to adding naturalness and expressiveness in interactive systems. In this work, we present a first attempt to develop a Hidden Markov Model (HMM)-based synthesis system allowing...
In speech recognition, it is preferable not to hypothesize the details, e.g., specific age and gender, of a target user. However, speaker independence is one of the factors that degrade ASR performance. In this work, we propose a speaker adaptation method to recognize a short-time utterance. There have been several studies on speaker-independent DNN-HMM in which an i-vector is computed, and the additional...
Hand gestures are a primary concern for hearing-impaired people because they use sign language to communicate with each other and with hearing people. In general, hearing people have difficulty with sign language, so an interpreter is needed to support communication. An automatic hand gesture recognition system is therefore needed to help hearing-impaired people integrate into...
This paper proposes a system that recognizes a person's emotional state from recorded audio signals. The system is able to recognize four emotions (anger, happiness, sadness and neutral). The emotion recognition technique is mainly composed of two subsystems: 1) gender recognition (GR) and 2) emotion recognition (ER). It has been proved experimentally that the performance...