The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The pollen grains of different plant taxa exhibit various shapes and sizes. This structural diversity has made the identification and classification of pollen grains an important tool in many fields. Despite the myriad of applications, the classification of pollen grains is still a tedious and time-consuming process that must be performed by highly skilled specialists. In this paper, we propose an...
Recently, a novel "completely automated public Turing test to tell computers and humans apart (CAPTCHA)'' system has been proposed, in which users are asked to separate natural faces of humans and artificial faces of virtual world avatars. The system is based on the assumption that computers cannot separate them while it is an easy task for humans. Conventional digital forensics approaches to...
Captchas are frequently used on the modern world wide web to differentiate human users from automated bots by giving tests that are easy for humans to answer but difficult or impossible for algorithms. As artificial intelligence algorithms have improved, new types of Captchas have had to be developed. Recent work has proposed a new system called Avatar Captcha, in which a user is asked to distinguish...
We are now developing a Japanese speaking test called SCAT, which is part of J-CAT (Japanese Computerized Adaptive Test), a free online proficiency test for Japanese language learners. In this paper, we focus on the sentence-reading-aloud task and the sentence generation task in SCAT, and propose an automatic scoring method for estimating the overall score of answer speech, which is holistically determined...
In this paper, we explore the retrieval of perceptually similar audio. It focuses on finding sounds according to human perceptions. Thus such retrieval is more “human-centered” [1] than previous audio retrievals which intend to find homologous sounds. We make comprehensive use of various acoustic features to measure the perceptual similarity. Since some acoustic features may be redundant or even adverse...
Ensemble feature selection is known for its robustness and generalization of highly accurate predictive models. In this paper, we use different filter-based feature selection methods in an ensemble manner to improve face recognition. The goal is to distinguish human faces from avatar faces. Our approach was able to achieve very high accuracy, 99%, using less than 1% of the pixels in each image. This...
Probabilistic latent semantic analysis (PLSA) has been widely used in the machine learning community. However, the original PLSAs are not capable of modeling real-valued observations and usually have severe problems with over fitting. To address both issues, we propose a novel, regularized Gaussian PLSA (RG-PLSA) model that combines Gaussian PLSAs and hierarchical Gaussian mixture models (HGMM). We...
This paper proposes an investigation on classification of the positive and negative emotions via the use of electroencephalogram (EEG). EEG bandpowers are extracted as the feature of interest. Two simple decision rules to classify positive and negative emotions are proposed, i.e. 1) using both the left and right frontal information and 2) using only one side of the left or right frontal information...
Human Behavior Understanding (HBU) is a major challenge facing intelligent agents. Most approaches to solve this problem assume a recognition/detection context in which the agent/robot tries to match the perceived behavior to one or more predefined motion patterns (e.g. walking, running etc). A more challenging problem is discovering these motion patterns without apriori assumption about the motions...
We have been developing a hands-free voice controller for a home network system (HNS) by using microphone arrays. In our current implementation, however, all human-HNS interactions are performed by voice only. Hence, the interactions tend to be mechanical, dreary and uninformative. To achieve richer interactions, we try to introduce the virtual agent technology as a feedback interface of the HNS....
This paper presents a method of automatic lexical stress assessment for L2 English speech. Syllable stress can be labeled at three levels - primary (P), secondary (S) and no (N) stress, but secondary stress may vary among word pronunciations within and across accents and present difficulties for human perception. Hence, evaluation of lexical stress based on all three levels (i.e., the P-S-N criterion...
Attention-deficit/hyperactivity disorder (ADHD) is a neuropsychiatric disorder which is quite common in childhood, with an estimated prevalence of 5–8%, and often persists into adolescence and adulthood. It is further characterized as inappropriate developmentally symptoms of inattention, impulsiveness, motor over-activity and restlessness. The aim of this study is to evaluate the feasibility of diagnosing...
Since more and more motion data are created, the reuse of motion data becomes increasingly important. We present a method to reconstruct a 3D figure animation based on 2D user-defined moiton on our motion database. First, a novel interactive interface for defining the input query is persented. Users can draw a series of stick figure of 2D human motion through it. These actions have a certain mount...
Knowledge on human eye-hand coordination can be used for human-like system design and medical diagnosis. This document analyses and briefly presents the parameters of the coordination while executing different eye-hand related tasks. Existing quantitative model of manuo-ocular coordination, capable of simulating the human performance in target tracking, is redesigned for a capability to simulate the...
Due to the maturing of digital image processing techniques, there are many tools, which can edit an image easily without leaving obvious traces to the human eyes. So the authentication of digital images is an important issue in our life. In this paper, multi-resolution Weber law descriptors (WLD) based method that detects copy-move image forgery is introduced. The proposed multi-resolution WLD extracts...
The usage of non-scripted lecture videos as a part of learning material is becoming an everyday activity in most of higher education institutions due to the growing interest in flexible and blended education. Generally these videos are delivered as part of Learning Objects (LO) through various Learning Management Systems (LMS). Currently creating these video learning objects (VLO) is a cumbersome...
Aiming at the problems of the global image matching, an area based on image matching algorithm is presented. In the paper, the principle of area based image matching is given firstly and the algorithm is analyzed in detail. In the algorithm, in order to diminish the influence of the segmentation to image matching, color key feature points are chosen as the feature of image and weighted distance measurement...
The desire of human beings and the goal of government policy basically have a common point, i.e., the better life in happiness. However, the common point composed of multi-criteria from objective and subjective living causes difficulty in decision making. This research applies the fuzzy set extensions of dominance-based rough set approach (FSE-DRSA) on the better life in happiness defined by the Organization...
The objective of this study is to investigate the relationship between the features of augmented reality (AR) and human memorization ability. The basis of this relation is derived from the following features. The AR feature is that AR can provide information associated with specific locations in the real world. The feature of human memory is that humans can easily memorize information if the information...
This paper presents hand grasp classifier using perimeter change of the forearm. Two sensors based on strain gauge were employed. Signal processing was applied to remove some ripples. Four different classes were trained. Real time classifier was used to recognize the trained grasps. Experimental results show that the average accuracy was 81.2%.
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.