The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
We have been developing impression prediction techniques for oral presentations. The contribution of this paper is two folds. First, we introduce soft code assignment for the bag-of-features (BoF) representation to improve the prediction accuracy. Second, we discuss towards online impression prediction aiming at real-time feedback to the speaker. Experimental results using over 1,600 TED presentation...
For people living in the countryside, an effective long-distance medical and health service is very important. People living in western China, especially, require convenient communication in their native language with doctors working in a modern city. To address this problem, we developed a multiple-language translation system for long-distance medical and outpatient services. This system initially...
Conventional speech enhancement (SE) algorithms are mainly designed with the aim of improving signal-to-noise levels of noisy speech signals. However, many applications consider the enhancement of speech intelligibility as the goal for an SE system. In this study, we propose a maximum speech intelligibility (MSI) post-filter that aims to enhance the intelligibility of processed speech signals. The...
A large number of real-world observations by social sensors all over the world can be obtained from various social networking services. Especially, observations covering miscellaneous areas of interest are posted to Twitter as short text messages. Our goal is to extract a wide range of observations related to the target of interest specified by the user from Twitter regardless of their popularity...
A challenging problem in the wireless LAN localization is the severe fluctuation of the receive signal strength (RSS) even for the stationary client. The time-varying RSS which is resulted from the noise deteriorates the system accuracy drastically. Previous works presented various approaches to improve the positioning accuracy. Singular-Value-Decomposition (SVD) based-noise reduction technique is...
Recently, many people have begun to take pictures of meals and food either at home or in restaurants. These pictures are then uploaded to social networking services (SNS) where they are shared with friends. People want to take pictures of food that looks delicious, but they often find this difficult. This is because most people lack the knowledge required to take attractive pictures. There are many...
In this paper, we propose a temporal modulation spectral resto-ration (TMSR) approach for robust feature extraction in automatic speech recognition. There were three main function blocks in TMSR. First, mean and variance normalization (CMVN) was applied to the original feature sequence. Second, the noise characteristic was estimated with an analysis of the normalized features. Third, a gain function...
It is necessary to identify speech segments carrying important information for speech intelligibility, particularly in noise. Earlier work based on a relative rootmean-square (RMS) level based segmentation suggested that middle-level (ranging from the overall RMS level to 10 dB below) segments contained more vowel-consonant boundaries wherein the spectral change was often most prominent, and perhaps...
This paper proposes a method for estimating the attractiveness of food photos in order to assist a user to shoot them attractively. The proposed method extracts both color and shape features from input food images, and then integrates them according to a regression scheme. By this way, the proposed method estimates the attractiveness of an unknown food photo. We also created a food image dataset taken...
Precision medicine is promising a revolution for healthcare and medicine in the 21st century. The scientific foundation for this revolution is accomplished by analyzing healthcare data, as well as biological high-throughput data sets from genomics, proteomics, transcriptomics, metabolomics, etc. With data mining and statistical techniques, it has the potential to improve health outcomes and reduce...
Scene recognition has a wide range of applications, such as object recognition and detection, content-based image indexing and retrieval, and intelligent vehicle and robot navigation. In particular, natural scene images tend to be very complex and are difficult to analyze due to changes of illumination and transformation. In this study, we investigate a novel model to learn and recognize scenes in...
Semantic computing is an emerging research field that has drawn much attention from both academia and industry. It addresses the derivation and matching of semantics of computational "contents" where "contents" may be anything including text, multimedia, hardware, network, etc. which can be mapped to many areas in computer science that involve analyzing and processing the intension...
Mobile visual search has undergone a wide development and gained much progress in recent years thanks to the ever-growing computational power of mobile devices. Most visual search methods take a single image as query and generate an image-level representation to implement image retrieval. To form a compact and discriminative representation for the query image, Fisher vectors (FV) have shown great...
Diabetic retinopathy is known to be one of the most frequent and serious eye diseases that typically cause blindness in adults between 20 and 60 years of age. Microaneurysm (MA) is one of the most important syndromes in color fundus images. A tool for automatic detection of MAs can significantly reduce the workload of the ophthalmologists. A multi-stage strategy to screen candidate MAs is used in...
Raising children is challenging and requires lots of care. Parents always have to keep track of the status of their children, and provide proper care to them in time, like hydration, dinning, clothing, discomfort relieving, etc. However, it is always difficult to stay alert or be aware of the care required to the children at proper moments. One reason is that parents nowadays are busy, as they usually...
City-identification of videos aims to determine the likelihood of a video belonging to a set of cities. In this paper, we present an approach using only audio, thus we do not use any additional modality such as images, user-tags or geo-tags. In this manner, we show to what extent the city-location of videos correlates to their acoustic information. Success in this task suggests improvements can be...
Recently more and more videos have been shared through the websites such as youtube.com. In order to utilize them efficiently, instance search (INS) techniques which find a specific person, object and place from a video database without metadata has been desired. It is known that the BM25 scoring method is a powerful tool for the INS task. It is, however, also known that it requires a time consuming...
Public cultural services are developed rapidly. The growth of the data related to the public cultural services are dramatically. It needs big data technologies for processing the massive public cultural data. This paper proposes a big data analysis platform for facilitating the public cultural services. The platform collects cultural data from public cultural institutions and analyzes the data by...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.