2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

rozdział

Unsupervised detection of non-stationary segments based on single-basis non-negative matrix factorization for effective annotation

Thanh Thi Hien Duong, Nobutaka Ono, Yasutaka Nakajima, Toshiya Ohshima

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

We propose novel methods for automatically detecting non-stationary segments using non-negative matrix factorization (NMF) with aiming to effective sound annotation. For acoustic event detection or acoustic scene analysis, preparing a sufficient amount of training data is important. However, listening to all recorded signals and annotating them are very time-consuming. Assuming that the observed acoustic...

rozdział

Saliency detection using quatemionic distance based weber descriptor and object cues

Muwei Jian, Qiang Qi, Junyu Dong, Xin Sun, więcej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, a simple and efficient method, based on Quaternionic Distance Based Weber Descriptor (QDWD) and object cues, is proposed for saliency detection. Firstly, QDWD, which was initially designed for detecting outliers in color images, is used to represent the directional cues in an image. Meanwhile, two low-level priors, namely the color contrast and center cue of the image, are utilized...

rozdział

DNN-based voice activity detection with local feature shift technique

Tae Gyoon Kang, Kang Hyun Lee, Woo Hyun Kang, Soo Hyun Bae, więcej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Recently, the deep neural networks (DNNs) are successfully adopted into the voice activity detection (VAD) area. However, the performance of the DNN-based VAD is still unsatisfactory in noise environments where the feature subspace of the training database and the test environments are not matched with each other. In this paper, we propose a local feature shift technique which normalizes the feature...

rozdział

A systolic FxLMS structure for implementation of feedforward active noise control on FPGA

Dongyuan Shi, Chuang Shi, Woon-Seng Gan

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Active noise control (ANC) is an efficient technique to deal with low frequency noise that is difficult to be abated by noise barrier or sound absorbing material. Many successful ANC systems have adopted the feedforward filtered-x least mean squares (FxLMS) algorithm to reduce machinery noise. The noise canceling headset is another well known example, where the feedback control structure is favorable...

rozdział

Classification of home appliance by using Probabilistic KNN with sensor data

SeungJun Kang, Ji Won Yoon

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

To date, many researchers have been conducted studies to control an electrical power to construct a smart home system which automatically manipulates individuals. One of the recent topics is NILM(Non-intrusive Load Monitoring) system to infer the devices states. In NILM, the approaches have been focused on dealing only with the feature of the electrical power signals to identify the states of the...

rozdział

Improved keyword spotting based on keyword/garbage models

Qiyu Chen, Weibin Zhang, Xiangmin Xu, Xiaofen Xing

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

We propose two simple methods to improve the performance of a keyword spotting system. In our application, the users are allowed to change the keywords anytime if they want. Thus we focused on phone-based GMM-HMM models since they do not require keyword-specific training data. However, the GMM-HMM based models usually have very high false alarm rate, i.e., a keyword is not present but the system gives...

rozdział

Dual-camera HDR synthesis guided by long-exposure image

Po-Jung Wu, Kuang-Tsu Shih, Homer Chen

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Dual-camera electronic devices have the potential to deliver additional functionalities, such as depth acquisition and high dynamic range (HDR) imaging, more easily than those with a single camera. In this paper, we focus on the generation of a high dynamic range image for electronic devices equipped with a pair of parallel cameras that have different exposure time. Specifically, we propose a method...

rozdział

Dynamic convolutional neural network for activity recognition

Chih-Hsiang You, Chen-Kuo Chiang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, a novel Dynamic Convolutional Neural Network (D-CNN) is proposed using sensor data for activity recognition. Sensor data collected for activity recognition is usually not well-aligned. It may also contains noises and variations from different persons. To overcome these challenges, Gaussian Mixture Models (GMM) is exploited to capture the distribution of each activity. Then, sensor data...

rozdział

Predicting articulatory movement from text using deep architecture with stacked bottleneck features

Zhen Wei, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Using speech or text to predict articulatory movements can have potential benefits for speech related applications. Many approaches have been proposed to solve the acoustic-to-articulatory inversion problem, which is much more than the exploration for predicting articulatory movements from text. In this paper, we investigate the feasibility of using deep neural network (DNN) for articulartory movement...

rozdział

Speech emotion classification using multiple kernel Gaussian process

Sih-Huei Chen, Jia-Ching Wang, Wen-Chi Hsieh, Yu-Hao Chin, więcej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Given the increasing attention paid to speech emotion classification in recent years, this work presents a novel speech emotion classification approach based on the multiple kernel Gaussian process. Two major aspects of a classification problem that play an important role in classification accuracy are addressed, i.e. feature extraction and classification. Prosodic features and other features widely...

rozdział

Object discovery in depth images

Tzu-Wei Huang, Yu-An Wei, Hwann-Tzong Chen, JenChi Liu

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

We present an unsupervised method for discovering objects from depth information. Our method can identify new common objects appearing in different depth images. We use 2D bounding box proposals to detect candidate locations of objects in each depth image, and then retrieve the corresponding 3D bounding boxes using the depth information. Invalid object proposals can be further removed by analyzing...

rozdział

Saliency aware fast intra coding algorithm for HEVC

Liyuan Xiong, Wei Zhou, Xin Zhou, Guanwen Zhang, więcej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, a saliency aware fast intra coding algorithm for HEVC is proposed consists of perceptual intra coding and fast intra prediction mode decision algorithm. Firstly, based on the visual saliency detection, an adaptive CU splitting method is proposed to reduce intra encoding complexity. Furthermore, quantization parameter is adaptively adjusted at the CU level according to the relative importance...

rozdział

Improvement algorithm of video coding efficiency using pre-filtering and post-filtering based on global motion compensation

Ho Hyeong Ryu, Kwang Yeon Choi, Byung Cheol Song

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper proposes a filtering approach based on global motion estimation (GME) and global motion compensation (GMC) as pre-processing and post-processing for video CODEC. For the pre-processing of video CODEC, group-of-pictures (GOP), i.e., basic unit for GMC and reference frames are first defined for an input video sequence. Next, GME and GMC are sequentially performed for every frame in each GOP...

rozdział

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations

Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we investigate a DNN tone-based extended recognition network (ERN) approach to Mandarin tone recognition and tone mispronunciation detection. Given a toneless syllable sequence, a tone-based ERN is constructed by assigning five different tones to each toneless syllable, obtaining a fully expanded tonal syllable network. Next, Viterbi decoding is carried out on the tone-based ERN to...

rozdział

On the use of I-vectors and average voice model for voice conversion without parallel data

Jie Wu, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Recently, deep and/or recurrent neural networks (DNNs/RNNs) have been employed for voice conversion, and have significantly improved the performance of converted speech. However, DNNs/RNNs generally require a large amount of parallel training data (e.g., hundreds of utterances) from source and target speakers. It is expensive to collect such a large amount of data, and impossible in some applications,...

rozdział

Speech enhancement method with geometric phase estimation by incorporating MIXMAX model

Xianyun Wang, Changchun Bao

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we propose a frequency-domain speech enhancement algorithm with phase estimation, in which the speech model is modeled by a Gaussian mixture model (GMM) in the log-spectral domain and two closed-form log-spectral amplitude estimators for speech and noise are derived directly by using a Mixture-Maximum (MIXMAX) model. Because the accurate estimation of speech phase could help to reduce...

rozdział

Fundamental study of decomposition based on heterogeneous sensing data in physical conversion sensor networks

Minato Oriuchi, Osamu Takyu, Keiichiro Shirai, Fumihito Sasamori, więcej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

A physical wireless conversion sensor network (PhyC-SN) is attracting much attention for achieving real time collection of massive sensing data and reduction of power consumption in wireless sensor networks. Since the collected sensing data are interfered each other, we can hardly analyze the tendency of each sensing data. This paper proposes the novel data separation based on the data tracking with...

rozdział

An improved LEA block encryption algorithm to prevent side-channel attack in the IoT system

Jaehak Choi, Youngseop Kim

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Devices of IoT (Internet of Things) are limited in resources such as CPU, memory etc. The LEA (Lightweight Encryption Algorithm) was standardized as the encryption algorithm suitable for IoT devices in Korea in 2013. However, LEA is vulnerable to the side-channel analysis attack using consumed electric power. To supplement this vulnerability, masking technique is mainly used. However, in case of masking...

rozdział

KL-divergence based mispronunciation detection via DNN and decision tree in the phonetic space

Wenping Hu, Frank K Soong

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

We propose to detect mispronunciations in a language learners speech via a discriminatively trained DNN in the phonetic space. The posterior probabilities of “senones” populated in a decision tree are trained and predicted speaker independently. Acoustic features of each input segment (with preceding and succeeding contexts of several frames) are mapped unto the whole set of senones in their corresponding...

rozdział

Brain-computer interface technology for speech recognition: A review

Mashael M. AlSaleh, Mahnaz Arvaneh, Heidi Christensen, Roger K. Moore

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper presents an overview of the studies that have been conducted with the purpose of understanding the use of brain signals as input to a speech recogniser. The studies have been categorised based on the type of the technology used with a summary of the methodologies used and achieved results. In addition, the paper gives an insight into some studies that examined the effect of the chosen stimuli...

INFONA - portal komunikacji naukowej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Unsupervised detection of non-stationary segments based on single-basis non-negative matrix factorization for effective annotation

Saliency detection using quatemionic distance based weber descriptor and object cues

DNN-based voice activity detection with local feature shift technique

A systolic FxLMS structure for implementation of feedforward active noise control on FPGA

Classification of home appliance by using Probabilistic KNN with sensor data

Improved keyword spotting based on keyword/garbage models

Dual-camera HDR synthesis guided by long-exposure image

Dynamic convolutional neural network for activity recognition

Predicting articulatory movement from text using deep architecture with stacked bottleneck features

Speech emotion classification using multiple kernel Gaussian process

Object discovery in depth images

Saliency aware fast intra coding algorithm for HEVC

Improvement algorithm of video coding efficiency using pre-filtering and post-filtering based on global motion compensation

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations

On the use of I-vectors and average voice model for voice conversion without parallel data

Speech enhancement method with geometric phase estimation by incorporating MIXMAX model

Fundamental study of decomposition based on heterogeneous sensing data in physical conversion sensor networks

An improved LEA block encryption algorithm to prevent side-channel attack in the IoT system

KL-divergence based mispronunciation detection via DNN and decision tree in the phonetic space

Brain-computer interface technology for speech recognition: A review

Opcje filtrowania

Data publikacji

Dostępność treści

Słowa kluczowe

INFONA - portal komunikacji naukowej

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) $("#expandableTitles").expandable();

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)