2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

chapter

Speaker recognition in duration-mismatched condition using bootstrapped i-vectors

Atsushi Ando, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

This paper presents a novel speaker recognition framework that handles duration mismatch between registered and test utterances. The i-vectors extracted from short utterances exhibit high variance due to phoneme imbalance, which causes performance degradation in the duration mismatch condition. Most conventional methods attempt to decrease the variance by offsetting i-vectors or speaker similarity...

chapter

Mandarin citation tone patterns of prelingual Chinese deaf adults

Yanting Chen, Yu Chen, Jin Zhang, Ju Zhang, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

The present study examined the citation patterns of Mandarin tones in prelingual deaf adults with cochlear implants or hearing aids. The results showed that the participants tried to build up tonal patterns by exploring phonetic features such as creaky voice and tonal duration. The results also indicated that although the participants had problems distinguishing T2 from T3, T2 was harder than T3 for...

chapter

Multi-lingual and multi-task DNN learning for articulatory error detection

Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, Jinsong Zhang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

For effective pronunciation error detection for second language learners, we address articulatory models based on deep neural networks (DNNs). Articulatory attributes are defined for manner and place of articulation. In order to efficiently train these models of non-native speech without using such data, which is difficult to collect on a large scale, we propose a multi-lingual learning method, in which...

chapter

Algorithms for comparison in residue number systems

Hanshen Xiao, Yu Ye, Guoqiang Xiao, Qin Kang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Residue number systems (RNSs) have been widely used in digital signal processing (DSP) systems and in applications that require fast computation, parallelism, and fault tolerance because of their carry-free property. However, the comparison operation in an RNS is quite difficult and computationally costly, which significantly limits its use for division, scaling, and overflow detection. Reverse conversions...

chapter

AP16-OL7: A multilingual database for oriental languages and a language recognition baseline

Dong Wang, Lantian Li, Difei Tang, Qing Chen

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

We present the AP16-OL7 database, which was released as the training and test data for the oriental language recognition (OLR) challenge at APSIPA 2016. Using the database, a baseline system was constructed on the basis of the i-vector model. We report the baseline results evaluated with various metrics defined by the AP16-OLR evaluation plan and demonstrate that AP16-OL7 is a reasonable data resource...

chapter

Depth map estimation with 4D light fields using confocal stereo

Qi Chen, Yuchen Zhang, Xiaochun Cao, Yunfei Zhang, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

We present a scene depth map generation method based on light field cameras. From the plenoptic function, the angular information about each image point under different sizes of aperture is extracted, which could be used for confocal stereo. Considering confocal constancy and gradient constancy, we take into account two constraints: (1) When a pixel is in focus, its relative intensities across aperture...

chapter

Plenoptic image compression based on linear transformation and interpolation

Haixu Han, Xin Jin, Qionghai Dai

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Plenoptic cameras, which can capture both spatial and angular light information in one shot, have attracted great interest. Compared with an image from a traditional camera, a plenoptic image has a large resolution and an enormous number of microlens images. Due to the huge volume of data, an efficient plenoptic image compression method for transmission and storage is required. In this paper, a novel plenoptic image compression...

chapter

Joint selective encryption and data embedding technique in HEVC video

Yiqi Tew, KokSheik Wong, Raphael C.-W. Phan

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

An HEVC format-compliant joint selective encryption and data embedding technique is proposed. The proposed technique is separable, where the decryption and data extraction processes are independent, with minimal parsing overhead. Specifically, elements in the HEVC coding structure are divided into two groups, where one group is manipulated to perceptually mask the video content, while another is modified...

chapter

Light field upsampling by joint bilateral filtering on epipolar plane images

Hao-Chiang Shao, Wen-Liang Hwang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

Due to the trade-off between spatial and angular resolution, the effective spatial resolution of a light field image is usually less than one percent of the number of pixels on the photo sensor. In this paper, we propose a prototype algorithm to upsample a light field image. Because the boundary edges of 3D objects would result in lines on epipolar plane images (EPIs), the main idea of our method...

chapter

A modified FSLMS algorithm for nonlinear ANC

Lei Luo, Jinwei Sun, Boyan Huang, Xiangbin Jiang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

An analysis of the functional link artificial neural network (FLANN) structure based on the filtered-s least mean square (FSLMS) algorithm, which is commonly used in nonlinear active noise control (NANC) systems, shows that the controller coefficients of the nonlinear parts are multiply related. This problem causes considerable imbalance in calculating these coefficients and restrains the performance...

chapter

Disparity map estimation using semi-global matching based on image segmentation

Eunsang Ko, Yo-Sung Ho

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

In this paper, we propose a semi-global matching method based on image segmentation. We apply a k-means clustering algorithm to only the left image for image segmentation. Then, to improve the segmentation result, we merge small adjacent labels along the edges of objects. After that, we extract feature points to estimate the disparity range in each label, and add weights to the disparity range...

chapter

A hybrid algorithm for multiple parameters estimation with UCA of electromagnetic vector sensors

Julan Xie, Xue Yang, Huiyong Li, Jinfeng Hu

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

In order to avoid the multi-dimensional spectrum peak search for multiple parameters estimation, a hybrid algorithm with uniform circular array (UCA) of electromagnetic vector sensors based on the beamspace transformation is proposed. In the beamspace, the azimuth angle can be split from other parameters and can be estimated without using spectral peak search. Then, the elevation estimation can be...

chapter

Depth image super-resolution via multi-frame registration and deep learning

Ching Wei Tseng, Hong-Ren Su, Shang-Hong Lai, JenChi Liu

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 8

In this paper, we develop an algorithm for depth image super-resolution from RGB-D images, which are acquired under different imaging conditions so that we can combine them to improve the image quality with precise 3D registration. We focus on how to increase the resolution and quality of depth images by combining multiple RGB-D images and using the deep learning technique. In the proposed solution,...

chapter

Tibetan vowel analysis with a multi-modal Mandarin-Tibetan speech corpus

Gyaltsen Lobsang, Wenhuan Lu, Kiyoshi Honda, Jianguo Wei, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

This paper reports on the construction of a multi-modal Mandarin-Tibetan speech database collected from native speakers of WeiZang dialect. The Mandarin-Tibetan corpus contains 41 Tibetan sentences, 27 Chinese sentences, 30 Tibetan consonants, 4 Tibetan vowels, and 25 Tibetan monosyllables. A multi-modal data collection system was established, which comprises an ultrasound scanner, high-speed camera,...

chapter

Shape-adaptive image compression using lossy shape coding, SA-prediction, and SA-deblocking

Li-Ang Chen, Jian-Jiun Ding, Yih-Cherng Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 10

Because annoying blocking or ghost artifacts tend to appear at low bitrates in conventional compression approaches such as the JPEG and JPEG2000 standards, the concept of object-oriented image compression has been proposed. This kind of method is able to retain the structural boundaries of the image and therefore has relatively good visual quality even at high compression ratios. In this paper, we propose...

chapter

Time-of-flight image enhancement for depth map generation

Yunseok Song, Yo-Sung Ho

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

Time-of-Flight (ToF) cameras are now easily accessible. They capture real distances of objects in a controlled environment. Yet, the ToF image may include disconnected boundaries between objects. In addition, certain objects, such as black hair, do not reflect infrared rays. Such problems are caused by the physics of ToF. This paper proposes a method to compensate for such errors...

chapter

Underwater multi-spectral photometric stereo reconstruction from a single RGBD image

Hengchao Jiao, Yisong Luo, Nan Wang, Lin Qi, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

Traditional theories and methods in 3D reconstruction were all proposed with the implicit assumption of an air environment. However, the underwater environment differs in many respects. The absorption and scattering effects caused by the suspended particles in the water attenuate the image signal, which disqualifies the traditional reconstruction algorithms. In this paper, we propose a novel method to reconstruct...

chapter

Self-localization and channel synchronization of smartphone arrays using sound emissions

Nobutaka Ono, Kazuaki Shibata, Hirokazu Kameoka

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

In this paper, we propose a simple calibration method for an ad hoc microphone array that utilizes sound emissions. Assuming that each device has a function to record sound while emitting another sound, the location, the recording time offset, and the sampling frequency mismatch of each device are estimated from the time of arrival (TOA) of the sound emitted by each device. The accurate estimation...

chapter

Intra block copy hash reduction for HEVC screen content coding

Che-Wei Kuo, Hsueh-Ming Hang, Chun-Liang Chien

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 9

To meet a wide range of needs for video applications such as remote desktop, video conferencing, distance education, and cloud gaming, the ISO/ITU Joint Collaborative Team on Video Coding (JCT-VC) committee has recently been specifying the Screen Content Coding (SCC) standard as one of the extensions of High Efficiency Video Coding (HEVC). In this paper, the hash search method of the standard-adopted Intra...

chapter

Digital mirror box: An interactive hand-motor BMI rehabilitation tool for stroke patients

Yumie Ono, Takanori Tominaga, Takaho Murata

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 7

We develop a brain-machine interface for the hand-motor rehabilitation of stroke patients. The interface provides both visual and proprioceptive feedback to the user based upon the successful generation of cortical motor commands. We discuss the details of the proposed system and provide a summary of the preliminary experiment. The experiment investigates the importance of simultaneous visual and...
