Search results for: Dong Wang

Items from 1 to 8 out of 8 results

chapter

Learning to Detect Salient Objects with Image-Level Supervision

Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3796 - 3805

Deep Neural Networks (DNNs) have substantially improved the state-of-the-art in salient object detection. However, training DNNs requires costly pixel-level annotations. In this paper, we leverage the observation that image-level tags provide important cues of foreground salient objects, and develop a weakly supervised learning method for saliency detection using image-level tags only. The Foreground...

chapter

Memory visualization for gated recurrent neural networks in speech recognition

Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2736 - 2740

Recurrent neural networks (RNNs) have shown clear superiority in sequence modeling, particularly the ones with gated units, such as long short-term memory (LSTM) and gated recurrent unit (GRU). However, the dynamic properties behind the remarkable performance remain unclear in many applications, e.g., automatic speech recognition (ASR). This paper employs visualization techniques to study the behavior...

article

Person Re-Identification via Distance Metric Learning With Latent Variables

Chong Sun, Dong Wang, Huchuan Lu

IEEE Transactions on Image Processing > 2017 > 26 > 1 > 23 - 34

In this paper, we propose an effective person re-identification method with latent variables, which represents a pedestrian as the mixture of a holistic model and a number of flexible models. Three types of latent variables are introduced to model uncertain factors in the re-identification problem, including vertical misalignments, horizontal misalignments and leg posture variations. The distance...

chapter

System combination for short utterance speaker recognition

Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, et al.

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

For text-independent short-utterance speaker recognition (SUSR), the performance often degrades dramatically. This paper presents a combination approach to the SUSR tasks with two phonetic-aware systems: one is the DNN-based i-vector system and the other is our recently proposed subregion-based GMM-UBM system. The former employs phone posteriors to construct an i-vector model in which the shared statistics...

chapter

Multi-task recurrent model for true multilingual speech recognition

Zhiyuan Tang, Lantian Li, Dong Wang

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

Research on multilingual speech recognition remains attractive yet challenging. Recent studies focus on learning shared structures under the multi-task paradigm, in particular a feature sharing structure. This approach has been found effective to improve performance on each individual language. However, this approach is only useful when the deployed system supports just one language. In a true multilingual...

chapter

Binary speaker embedding

Lantian Li, Chao Xing, Dong Wang, Kaimin Yu, et al.

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 4

The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and hence it is a way of continuous speaker embedding. In this paper, we investigate binary speaker embedding, which transforms i-vectors to binary vectors (codes) by a hash function. We start from locality sensitive hashing (LSH), a simple binarization approach where binary codes are derived from a set...

chapter

Vehicle pose estimation in WAMI imagery via deep convolutional neural networks

Meng Yi, Dong Wang, Fan Yang, Jonathan Xu, et al.

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS) > 233 - 240

Wide Area Motion Imagery (WAMI) is usually taken from unmanned air vehicles at low frame rates and has very wide ground coverage. These images serve as a rich source for many applications like surveillance, urban planning and traffic monitoring. Thus, understanding WAMI imagery exploitation has been gaining more interest in recent years. In this paper, we focus on estimating the pose of vehicles in...

chapter

Document classification with distributions of word vectors

Chao Xing, Dong Wang, Xuewei Zhang, Chao Liu

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

The word-to-vector (W2V) technique represents words as low-dimensional continuous vectors in such a way that semantically related words are close to each other. This produces a semantic space where a word or a word collection (e.g., a document) can be well represented, and thus lends itself to a multitude of applications including document classification. Our previous study demonstrated that representations...
