With the explosion of multimedia data, data of different modalities commonly coexist in web repositories. It is therefore increasingly important to explore the underlying intricate cross-media correlations, rather than rely on single-modality distance measures, so as to improve multimedia semantics understanding. Cross-media distance metric learning focuses on measuring the correlation between multimedia data of different modalities. However, content heterogeneity and the semantic gap make cross-media distance very challenging to measure. In this paper, we propose a novel cross-media distance metric learning framework based on sparse feature selection and multi-view matching. First, we employ sparse feature selection to select a subset of relevant features and remove redundant ones from high-dimensional image and audio features. Second, we maximize the canonical coefficient during image-audio feature dimension reduction to mine cross-media correlation. Third, we construct a Multi-modal Semantic Graph to discover the embedded manifold cross-media correlation. Finally, we fuse the canonical correlation and the manifold information into multi-view matching, which harmonizes the different correlations through an iterative process, and build a Cross-media Semantic Space for cross-media distance measurement. Experiments on an image-audio dataset for cross-media retrieval yield encouraging results that demonstrate the effectiveness of our approach.