Learning semantic embedding at a large scale

Min-Hsuan Tsai; Jinjun Wang; Tong Zhang; Yihong Gong; Thomas S. Huang

doi:10.1109/ICIP.2011.6116168

Learning semantic embedding at a large scale

Tsai, Min-Hsuan, Wang, Jinjun, Zhang, Tong, Gong, Yihong, Huang, Thomas S.

Source

2011 18th IEEE International Conference on Image Processing > 2497 - 2500

Abstract

A key problem in image annotation is to learn the underlying semantics. However, finding such semantic embeddings is a challenge task and often requires large amount of tagging information. In this paper, we propose to utilize multi-modality cues by incorporating visual and textual information as embedded objects. The paper further presents a multi-task learning framework that simultaneously learns the approximation of two semantic embeddings with efficient multi-stage convex relaxation technique. The experiments show that the proposed method presents very promising performance in both memory usage and training time for large-scale dataset, as well as image classification accuracy.