Search results for: Yan Song

Items from 1 to 14 out of 14 results

article

Local receptive fields based extreme learning machine with hybrid filter kernels for image classification

Bo He, Yan Song, Yuemei Zhu, Qixin Sha, et al.

Multidimensional Systems and Signal Processing > 2019 > 30 > 3 > 1149-1169

In this paper, an innovative method called extreme learning machine with hybrid local receptive fields (ELM-HLRF) is presented for image classification. In this method, filters generated by Gabor functions and the randomly generated convolution filters are incorporated into the convolution filter kernels of local receptive fields based extreme learning machine (ELM-LRF). Extreme learning machine (ELM)...

chapter

Fish recognition using convolutional neural network

Guoqing Ding, Yan Song, Jia Guo, Chen Feng, et al.

OCEANS 2017 – Anchorage > 1 - 4

OCEANS 2017 - Anchorage

Studying fish recognition has important practical and theoretical significance for aquaculture and marine biology. Fish recognition is a challenging problem because of distortion, overlap and occlusion in digital images. Previous researchers have done a lot of work on fish recognition, but the classification accuracy may not be high enough. Classification and recognition methods based on convolutional...

chapter

PCA and Kernel-based extreme learning machine for side-scan sonar image classification

Mingcui Zhu, Yan Song, Jia Guo, Chen Feng, et al.

2017 IEEE Underwater Technology (UT) > 1 - 4

2017 IEEE Underwater Technology (UT)

As an important part of oceanographic surveys, side-scan sonar image classification has attracted much attention in the past two decades. Due to the special properties of sonar images, traditional approaches struggle to achieve good classification accuracy, which hinders their real-world deployment. In this paper, a novel classification system based on kernel-based extreme learning machine (KELM)...

chapter

Image classification with CNN-based Fisher vector coding

Yan Song, Xinhai Hong, Ian McLoughlin, Lirong Dai

2016 Visual Communications and Image Processing (VCIP) > 1 - 4

2016 Visual Communications and Image Processing (VCIP)

Fisher vector coding methods have been demonstrated to be effective for image classification. With the help of convolutional neural networks (CNN), several Fisher vector coding methods have shown state-of-the-art performance by adopting the activations of a single fully-connected layer as region features. These methods generally exploit a diagonal Gaussian mixture model (GMM) to describe the generative...

chapter

Compact convolutional neural network transfer learning for small-scale image classification

Zengxi Li, Yan Song, Ian Mcloughlin, Lirong Dai

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2737 - 2741

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Transfer learning methods have demonstrated state-of-the-art performance on various small-scale image classification tasks. This is generally achieved by exploiting the information from an ImageNet convolutional neural network (ImageNet CNN). However, the transferred CNN model generally has high computational complexity and storage requirements. This raises issues for real-world applications, especially...

chapter

A spectral based visual matching method for image classification

Yan Song, Wu Guo, Li-Rong Dai, Ian Vince Mcloughlin

2014 International Conference on Audio, Language and Image Processing > 666 - 670

2014 International Conference on Audio, Language and Image Processing (ICALIP)

Visual matching algorithms can be described in terms of visual content representation and similarity measure. With local feature based representations, visual matching can be restated as: 1) how to obtain visual similarity from the local kernel matrix, and 2) how to calculate the local kernel matrix effectively and efficiently. Existing methods mostly focus on the former, and use Euclidean distance...

chapter

Effective image representation based on bi-layer visual codebook

Yan Song, Jinhui Tang, Xia Li, Qi Tian, et al.

The First Asian Conference on Pattern Recognition > 224 - 228

2011 First Asian Conference on Pattern Recognition (ACPR 2011)

Recently, the Bag-of-visual Words (BoW) based image representation has drawn much attention in image categorization and retrieval applications. It is known that visual codebook construction and the related quantization methods play important roles in the BoW model. Traditionally, the visual codebook is generated by clustering local features into groups, and the original feature is hard quantized to...

chapter

MCMC-based scene segmentation method using structure of video

Yan Song, T Ogawa, M Haseyama

2010 10th International Symposium on Communications and Information Technologies > 862 - 866

2010 10th International Symposium on Communications and Information Technologies (ISCIT 2010)

Video scene segmentation and classification are fundamental steps for multimedia retrieval, browsing and indexing. In this paper, we present a robust scene segmentation approach based on the Markov Chain Monte Carlo (MCMC) method using the structure of video sequences. In our method, there are two novel approaches to segment video sequences into scenes. The first approach is the use of the video structures...

chapter

An Improved Multiple Instance Learning Algorithm for Object Extraction

Mengyue Wang, Changlin Zhang, Yan Song

2010 Chinese Conference on Pattern Recognition (CCPR) > 1 - 5

2010 Chinese Conference on Pattern Recognition (CCPR 2010)

Based on the MILES algorithm, we propose a novel multiple instance learning approach which regards the visual word dictionary as the feature space, and combines segmentation for object detection and extraction in the process of instance classification. This approach uses the "Bag of Words" model. The whole image is considered as a multiple instance bag. The visual words that represent the image are regarded...

chapter

Extraction of image semantic features with spatial-range mean shift clustering algorithm

Mengyue Wang, Changlin Zhang, Yan Song

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 906 - 909

2010 10th International Conference on Signal Processing (ICSP 2010)

In recent years, the Bag-of-visual Words image representation has led to many significant results in visual object recognition and categorization. However, experiments show that the unsupervised clustering of primitive visual features tends to result in the limited discriminative ability of the visual codebook, since it does not take the spatial relationship between visual primitives into consideration...

chapter

Multiple instance learning using visual phrases for object classification

Yan Song, Qi Tian, Mengyue Wang, Heng Liu, et al.

2010 IEEE International Conference on Multimedia and Expo > 649 - 654

2010 IEEE International Conference on Multimedia and Expo (ICME)

Recently, the bag of words (BoW) model has led to many significant results in visual object classification. However, due to the limited descriptive and discriminative ability of visual words, the resulting performance of visual object classification still falls short of its analogue in the text domain, i.e. document categorization. Furthermore, for weakly labeled image data, where we only know whether...

chapter

Double-Density Dual-Tree Wavelet Transform Based Texture Classification

Yu-Long Qiao, Chun-Yan Song

2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 1322 - 1325

2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2009

Texture classification plays an important role in image analysis. The wavelet transform is a very efficient multiscale analysis method that has been successfully applied to describe the texture. The double-density dual-tree wavelet transform can simultaneously possess the properties of the double-density discrete wavelet transform (DWT) and the dual-tree DWT. In this paper, the texture feature based...

chapter

Double-Density Discrete Wavelet Transform Based Texture Classification

Yu-Long Qiao, Chun-Yan Song, Chun-Hui Zhao

Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007) > 1 > 91 - 94

2007 Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Texture classification plays an important role in image analysis. The wavelet transform is a very efficient multiscale analysis method that has been successfully applied to describe texture. However, it is not translation-invariant. The recent double-density discrete wavelet transform has two interesting properties: low computational complexity and near shift invariance. In this paper, the texture feature based...

chapter

Automatic video annotation based on co-adaptation and label correction

Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, et al.

2006 IEEE International Symposium on Circuits and Systems > 4 pp.

2006 IEEE International Symposium on Circuits and Systems

As there is a large gap between high-level semantics and low-level features, it is difficult to obtain high-accuracy video semantic annotation through automatic methods. In this paper, we propose a novel automatic video annotation method, which greatly improves the annotation performance by learning from unlabeled video data, as well as exploring temporal consistency of video sequences. To effectively...

Filter options

Keywords:
IMAGE CLASSIFICATION


INFONA - science communication portal
