chapter

Visual Saliency Detection Based on Disperse Degree of Color

Jianshe Ma, Libo Guo, Ping Su

2017 2nd International Conference on Multimedia and Image Processing (ICMIP) > 38 - 42

Saliency detection aims to focus attention on the important parts of a map, which is an excellent ability of the human visual system. In this paper, we present a saliency detection model based on the principle that pixels belonging to the background are more dispersed than those of the target area. Color contrast in different channels is employed to classify the pixels. Our method outperformed five...

chapter

Effective selection of mixed color features for image segmentation

Luo Junfeng, Ma Jinwen

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 794 - 798

Image segmentation is a basic task in image analysis and understanding, and feature extraction is important but difficult. In this paper, we propose an effective feature selection method for color image segmentation that selects a group of mixed color features or channels from several different color spaces according to the principle of the least entropy of the pixel frequency histogram distribution. Actually,...

chapter

Vision-based indoor localization approach based on SURF and landmark

Kai Guan, Lin Ma, Xuezhi Tan, Shizeng Guo

2016 International Wireless Communications and Mobile Computing Conference (IWCMC) > 655 - 659

Image-based indoor localization is an important problem with many useful applications. This paper proposes an indoor localization system that achieves fine localization and lower latency by using more prior information, including the tilt angle and the relative height between the camera optical center and the origin of the reference coordinate system (RCS). The system is divided into two stages: offline stage and online...

chapter

A fast visual map building method using video stream for visual-based indoor localization

Hao Xue, Lin Ma, Xuezhi Tan

2016 International Wireless Communications and Mobile Computing Conference (IWCMC) > 650 - 654

Visual-based indoor localization has become a favored research area in recent years. It can be used inside a building where GPS signals are often not available. Owing to its low deployment cost, visual-based indoor localization has also been implemented in complicated indoor environments. However, in order to increase the accuracy of indoor localization, the scale of image database should be as large...

chapter

Smart phone camera image localization method for narrow corridors based on epipolar geometry

Yicheng Zhang, Lin Ma, Xuezhi Tan

2016 International Wireless Communications and Mobile Computing Conference (IWCMC) > 660 - 664

As some public buildings have become large in spatial scale, people find it more and more difficult to know their actual location in these buildings. Generally, obtaining location information in an indoor environment is more complex than in an outdoor environment, because traditional outdoor localization methods do not perform well indoors. Under these circumstances, image...

chapter

Effect of multi-condition training and speech enhancement methods on spoofing detection

Hong Yu, Achintya Sarkar, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, et al.

2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE) > 1 - 5

Many researchers have demonstrated the good performance of spoofing detection systems under clean training and testing conditions. However, it is well known that the performance of speaker and speech recognition systems significantly degrades in noisy conditions. Therefore, it is of great interest to investigate the effect of noise on the performance of spoofing detection systems. In this paper, we...

chapter

Deep neural networks for head pose classification

Yang Lu, Shujuan Yi, Nan Hou, Jingfu Zhu, et al.

2016 12th World Congress on Intelligent Control and Automation (WCICA) > 2787 - 2790

Head pose classification and estimation are essential for many face detection and recognition tasks and tracking applications. This paper proposes robust and fast algorithms for head pose classification from a labeled head pose database by employing deep neural networks (DNNs). The DNNs are capable of learning from raw images and processing large-scale training image datasets. The proposed DNNs...

chapter

Isolated speech recognition using Fuzzy C Means technique

Vani H.Y, M.A. Anusuya

2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT) > 352 - 357

Automatic speech recognition is one of the challenging areas in the field of speech signal processing. Automatic speech recognition technology converts a speech signal into text. This paper presents the implementation of an isolated Kannada word recognizer using Vector Quantization (VQ) and Fuzzy C-Means (FCM) techniques. The paper compares and contrasts the recognition accuracies of FCM and k-means techniques...

chapter

Sparse autoencoder based spatial pyramid facial feature learning

Ma Xiao, Jufu Feng

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR) > 770 - 774

Spatial pyramid feature learning methods, such as Spatial Pyramid Matching (SPM) and Sparse Coding based Spatial Pyramid Matching (ScSPM), have achieved significant performance in image categorization. However, most of these methods are still based on manually designed features, such as SIFT, HOG and LBP, which limits the representation of data. In this paper, we propose a novel Sparse Autoencoder based...

chapter

An image based approach for speech perception

Nguyen Quang Trung, Bui The Duy, Ma Thi Chau

2015 2nd National Foundation for Science and Technology Development Conference on Information and Computer Science (NICS) > 208 - 213

Classification of speech signals is one of the most vital problems in speech perception and spoken word recognition. Although there have been many studies on the classification of speech signals, the results are still limited. In this paper, we propose an image based approach for speech signal classification based on the combination of Local Naïve Bayes Nearest Neighbor (LNBNN) and Scale-invariant...

chapter

An application of KL transform in feature extraction and selection for polyp differentiation via CT colonography

Yifan Hu, Bowen Song, Ming Ma, Zhengrong Liang

2014 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC) > 1 - 5

The main task of computer-aided diagnosis (CADx) is to differentiate the pathological stages to which each detected colorectal lesion belongs, especially to differentiate hyperplastic polyps, which are non-neoplastic and seldom show malignant potential, from neoplastic lesions, which are malignant or at risk for malignant transformation. If we could extract useful pattern information from detected...

chapter

A Tibetan Component Representation Learning Method for Online Handwritten Tibetan Character Recognition

Long-Long Ma, Jian Wu

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 317 - 322

This paper presents a Tibetan component representation learning method for component-based online handwritten Tibetan character recognition. In conventional methods, we designed features manually for Tibetan components. The hand-crafted features are often incomplete and decrease the component recognition accuracy, which influences component-based character recognition performance. To overcome the...

chapter

How does the shape descriptor measure the perceptual quality of the retargeting image?

Lin Ma, Long Xu, Huanqiang Zeng, King N. Ngan, et al.

2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) > 1 - 6

Perceptual quality evaluation of the retargeting image plays an important role in benchmarking different retargeting methods, as well as guiding or optimizing the retargeting process. The distortions introduced during the retargeting process are mainly categorized into shape distortion and content information loss [1]. The shape distortion measurement is critical to the evaluation of retargeting image...

chapter

Traffic sign recognition based on kernel sparse representation

Rui Wang, Guoqiang Xie, Junli Chen, Xiuli Ma, et al.

2014 International Conference on Audio, Language and Image Processing (ICALIP) > 386 - 389

This paper proposes a novel approach based on scale invariant feature transform (SIFT) and kernel sparse representation for traffic sign recognition in complex traffic scenes. This module consists of several steps. In the first stage, SIFT is introduced for feature extraction from samples and test targets, respectively. The features are mapped to the kernel space. In the second stage, we construct...

chapter

Triangulation-Based Singer Identification for Duet Music Data Indexing

Wei-Ho Tsai, Cin-Hao Ma

2014 IEEE International Congress on Big Data (BigData Congress) > 270 - 275

This study proposes a system to automatically identify multiple singers in a long audio stream that may have singing voices overlapping in time. The system is of great help in handling the rapid proliferation of music data. To achieve this, an audio stream is segmented into a sequence of consecutive, non-overlapping, fixed-length clips using a sliding window, and then undergoes solo/duet recognition, single...

chapter

Semi-automatic Tibetan Component Annotation from Online Handwritten Tibetan Character Database by Optimizing Segmentation Hypotheses

Long-Long Ma, Jian Wu

2013 12th International Conference on Document Analysis and Recognition (ICDAR) > 1340 - 1344

One of the important steps in the hybrid statistical-structural recognition method for handwritten characters is to label primitives for classifier training and to label structural position information for structural recognition. In this paper, we propose a semi-automatic component (primitive) annotation method for an online handwritten Tibetan character database. All samples of each character class are over-segmented...

chapter

Statistical formant descriptors with linear predictive coefficients for accent classification

Yusnita Ma, Paulraj Mp, Sazali Yaacob, Shahriman Ab, et al.

2013 IEEE 8th Conference on Industrial Electronics and Applications (ICIEA) > 906 - 911

Accent is a special trait of human speech that can deliver some information about a speaker's background. At the same time, it is one of the profound factors that affect the intelligibility and performance of speech recognition systems (ASRs) if not delicately handled. Normally, an accent recognizer in the preceding stage offers a subsystem training or adaptation strategy to improve the ASRs. Formant analysis...

chapter

3D Facial Expression Recognition Based on Encoded Templates

Yiding Wang, Xiaolei Ma

2012 Symposium on Photonics and Optoelectronics (SOPO 2012) > 1 - 4

In this paper, we propose a method for 3D facial expression recognition. The algorithm is composed of three steps. The first step is to extract the 3D face region of interest; some data preprocessing, including face location, point cloud rotation and uniform distribution of the point cloud, is done in this step. The second step is feature extraction; novel features are extracted...

chapter

Perceptual similarity based robust low-complexity video fingerprinting

Karthikeyan Shanmuga Vadivel, Felix Fernandes, Zhan Ma, PoLin Lai, et al.

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1337 - 1340

In this paper, we present a novel video fingerprinting algorithm which leverages the concept of perceptual similarity between different video sequences. Inspired by the popular structural similarity (SSIM) index, we quantify the perceptual similarity between different video sequences by proposing a perceptual distance metric (PDM) which is utilized in the matching stage of our proposed video fingerprinting...

INFONA - science communication portal

Search results for: Ma

Trending topic discovery of Twitter Tweets using clustering and topic modeling algorithms
