Search results

chapter

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, more

2017 IEEE International Symposium on Multimedia (ISM) > 495 - 499

2017 IEEE International Symposium on Multimedia (ISM)

Kotenseki is a collection of classical and ancient Japanese literature. It is comprised of image books that express Japanese stories by using comic drawings of different characters, such as humans, nature, and animals. To effectively store them for posterity, a search system is important. We propose an efficient CBIR system to assist the users in easily accessing the information and have an enjoyable...

chapter

360° view camera based visual assistive technology for contextual scene information

Mazin Ali, Ferat Sahin, Shitij Kumar, Celal Savur

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2135 - 2140

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

In this paper, a system to aid the visually impaired by providing contextual information about the surroundings using a 360° view camera combined with deep learning is proposed. The system uses a 360° view camera with a mobile device to capture surrounding scene information and provide contextual information to the user in the form of audio. The scene information from the spherical camera feed is classified...

chapter

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Lili Pan, Samira Pouyanfar, Hao Chen, Jiaohua Qin, more

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC) > 181 - 189

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)

Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has addressed the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood, which not only extracts rich and effective features...

chapter

Identification of autonomous landing sign for unmanned aerial vehicle based on faster regions with convolutional neural network

Junjie Chen, Xiren Miao, Hao Jiang, Jing Chen, more

2017 Chinese Automation Congress (CAC) > 2109 - 2114

2017 Chinese Automation Congress (CAC)

To realize autonomous landing of unmanned aerial vehicles (UAVs) in power patrolling, a vision-based method built on Faster Regions with Convolutional Neural Network (Faster R-CNN) is studied. In this paper, we design a landing sign that combines concentric circles and a pentagon, and propose a Faster R-CNN recognition algorithm which can be used to identify the target...

chapter

An improved dropout method and its application into DBN-based handwriting recognition

Guangzheng Hu, Huifang Li, Lixuan Luo, Yuanqing Xia

2017 36th Chinese Control Conference (CCC) > 11145 - 11149

2017 36th Chinese Control Conference (CCC)

As typical deep learning methods, the Deep Belief Network (DBN) and the Dropout method are usually used together for pattern recognition when training data is scarce. Dropout training can avoid overfitting in deep neural networks. During the testing stage, the outputs of all neurons in hidden layers are multiplied by the same factor as their actual outputs in the original Dropout method....

chapter

Robust and real-time deep tracking via multi-scale domain adaptation

Xinyu Wang, Hanxi Li, Yi Li, Fumin Shen, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1338 - 1343

2017 IEEE International Conference on Multimedia and Expo (ICME)

Visual tracking is a fundamental problem in computer vision. Recently, some deep-learning-based tracking algorithms have been achieving record-breaking performances. However, due to the high complexity of deep learning, most deep trackers suffer from low tracking speed, and thus are impractical in many real-world applications. Some new deep trackers with smaller network structure achieve high efficiency...

chapter

Detecting Chinese calligraphy style consistency by deep learning and one-class SVM

Zhang Jiulong, Guo Luming, Yang Su, Sun Xudong, more

2017 2nd International Conference on Image, Vision and Computing (ICIVC) > 83 - 86

2017 2nd International Conference on Image, Vision and Computing (ICIVC)

When beginners practice Chinese calligraphy, they often copy from ancient calligraphic works and try to imitate the style as closely as possible. However, there are inevitably some characters whose styles are not correctly followed. Thus, we are motivated to detect the style consistency of all written characters in one practice. With the styles extracted by using stacked autoencoders of deep neural...

chapter

Human action recognition using transfer learning with deep representations

Allah Bux Sargano, Xiaofeng Wang, Plamen Angelov, Zulfiqar Habib

2017 International Joint Conference on Neural Networks (IJCNN) > 463 - 469

2017 International Joint Conference on Neural Networks (IJCNN)

Human action recognition is an imperative research area in the field of computer vision due to its numerous applications. Recently, with the emergence and successful deployment of deep learning techniques for image classification, object recognition, and speech recognition, more research is directed from traditional handcrafted to deep learning techniques. This paper presents a novel method for human...

chapter

Predicting the popularity of instagram posts for a lifestyle magazine using deep learning

Shaunak De, Abhishek Maity, Vritti Goel, Sanjay Shitole, more

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA) > 174 - 177

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA)

In this paper we use a Deep Neural Network (DNN) trained on data collected from the visual media-sharing social platform Instagram account of a popular Indian lifestyle magazine to predict the popularity of future posts. This predicted popularity of the post can be used to decide advertising rates and measure performance metrics important for publishing strategy decisions. The DNN primarily uses growth...

chapter

Image describing based on bidirectional LSTM and improved sequence sampling

Ji Li, Yongfei Shen

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA) > 735 - 739

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)

Motivated by the strong performance of recurrent neural networks in machine translation, researchers have begun to pay attention to image description with related deep learning methods. A recurrent neural network cannot remember long-term information, but Long Short-Term Memory (LSTM) can handle this well. However, the LSTM applied to image description to predict sentences in previous literature [1] can...

chapter

Visual features for context-aware speech recognition

Abhinav Gupta, Yajie Miao, Leonardo Neves, Florian Metze

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5020 - 5024

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Automatic transcriptions of consumer-generated multimedia content such as “YouTube” videos still exhibit high word error rates. Such data typically occupies a very broad domain, has been recorded in challenging conditions, with cheap hardware and a focus on the visual modality, and may have been post-processed or edited.

chapter

Finetuning Convolutional Neural Networks for visual aesthetics

Yeqing Wang, Yi Li, Fatih Porikli

2016 23rd International Conference on Pattern Recognition (ICPR) > 3554 - 3559

2016 23rd International Conference on Pattern Recognition (ICPR)

Inferring the aesthetic quality of images is a challenging computer vision task due to its subjective and conceptual nature. Most image aesthetics evaluation approaches focused on designing handcrafted features, and only a few adopted learning of relevant and imperative characteristics in a data-driven manner. In this paper, we propose to attune Convolutional Neural Networks (CNNs) for image aesthetics...

chapter

Traffic sign recognition with convolutional neural network based on max pooling positions

Rongqiang Qian, Yong Yue, Frans Coenen, Bailing Zhang

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 578 - 582

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

Recognition of traffic signs is very important in many applications, such as self-driving (driverless) cars, traffic mapping, and traffic surveillance. Recently, deep learning models have demonstrated prominent representation capacity and achieved outstanding performance in traffic sign recognition. In this paper, we propose a traffic sign recognition system by applying convolutional neural network...

chapter

Scene classification of high resolution remote sensing images using convolutional neural networks

Gong Cheng, Chengcheng Ma, Peicheng Zhou, Xiwen Yao, more

2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 767 - 770

IGARSS 2016 - 2016 IEEE International Geoscience and Remote Sensing Symposium

Scene classification of high resolution remote sensing images plays an important role for a wide range of applications. While significant efforts have been made in developing various methods for scene classification, most of them are based on handcrafted or shallow learning-based features. In this paper, we investigate the use of deep convolutional neural network (CNN) for scene classification. To...

chapter

Traffic sign recognition using visual attribute learning and convolutional neural network

Rong-Qiang Qian, Yong Yue, Frans Coenen, Bai-Ling Zhang

2016 International Conference on Machine Learning and Cybernetics (ICMLC) > 386 - 391

2016 International Conference on Machine Learning and Cybernetics (ICMLC)

The problem of extracting high-level information from digital images and videos is frequently faced in the areas of computer vision and machine learning. For the recognition of traffic signs, many outstanding methods have been proposed, and deep models demonstrate that their powerful representation capacity can achieve dominant performance. In this paper a method for recognizing traffic signs...

chapter

Structured output tracking with deep neural network and optical flow

Youngjoo Jo, Jun-Cheol Park, Dae-Shik Kim

2016 2nd International Conference on Control, Automation and Robotics (ICCAR) > 350 - 356

2016 2nd International Conference on Control, Automation and Robotics (ICCAR)

Deep learning with neural networks works well on visual recognition and classification tasks, and it can extract strong features of an image for classification. Recently, many approaches have used these characteristics to study visual tracking in two ways. First, they can regard the tracking problem as classifying each video and frame by learning the whole dataset. Second, use the deep neural network...

chapter

Pedestrian recognition method based on depth hierarchical feature representation

Rui Sun, Guang-Hai Zhang, Jun Gao

2015 12th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) > 173 - 178

2015 12th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

For feature representation of pedestrian recognition, a hybrid hierarchical feature representation method which combines representation ability of bag of words model and depth layered with learning adaptability is presented. This method first uses HOG local descriptor for local features extraction, and then encoding the feature by a depth of layered coding method, the layered coding method by spatial...

chapter

Recognizing objectionable images using convolutional neural nets

Reza Moradi, Rahman Yousefzadeh

2015 Signal Processing and Intelligent Systems Conference (SPIS) > 133 - 137

2015 Signal Processing and Intelligent Systems Conference (SPIS)

In recent years, different methods for detecting objectionable images have been proposed. All of the previous systems are based on extracting certain pre-defined features from the images. In this paper, a method is proposed to detect objectionable images using convolutional neural networks. In this method, features are first learned through a sparse auto-encoder and then training is done by a...

chapter

Multimedia data mining using deep learning

Peter Wlodarczak, Jeffrey Soar, Mustafa Ally

2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC) > 190 - 196

2015 Fifth International Conference on Digital Information Processing and Communications (ICDIPC)

Due to the large amounts of Multimedia data on the Internet, Multimedia mining has become a very active area of research. Multimedia mining is a form of data mining. Data mining uses algorithms to segment data to identify useful patterns and to make predictions. Despite the successes in many areas, data mining remains a challenging task. In the past, multimedia mining was one of the fields where the...

chapter

Learning to Detect Saliency with Deep Structure

Yu Hu, Zenghai Chen, Zheru Chi, Hong Fu

2015 IEEE International Conference on Systems, Man, and Cybernetics > 1770 - 1775

2015 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Deep learning has shown great successes in solving various problems of computer vision. To the best of our knowledge, however, little existing work applies deep learning to saliency modeling. In this paper, a new saliency model based on convolutional neural network is proposed. The proposed model is able to produce a saliency map directly from an image's pixels. In the model, multi-level output values...

INFONA - science communication portal
