Search results

Items from 21 to 40 out of 58,537 results

chapter

Deep Attribute Driven Image Similarity Learning Using Limited Data

Nitin Gupta, Ankush Gupta, Vikas Joshi, L. Venkata Subramaniam, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 146 - 153

In this work, we propose to derive the attribute specific similarity score for a pair of images using an existing parent deep model. As an example, given two facial images, we derive a similarity score for attributes like gender and complexion using an existing face recognition model. It is not always feasible to train a new model for each attribute, as training of deep neural network based model...

chapter

Towards Efficient 3D Pose Retrieval and Reconstruction from 2D Landmarks

Hashim Yasin

2017 IEEE International Symposium on Multimedia (ISM) > 169 - 176

In this paper, we address the highly challenging task of recovering the 3D human pose from a single monocular image, which may be a synthetic image or a real internet image. Retrieval and reconstruction of the articulated 3D pose are both prerequisites for analyzing people in images and videos. We address both tasks together and propose an efficient framework for search & retrieval...

chapter

An Iterative Feature-Pair Updating Framework for Rigid Template Matching with Outliers

Yang Yang, Qian Kou, Shaoyi Du, Shuang Luo, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 200 - 207

To deal with the rigid template matching problem in real-world scenarios, we propose a novel iterative feature-pair updating framework that is also robust to high levels of outliers caused by, for example, background changes, complex nonrigid deformation, and partial occlusion. Given a template image and a target image, we first extract a set of corresponding feature-pairs as candidates. Then, we propose...

chapter

A Video Shot Boundary Detection Approach Based on CNN Feature

Rui Liang, Qingxin Zhu, Honglei Wei, Shujiao Liao

2017 IEEE International Symposium on Multimedia (ISM) > 489 - 494

With the development of digital photographic technology, the volume of video files is growing rapidly, and there is great demand for automatic video semantic analysis in many scenarios, such as video semantic understanding, content-based analysis, and video retrieval. Shot boundary detection is a key basic technology and the first step of video analysis. However, recent methods are time-consuming and perform poorly in the...

chapter

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 495 - 499

Kotenseki is a collection of classical and ancient Japanese literature. It is comprised of image books that express Japanese stories by using comic drawings of different characters, such as humans, nature, and animals. To effectively store them for posterity, a search system is important. We propose an efficient CBIR system to assist the users in easily accessing the information and have an enjoyable...

chapter

Cross-Modal Transfer Learning for HEp-2 Cell Classification Based on Deep Residual Network

Haijun Lei, Tao Han, Weifeng Huang, Jong Yih Kuo, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 465 - 468

Accurate Human Epithelial-2 (HEp-2) cell image classification plays an important role in the diagnosis of many autoimmune diseases. However, the traditional approach requires experienced experts to manually identify cell patterns, which greatly increases the workload and suffers from the subjective opinion of the physician. To address this, we propose a very deep residual network (ResNet) based framework...

chapter

Hand Gesture Recognition Based on Wavelet Invariant Moments

Xi Liu, Chen Li, Lihua Tian

2017 IEEE International Symposium on Multimedia (ISM) > 459 - 464

In this paper, a new method of hand gesture recognition is proposed. First, the hand region is separated based on the depth information. Then the wavelet feature is calculated by enforcing the wavelet invariant moments of the hand region, and the distance feature is extracted by calculating the distance from fingers to hand centroid. Next, a feature vector which is composed of wavelet invariant moments...

chapter

A New Multimedia Documents Clustering Approach Based on Feature Patterns Similarity

Pushpalatha K, Ananthanarayana V. S.

2017 IEEE International Symposium on Multimedia (ISM) > 296 - 299

With the rapid advances in digital technology, multimedia documents have become ubiquitous. Analyzing this huge repository of multimedia documents requires organizing the documents efficiently. Multimedia document clustering groups multimedia documents that share common topics. An important step of multimedia document clustering is computing the similarity between multimedia...

chapter

Robust and Fast Object Tracking for Challenging 360-degree Videos

Ahmad Delforouzi, Marcin Grzegorzek

2017 IEEE International Symposium on Multimedia (ISM) > 274 - 277

The task of object tracking in rectangular videos has been addressed in recent years by many researchers, with each method proposing a solution to a specific challenge. Handling the variety of challenging situations that arise in object tracking for 360-degree videos is still an unsolved problem and needs further attention. In the real world, the challenging situations include moving camera, high-resolution...

chapter

Adaptive Sparse Learning for Neurodegenerative Disease Classification

Haijun Lei, Yujia Zhao, Yuting Wen, Baiying Lei

2017 IEEE International Symposium on Multimedia (ISM) > 292 - 295

This paper proposes an adaptive sparse learning (ASL) framework to solve the multi-classification problem for neurodegenerative disease analysis. Specifically, we integrate the ideas of feature selection and subspace learning to construct a least square regression model. The principles of Fisher's linear discriminant analysis (LDA) and locality preserving projection (LPP) are incorporated to utilize...

chapter

Mining Culture-Specific Music Listening Behavior from Social Media Data

Martin Pichl, Eva Zangerle, Gunther Specht, Markus Schedl

2017 IEEE International Symposium on Multimedia (ISM) > 208 - 215

Incorporating user characteristics and contextual information has been shown to be essential when it comes to personalized music retrieval and recommendation. To this end, the current location of a user is often exploited. However, relying solely on GPS coordinates neglects the cultural background of users, which does not necessarily coincide with political borders. In this paper, we analyze culture-specific...

chapter

Enhancing Effectiveness of Descriptors for Searching and Recognition in Motion Capture Data

Jan Sedmidubsky, Petr Elias, Pavel Zezula

2017 IEEE International Symposium on Multimedia (ISM) > 240 - 243

Computer-aided analyses of motion capture data require an effective and efficient concept of motion similarity. Traditional methods generally compare motion sequences by applying time-warping techniques to high-dimensional trajectories of joints. The increasing effectiveness of machine-learning techniques, such as deep convolutional neural networks, brings new possibilities for similarity comparison...

chapter

Estimation of Optimal Encoding Ladders for Tiled 360° VR Video in Adaptive Streaming Systems

Cagri Ozcinar, Ana De Abreu, Sebastian Knorr, Aljosa Smolic

2017 IEEE International Symposium on Multimedia (ISM) > 45 - 52

Given the significant growth in industrial demand for virtual reality (VR), 360° video streaming is one of the most important VR applications that require cost-optimal solutions to achieve widespread proliferation of VR technology. Because of the inherent variability of its data-intensive content types and its tile-based encoding and streaming, 360° video requires new encoding ladders in adaptive streaming...

chapter

Spatio-Temporal Compositing of Video Elements for Immersive eLearning Classrooms

Uma Gopalakrishnan, P. Venkat Rangan, Ramkumar N, Balaji Hariharan

2017 IEEE International Symposium on Multimedia (ISM) > 138 - 145

Current live eLearning systems enable remote students to view the teaching environment, which comprises several information sources such as the teacher and the teaching aids. These information sources are presented as individual video and audio elements. As a result, spatial connections between these elements, such as the teacher using hand gestures to point to an area on the screen, become meaningless...

chapter

A Real-Time Annotation of Motion Data Streams

Petr Elias, Jan Sedmidubsky, Pavel Zezula

2017 IEEE International Symposium on Multimedia (ISM) > 154 - 161

Current motion-capture technologies produce continuous streams of 3D human joint trajectories. One of the challenges is to automatically annotate such streams of complex spatio-temporal data in real time. In this paper, we propose an efficient approach to label motion stream data in real time with a limited usage of main memory. Based on a set of user-defined motion profiles, each of them specified...

chapter

Kara1k: A Karaoke Dataset for Cover Song Identification and Singing Voice Analysis

Yann Bayle, Ladislav Marsik, Martin Rusek, Matthias Robine, et al.

2017 IEEE International Symposium on Multimedia (ISM) > 177 - 184

We introduce Kara1k, a new musical dataset composed of 2,000 analyzed songs, made possible by a partnership with a karaoke company. The dataset is divided into 1,000 cover songs provided by the Recisio Karafun application, and the corresponding 1,000 songs by the original artists. Kara1k is mainly dedicated to cover song identification and singing voice analysis. For both tasks, it offers novel approaches,...

chapter

Influence of Video Quality on Multi-view Activity Recognition

Jun-Ho Choi, Manri Cheon, Jong-Seok Lee

2017 IEEE International Symposium on Multimedia (ISM) > 511 - 515

This paper presents a study that evaluates the performance of multi-view human activity recognition with videos having degraded quality. For the activity recognition models, a support vector machine-based approach using spatiotemporal features and a deep learning-based approach using convolutional and recurrent layers are built. We investigate the recognition performance of the two models with respect...

chapter

Detection and Visualization of Arabic Emotions on Social Emotion Map

Mohammed F. Alhamid, Saad Alsahli, Majdi Rawashdeh, Mubarak Alrashoud

2017 IEEE International Symposium on Multimedia (ISM) > 378 - 381

In the context of smart cities and the Internet of Things (IoT), much of the trending content on social networks reflects the state of a community or its interests. In this paper, we propose a model that automatically collects and analyzes trending social data. The model explores trending content, the overall attitude of textual content, and the relationships among...

chapter

A Fast Video Shot Boundary Detection Employing OTSU’s Method and Dual Pauta Criterion

Zhe Yang, Lihua Tian, Chen Li

2017 IEEE International Symposium on Multimedia (ISM) > 583 - 586

Video shot boundary detection is a fundamental step towards video information processing in e-learning scenarios. In the field of shot boundary detection, it remains difficult to choose suitable thresholds for different videos, and empirical thresholds usually lead to low precision. Thus, we propose an original method that generates a video-specific threshold calculated from the video itself...

chapter

Fast Binary Descriptor Search for Keypoint Matching by Norm Ordering

Masahiko Sugimura, Takayuki Baba, Ryuta Tanaka

2017 IEEE International Symposium on Multimedia (ISM) > 561 - 566

Keypoint matching between images is an important technique for computer vision applications such as image retrieval. Although binary feature descriptors such as BRIEF enable fast measurement of distance, exhaustive search is still time-consuming. Hashing methods such as Locality Sensitive Hashing (LSH), while effective at accelerating search, result in large memory consumption and thus are...

Keywords:
FEATURE EXTRACTION

Content availability

Available (57,602)
None (935)

Keywords

TRAINING (9,934)
DATA MINING (7,451)
SUPPORT VECTOR MACHINES (7,401)
IMAGE SEGMENTATION (6,157)
ACCURACY (5,775)
DATABASES (5,161)
IMAGE COLOR ANALYSIS (4,885)
CLASSIFICATION ALGORITHMS (4,155)
PIXEL (4,025)
VISUALIZATION (3,855)
FACE RECOGNITION (3,805)
HISTOGRAMS (3,761)
FACE (3,742)
IMAGE CLASSIFICATION (3,672)
CAMERAS (3,642)
SHAPE (3,495)
IMAGE EDGE DETECTION (3,237)
PRINCIPAL COMPONENT ANALYSIS (3,175)
ROBUSTNESS (2,858)
ALGORITHM DESIGN AND ANALYSIS (2,694)
WAVELET TRANSFORMS (2,658)
IMAGE RECOGNITION (2,550)
ARTIFICIAL NEURAL NETWORKS (2,542)
SPEECH (2,541)
OBJECT DETECTION (2,540)
KERNEL (2,516)
HIDDEN MARKOV MODELS (2,404)
COMPUTER VISION (2,387)
VECTORS (2,370)
COMPUTATIONAL MODELING (2,369)
TRANSFORMS (2,161)
CORRELATION (2,099)
DETECTORS (2,028)
PATTERN RECOGNITION (2,020)
IMAGE MATCHING (2,019)
MACHINE LEARNING (1,958)
NOISE (1,955)
MEDICAL IMAGE PROCESSING (1,949)
IMAGE RETRIEVAL (1,902)
LEARNING (ARTIFICIAL INTELLIGENCE) (1,833)
ELECTROENCEPHALOGRAPHY (1,821)
SUPPORT VECTOR MACHINE (1,789)
MATHEMATICAL MODEL (1,780)
CLASSIFICATION (1,779)
ESTIMATION (1,748)
IMAGE PROCESSING (1,707)
FEATURE SELECTION (1,678)
PATTERN CLASSIFICATION (1,640)
NEURAL NETWORKS (1,625)
SPEECH RECOGNITION (1,621)
IMAGE TEXTURE (1,600)
IMAGE COLOUR ANALYSIS (1,585)
CLUSTERING ALGORITHMS (1,584)
TESTING (1,581)
HUMANS (1,541)
SEMANTICS (1,530)
NEURAL NETS (1,485)
IMAGE RESOLUTION (1,474)
ENTROPY (1,382)
OBJECT RECOGNITION (1,358)
SVM (1,348)
THREE DIMENSIONAL DISPLAYS (1,333)
CONFERENCES (1,321)
EDGE DETECTION (1,267)
VIDEO SIGNAL PROCESSING (1,260)
THREE-DIMENSIONAL DISPLAYS (1,258)
REMOTE SENSING (1,254)
IMAGE SEQUENCES (1,216)
IMAGE RECONSTRUCTION (1,208)
MONITORING (1,205)
LIGHTING (1,182)
DATA MODELS (1,169)
IMAGE CODING (1,154)
MEDICAL SIGNAL PROCESSING (1,142)
STATISTICAL ANALYSIS (1,135)
INDEXES (1,130)
VEHICLES (1,121)
PATTERN CLUSTERING (1,084)
DISEASES (1,059)
SENSORS (1,044)
IMAGE REPRESENTATION (1,037)
DISCRETE WAVELET TRANSFORMS (1,036)
EDUCATIONAL INSTITUTIONS (1,035)
CHARACTER RECOGNITION (1,034)
BIOMETRICS (ACCESS CONTROL) (1,030)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1,030)
OPTIMIZATION (1,025)
BIOMEDICAL IMAGING (984)
GABOR FILTERS (972)
TEXT ANALYSIS (972)
SIGNAL PROCESSING (968)
IMAGE MOTION ANALYSIS (959)
SUPPORT VECTOR MACHINE CLASSIFICATION (955)
CONTEXT (947)
TRACKING (939)
EQUATIONS (934)
EMOTION RECOGNITION (921)
ELECTROCARDIOGRAPHY (916)
NEURONS (910)

Data set

ieee (58,194)
Springer (335)
Wiley (8)

INFONA - science communication portal
