Search results

chapter

Poster abstract: MicroBrain: Compressing deep neural networks for energy-efficient visual inference service

Shiming Ge, Zhao Luo, Qiting Ye, Xiao-Yu Zhang

2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) > 1000 - 1001

IEEE INFOCOM 2017 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

The deployment of deep neural network models on mobile or embedded devices has been hindered by their large number of weights. In this work, we develop a deep neural network (DNN) model compression service termed MicroBrain to reduce the resource usage for energy-efficient visual inference. By automatically analyzing the trained DNN models, we propose a high-performance DNN model compression...

chapter

Real-time EEG-based person authentication system using face rapid serial visual presentation

Qunjian Wu, Ying Zeng, Zhimin Lin, Xiaojuan Wang, et al.

2017 8th International IEEE/EMBS Conference on Neural Engineering (NER) > 564 - 567

As a new biometric, the electroencephalogram (EEG) signal has the advantages of invisibility, non-clonability, and non-coercion compared to traditional biometrics. However, real-time performance and stability are difficulties that current EEG-based person authentication systems face. In this paper, we design a real-time and stable person authentication system using EEG signals, which are elicited by...

chapter

A real-time visual tracking technique for mobile display stabilization

Hsu-Fu Hsiao, Jyh-Da Wei

2017 International Conference on Applied System Innovation (ICASI) > 440 - 442

Mobile devices can cause visual discomfort and even injuries to the eyes owing to shaking and vibration. In this study, we aim to develop a real-time visual tracking technique for mobile display stabilization in order to provide users with a comfortable interaction experience and reduce the harm caused by screen vibration. The workflow of this mechanism includes tracking the motion of the mobile device,...

chapter

A self-organizing model for affective memory

Pablo Barros, Stefan Wermter

2017 International Joint Conference on Neural Networks (IJCNN) > 31 - 38

Emotions are related to many different parts of our lives: from the perception of the environment around us to different learning processes and natural communication. Therefore, it is very hard to achieve an automatic emotion recognition system which is adaptable enough to be used in real-world scenarios. This paper proposes the use of a growing and self-organizing affective memory architecture to...

chapter

Category-selective top-down modulation in the fusiform face area of the human brain during visual search

Salman Ul Hassan Dar, Tolga Cukur

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Several regions in the ventral-temporal cortex of the human brain are thought to have representations of specific categories of objects. Furthermore, a distributed network of frontal and parietal brain regions is implicated in attentional control. It is assumed that during visual search, attention-control regions send top-down signals to the target category-selective areas to bias the processing in...

chapter

Amazigh audiovisual speech recognition system design

Ilham Addarrazi, Hassan Satori, Khalid Satori

2017 Intelligent Systems and Computer Vision (ISCV) > 1 - 5

It is well known that speech recognition is a multimodal process which uses information not only from audio but also from vision. This paper describes our experience in designing an audiovisual speech recognition system, which relates the acoustic and the visual information in order to improve the noise robustness of automatic speech recognition. The accuracy rate for face and mouth detection using Viola-Jones...

chapter

Piloting mobile mixed reality simulation in paramedic distance education

James Birt, Emma Moore, Michael A. Cowling

2017 IEEE 5th International Conference on Serious Games and Applications for Health (SeGAH) > 1 - 8

New pedagogical methods delivered through mobile mixed reality (via a user-supplied mobile phone incorporating 3D printing and augmented reality) are becoming possible in distance education, shifting pedagogy from 2D images, words and videos to interactive simulations and immersive mobile skill training environments. This paper presents insights from the implementation and testing of a mobile mixed...

chapter

Manually annotated characteristic descriptors: Measurability and variability

Chris Zeinstra, Raymond Veldhuis, Luuk Spreeuwers, Arnout Ruifrok

2017 5th International Workshop on Biometrics and Forensics (IWBF) > 1 - 6

In this paper we study the measurability and variability of manually annotated characteristic descriptors on a forensically relevant face dataset. Characteristic descriptors are facial features (landmarks, shapes, etc.) that can be used during forensic case work. With respect to measurability, we observe that a significant proportion cannot be determined in images representative of forensic case work...

chapter

A joint learning based Face Super Resolution approach via contextual topological structure

Liang Chen, Ruimin Hu, Zhen Han, Zhongyuan Wang, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1088 - 1092

Face Super Resolution (FSR) aims to infer High Resolution (HR) facial images from given Low Resolution (LR) ones with the assistance of LR and HR training pairs. Among existing methods, local patch based methods are superior in visual and objective quality to global methods. These local patch based methods rely on the consistency assumption that the neighbors in HR/LR space form similar local...

chapter

Power-law stochastic neighbor embedding

Huan-Hsin Tseng, Issam El Naqa, Jen-Tzung Chien

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2347 - 2351

Stochastic neighbor embedding (SNE) aims to transform the observations in high-dimensional space into a low-dimensional space which preserves neighbor identities by minimizing the Kullback-Leibler divergence of the pairwise distributions between two spaces where Gaussian distributions are assumed. Data visualization could be improved by adopting the t-SNE where Student t distribution is used in the...

chapter

Laplace gradient based Discriminative and Contrast Invertible descriptor

Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1842 - 1846

The performance of local descriptors such as SIFT drops under severe illumination changes. In this paper, we propose a Discriminative and Contrast Invertible (DCI) local feature descriptor. In order to increase the discriminative ability of the descriptor under illumination changes, a Laplace gradient based histogram is proposed. Moreover, a robust contrast flipping estimate is proposed based on the...

chapter

Vid2speech: Speech reconstruction from silent video

Ariel Ephrat, Shmuel Peleg

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5095 - 5099

Speechreading is a notoriously difficult task for humans to perform. In this paper we present an end-to-end model based on a convolutional neural network (CNN) for generating an intelligible acoustic speech signal from silent video frames of a speaking person. The proposed CNN generates sound features for each frame based on its neighboring frames. Waveforms are then synthesized from the learned speech...

chapter

Semantic Text Summarization of Long Videos

Shagan Sah, Sourabh Kulhare, Allison Gray, Subhashini Venugopalan, et al.

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 989 - 997

Long videos captured by consumers are typically tied to some of the most important moments of their lives, yet ironically are often the least frequently watched. The time required to initially retrieve and watch sections can be daunting. In this work we propose novel techniques for summarizing and annotating long videos. Existing video summarization techniques focus exclusively on identifying keyframes...

chapter

An intelligent digital system for visually impaired person (vip's)

M. Preetha, K. Elavarasi, K. Ramyadevi

2017 International Conference on Information Communication and Embedded Systems (ICICES) > 1 - 4

This paper addresses the problems faced by visually impaired persons. We designed a device to help visually impaired persons handle problems in their environment. They face difficulties in independently accessing public transport since they cannot read the route number and are unsure about the physical location of the bus, have trouble identifying people, and also find difficulty...

chapter

XJU1: A Chinese Ethnic Minorities Face Database

Hang Zuo, Liejun Wang, Jiwei Qin

2017 International Conference on Machine Vision and Information Technology (CMVIT) > 7 - 11

Due to factors such as expression, pose, illumination and accessory variation, human faces appear different on different occasions. Determining the efficiency of different face recognition algorithms requires benchmark face images. In this paper, we present a comprehensive study of the available 2D face databases and also introduce the creation of a visual face database, Xinjiang...

chapter

Face recognition system using bag of features and multi-class SVM for robot applications

Salah Nasr, Kais Bouallegue, Muhammad Shoaib, Hassen Mekki

2017 International Conference on Control, Automation and Diagnosis (ICCAD) > 263 - 268

A face recognition system is used for the identification and verification of a face from a video or digital image. In the first phase, the Viola-Jones algorithm is used to detect and crop the face region automatically from an image or video frame. The second phase is to recognize the face of a person; in our proposed method, the Bag of Words technique is used to extract features from an image, which uses SURF for interest...

chapter

A template-projection approach to decode higher-order vision in realtime and at the perceptual threshold

Kai J Miller, Dora Hermes

2017 5th International Winter Conference on Brain-Computer Interface (BCI) > 30 - 35

The link between object perception and neural activity in visual cortical areas is a problem of fundamental importance in neuroscience. We measured brain surface physiology with implanted electrocorticography (ECoG) electrodes in humans. Physiological responses to visual stimuli in object-specific ventral temporal loci are highly polymorphic in different cortical loci, for both broadband and raw potential...

chapter

Effective and efficient visual stimuli design for quantitative autism screening: An exploratory study

Tri Vu, Hoan Tran, Kun Woo Cho, Chen Song, et al.

2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI) > 297 - 300

Autism spectrum disorder (ASD) is one of the most common childhood developmental disorders. Early detection and intervention for ASD are critical for increasing child success. In the past decade, utilizing the abnormal eye gaze characteristics of children with autism in response to certain visual stimuli has been emerging as a screening approach due to its cost-efficiency and promising accuracy. However,...

chapter

Detecting falling people by autonomous service robots: A ROS module integration approach

Sergio Hernandez-Mendez, Carolina Maldonado-Mendez, Antonio Marin-Hernandez, Homero Vladimir Rios-Figueroa

2017 International Conference on Electronics, Communications and Computers (CONIELECOMP) > 1 - 7

This paper presents the integration of diverse modules for the detection of fallen people by a mobile service robot. This integration has been achieved in the ROS (Robot Operating System) middleware. The proposed implementation is arranged over a modular architecture of three layers: Hardware, Processing and Decision. The modules implemented are on the processing layer. The first module uses...

chapter

An overview of Multimodal Sentiment Analysis research: Opportunities and Difficulties

Mohammad Aman Ullah, Md. Monirul Islam, Norhidayah Binti Azman, Zulkifly Mohd Zaki

2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR) > 1 - 6

Multimedia data such as text, images, audio, and video, posted regularly and in scattered form on social media, may contain useful information for organizations. However, this information must be derived through a form of analysis known as Multimodal Sentiment Analysis (MSA), and proper analytic tools for such analysis are lacking. This paper presents a thorough overview of more...

INFONA - science communication portal
