Search results

chapter

Augmented and virtual reality approaches to help with peripheral vision loss

Ola Younis, Waleed Al-Nuaimy, Majid A. Al-Taee, Ali Al-Ataby

2017 14th International Multi-Conference on Systems, Signals & Devices (SSD) > 303 - 307

Peripheral vision loss (also called tunnel vision) is one of the main visual field disorders; it can be very frustrating and can affect the patient's confidence and main activities. In this paper, two promising solutions for peripheral vision loss are presented and discussed. The first one uses optical see-through glasses that are augmented by computer-generated images to notify the user about...

chapter

End-to-end visual speech recognition with LSTMs

Stavros Petridis, Zuwei Li, Maja Pantic

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2592 - 2596

Traditional visual speech recognition systems consist of two stages, feature extraction and classification. Recently, several deep learning approaches have been presented which automatically extract features from the mouth images and aim to replace the feature extraction stage. However, research on joint learning of features and classification is very limited. In this work, we present an end-to-end...

chapter

Visual attention is captured by task-irrelevant faces, but not by pareidolia faces

Atsunori Ariga, Katsuhiko Arihara

2017 9th International Conference on Knowledge and Smart Technology (KST) > 266 - 269

It has been reported that visual attention is captured exogenously by faces. This study used pareidolia faces and examined whether subjective perception of a face is sufficient for capturing attention or the registry of an actual face is necessary for attentional capture. Three experiments demonstrated that a completely task-irrelevant face distractor captured attention exogenously, in turn disrupting...

chapter

Scalable Video-on-Demand Streaming for Heterogeneous Clients in Wireless Network

Li Zhou, Xiaohua Tian, Xiaoying Gan, Hui Yu, more

2016 12th International Conference on Mobile Ad-Hoc and Sensor Networks (MSN) > 39 - 44

Periodic broadcasting has achieved strong performance for VoD (Video on Demand) services in wired networks. However, the development of wireless VoD services has been slow and difficult in comparison with the rapid growth of mobile video services. In this paper, we propose a scalable video streaming method, HQOBA (Heterogeneous Quality-Oriented Bandwidth Allocation), to address the problems in wireless...

chapter

Story segmentation in TV news broadcast

Raghvendra Kannao, Prithwijit Guha

2016 23rd International Conference on Pattern Recognition (ICPR) > 2948 - 2953

Segmentation of TV news broadcast into semantically meaningful stories is an essential pre-requisite for a wide range of video analytics applications. In this work we have introduced a hybrid approach for news story segmentation based on conditional random fields (CRFs). The story boundary detection problem is converted into a shot classification problem by classifying video shots into either of the...

chapter

Streaming news image summarization

Hao Li, Shangfu Peng, Hanan Samet

2016 23rd International Conference on Pattern Recognition (ICPR) > 1279 - 1284

Automatic summarization of streaming news images is critical for efficient news browsing. Although image duplicates are redundant for news reading, the number of duplicates of a news image is a good indicator for its importance. We describe the architecture used in a news aggregation system for online streaming news image summarization. Given a sequence of images for a news topic, we first cluster...

chapter

ChaLearn Joint Contest on Multimedia Challenges Beyond Visual Analysis: An overview

Hugo Jair Escalante, Victor Ponce-Lopez, Jun Wan, Michael A. Riegler, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 67 - 73

This paper provides an overview of the Joint Contest on Multimedia Challenges Beyond Visual Analysis. We organized an academic competition that focused on four problems that require effective processing of multimodal information in order to be solved. Two tracks were devoted to gesture spotting and recognition from RGB-D video, two fundamental problems for human computer interaction. Another track...

chapter

Efficient large scale near-duplicate video detection base on spark

Jinna Lv, Bin Wu, Shuai Yang, Bingjing Jia, more

2016 IEEE International Conference on Big Data (Big Data) > 957 - 962

With the huge amount of web video data and its exponential growth in recent years, there are new challenges in Near-Duplicate Video Detection (NDVD) which have attracted much attention owing to its wide applications. One of the problems is how to extract discriminative features to achieve higher precision, and the other problem is how to improve the efficiency of large scale video analysis. Existing...

chapter

A systemic approach to automatic metadata extraction from multimedia content

Christos Varytimidis, Georgios Tsatiris, Konstantinos Rapantzikos, Stefanos Kollias

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 7

There is a need for automatically processing multimedia information and extracting meaningful metadata from it, especially in the audiovisual industry. This higher-level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis...

chapter

Audio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal view

Spyridon Thermos, Gerasimos Potamianos

2016 IEEE Spoken Language Technology Workshop (SLT) > 579 - 584

Motivated by the increasing popularity of depth visual sensors, such as the Kinect device, we investigate the utility of depth information in audio-visual speech activity detection. A two-subject scenario is assumed, which also allows speech overlap to be considered. Two sensory setups are employed, where depth video captures either a frontal or profile view of the subjects, and is subsequently combined with the...

chapter

Fast near-duplicate detection from image streams on online social media during disaster events

Ashish Kumar Layek, Akash Gupta, Saptarshi Ghosh, Sekhar Mandal

2016 IEEE Annual India Conference (INDICON) > 1 - 6

User-generated content on online social media (OSM) has several data mining applications, such as extracting useful information during disaster events. Since popular / important content is often re-posted by multiple people on OSM, identifying duplicate content is an important first step in many data mining applications. In this work, we develop a methodology to identify near-duplicate images posted...

chapter

Visual Big Data Analytics for Traffic Monitoring in Smart City

Dinesh Singh, C. Vishnu, C. Krishna Mohan

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 886 - 891

Applications such as video surveillance for traffic control in smart cities need to analyze large amounts (hours or days) of video footage in order to locate people who violate traffic rules. Traditional computer vision techniques are unable to analyze such a huge amount of visual data generated in real time. So, there is a need for visual big data analytics which involves processing...

chapter

Promoting active participation of the learners in an authoring based learning movie system

A S M Mahfujur Rahman, Abdulmotaleb El Saddik

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA) > 1 - 6

Netflix, Hulu, and similar platforms are among the most popular video streaming services and are increasingly accessed through popular consumer devices such as Apple TV, Xbox, and Wii. It has now become possible to interact conveniently with video content using the input hardware that these devices provide. We emulate the setups that many of these popular platforms provide in order to...

chapter

Pedestrian tracking from an unmanned aerial vehicle

Chao Bian, Zhen Yang, Tao Zhang, Huilin Xiong

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 1067 - 1071

In this paper we present a scheme for pedestrian tracking from an unmanned aerial vehicle (UAV), which includes the motion control of the UAV, and the visual tracking of a specific pedestrian from the moving platform. In the visual tracking part, we use an online updating feature queue and the Locality-constrained Linear Coding (LLC) method to match the pedestrian target. The ground station receives...

chapter

Video annotation for immersive journalism using masking techniques

Joao Meira, Joao Marques, Joao Jacob, Rui Nobrega, more

2016 23º Encontro Português de Computação Gráfica e Interação (EPCGI) > 1 - 7

This paper proposes an interactive annotation technique for 360° videos that allows the use of traditional video editing techniques to add content to immersive videos. Using the case study of immersive journalism the main objective is to diminish the entry barrier for annotating 360° video pieces, by providing a different annotation paradigm and a set of tools for annotation. The spread of virtual...

chapter

Deep Neural Networks for Page Stream Segmentation and Classification

Ignazio Gallo, Lucia Noce, Alessandro Zamberletti, Alessandro Calefati

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

In this manuscript we propose a novel method for joint page stream segmentation and multi-page document classification. The end goal is to classify a stream of pages as belonging to different classes of documents. We take advantage of the recent state-of-the-art results achieved using deep architectures in related fields such as document image classification, and we adopt similar models to obtain...

chapter

Extraction of visual information in basketball broadcasting video for event segmentation system

Jae-Hyuck Park, Keeseong Cho

2016 International Conference on Information and Communication Technology Convergence (ICTC) > 1098 - 1100

Video analysis is an essential process for segmenting and summarizing sports videos automatically. In this paper, we propose fast and simple computer vision algorithms which can be employed in an event segmentation system for basketball broadcasting videos. In our approach, camera panning is estimated by optical flow estimation and flow segmentation algorithms. For recognizing shot classes and clock...

chapter

Step and activity detection based on the orientation and scale attributes of the SURF algorithm

Chadly Marouane, Andre Ebert, Claudia Linnhoff-Popien, Maximilian Christil

2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN) > 1 - 8

In recent years, the importance of location-based services and indoor positioning systems has increased significantly for both research and industry. Visual localization systems have the advantage of not depending on dedicated infrastructure and are thus interesting for navigation within buildings. While there are already approaches that use pre-recorded databases of reference images to obtain...

chapter

Visual odometry using motion vectors from visual feature points

Chadly Marouane, Marco Maier, Alexander Leupold, Claudia Linnhoff-Popien

2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN) > 1 - 8

In recent years, location-based services and indoor positioning systems have gained increasing importance for both research and industry. Visual localization systems have the advantage of not being dependent on dedicated infrastructure and thus are especially interesting for navigation within buildings. While there are already approaches that use pre-recorded databases of reference images to obtain an...

chapter

Drones for live streaming of visuals for people with limited mobility

Eleni Mangina, Evan O'Keeffe, Joe Eyerman, Lizbeth Goodman

2016 22nd International Conference on Virtual System & Multimedia (VSMM) > 1 - 6

Robotics is the field currently taking its place as a leading candidate for dramatic changes in everyday life. Advances in the past 10 years in sensing, actuator and power technologies have fuelled an explosion of opportunities in this exciting and surprisingly affordable domain. Small Unmanned Aircraft Systems (drones) are being rapidly developed for research, public service, and commercial applications,...

INFONA - science communication portal
