Search results

chapter

Decoder-side HEVC quality enhancement with scalable convolutional neural network

Ren Yang, Mai Xu, Zulin Wang

2017 IEEE International Conference on Multimedia and Expo (ICME) > 817 - 822

The latest High Efficiency Video Coding (HEVC) has been increasingly used to generate video streams over Internet. However, the decoded HEVC video streams may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary to enhance visual quality of HEVC videos at the decoder side. To this end, we propose in this paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN)...

chapter

Real time video summarization on mobile platform

Pradeep Choudhary, Sowmya P. Munukutla, K. S. Rajesh, Alok S. Shukla

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1045 - 1050

Recent advances in technology and rapid growth of consumer electronics have made tremendous amount of multimedia information available to the general population. Browsing through large collections of consumer videos and manually creating summaries can be tedious. Automatic summarization techniques will give the user an easy way to look up important content of a collection of media and to browse media...

chapter

Temporally Steered Gaussian Attention for Video Understanding

Shagan Sah, Thang Nguyen, Miguel Dominguez, Felipe Petroski Such, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2208 - 2216

Recent advances in video understanding are enabling incredible developments in video search, summarization, automatic captioning and human computer interaction. Attention mechanisms are a powerful way to steer focus onto different sections of the video. Existing mechanisms are driven by prior training probabilities and require input instances of identical temporal duration. We introduce an intuitive...

chapter

Gender differences in the use of SCREAM Rhetorical devices displayed on video presentations: (An analysis of undergraduate students' persuasive presentations)

Joice Yulinda Luke, Kiky Soraya

2017 10th International Conference on Human System Interactions (HSI) > 111 - 115

Delivering persuasive presentation is a challenging task for the university students. One of the difficult parts is grasping the attention of audiences during the critical situations, such as in the early morning or after the lunch break. At the situations, students are not paying more attention to any speeches or presentations occurred in the classroom because of some lapses of attention span such...

chapter

Authenticating physical location using QR codes and network latency

Charles Allen, Antony Harfield

2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE) > 1 - 6

QR codes are increasingly being used as a mechanism to transmit one time passwords (OTPs) between devices for the purpose of authentication due to their convenience, low cost, and the ubiquity of consumer mobile devices. Existing practice typically utilizes a single QR code which is relatively easy to capture and relay to an offsite attacker or collaborator. We propose a mechanism using a stream of...

chapter

Uncoordinated multi-user video streaming in VANETs using Skype

Evgeny Belyaev, Sergio Moreschini, Alexey Vinel

2017 IEEE 22nd International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD) > 1 - 3

Real-time video delivery in Vehicle-to-Infrastructure (V2I) scenario enables a variety of multimedia vehicular services. We conduct experiments with Dedicated Short Range Communications (DSRC) transceivers located in the mutual proximity and exchanging Skype video calls traffic. We demonstrate that the lack of coordination between the users both at the application as well as Medium Access Control...

chapter

Manifolds of tool-graspability in the human brain

Xixi Wang, Carol A. Jew, Feng Lin, Rajeev D.S. Raizada

2017 International Workshop on Pattern Recognition in Neuroimaging (PRNI) > 1 - 4

Neural representations for object recognition are difficult to construct because vision operates in high-dimensional space. This study aims to develop low-dimensional neural representations (“manifolds”) that could contain either rotation or viewpoint information. In our experiments, four rotating tools were used as visual stimuli and brain activity was recorded using functional magnetic resonance...

chapter

WEIDJ: An improvised algorithm for image extraction from web pages

Ily Amalina Ahmad Sabri, Mustafa Man

2017 8th International Conference on Information Technology (ICIT) > 512 - 517

World wide web (www) is a huge information repository and rapidly growing as source of information. Web pages is known as semi-structured data and it contains variety of information such as text, images, audio, video and other various format. The process of extracting information from the web pages is time consuming and requires correct approach and this paper presents an improvised algorithm in extracting...

chapter

A framework for visual fog computing

Shao-Wen Yang, Omesh Tickoo, Yen-Kuang Chen

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

Visual data are rich, which have opened vast analytics opportunities and been widely used in many applications. However, the demanding requirements of computational resources and bandwidth have prevented the data from being useful in an economically efficient manner. A visual fog paradigm is needed for efficient processing of continuous video streams by collaboratively using things in the Internet...

chapter

Analysis of video views in online courses

Esztelecki Peter, Havasi Ferenc

2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) > 778 - 782

Nowadays, in our fast-paced world there are countless MOOC courses in the Internet with various topics that have been designed to broaden our knowledge. One of the most powerful tools for effective learning are online videos. Many case studies have been carried out in order to specify the qualities of a good educational video. Philip J. Guo, Juho Kim and Rob Rubin published an article (How Video Production...

chapter

Machine learning and coresets for automated real-time video segmentation of laparoscopic and robot-assisted surgery

Mikhail Volkov, Daniel A. Hashimoto, Guy Rosman, Ozanan R. Meireles, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 754 - 759

Context-aware segmentation of laparoscopic and robot assisted surgical video has been shown to improve performance and perioperative workflow efficiency, and can be used for education and time-critical consultation. Modern pressures on productivity preclude manual video analysis, and hospital policies and legacy infrastructure are often prohibitive of recording and storing large amounts of data. In...

chapter

Real-time audiovisual laughter detection

B. Berker Turker, Zana Bucinca, M. Tevfik Sezgin, Yucel Yemez, more

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Laughter detection is an essential aspect towards effective human-computer interaction. This work primarily addresses the problem of laughter detection in a real-time environment. We utilize annotated audio and visual data collected from a Kinect sensor to identify discriminative features for audio and video, separately. We show how the features can be used with classifiers such as support vector...

chapter

The label knows better: The impact of labeling effects on perceived quality of HD and UHD video streaming

Peter A. Kara, Werner Robitza, Alexander Raake, Maria G. Martini

2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX) > 1 - 6

There is an ongoing debate in the research community over the improved visual quality of UHD video in comparison to the still widely-deployed HD standard. It is the inspiration of many scientific studies, yet UHD displays and services are continuously spreading on the consumer market. This paper presents the results of a subjective paired-comparison test with both upscaled HD and UHD video sequences,...

chapter

Vision-based target tracking and autonomous landing of a quadrotor on a ground vehicle

Tru Hoang, Enkhmurun Bayasgalan, Ziyin Wang, Gavriil Tsechpenakis, more

2017 American Control Conference (ACC) > 5580 - 5585

This paper addresses vision-based tracking and landing of a micro-aerial vehicle (MAV) on a ground vehicle (GV). The camera onboard the MAV is mounted so that the optical axis is aligned with the downward-facing axis of the body-fixed frame. A novel supervised learning vision algorithm is proposed as the method to detect the ground vehicle in the image frame. A feedback linearization technique is...

chapter

Unattended object detection based on blob tracking

Murat Peker

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

In this study, an effective approach is proposed for detecting unattended objects in visual surveillance systems. In the proposed approach, the regions of foreground are labeled as moving regions which are determined by background subtraction algorithms in a streamed video from a fixed camera. Moving blobs are used to extract information about events occurring in the environment from which the image...

chapter

FASUM: Feature Accelerated Single-Video Summarization

M. Thahaseen Fathima, S. Chitrakala

2017 International Conference on Technical Advancements in Computers and Communications (ICTACC) > 45 - 49

Plenty of video stuff is created, broadcasted, shared and stored each and every day by industry experts, beginners, and hobbyists. Video summaries aim at showcasing the semantics and content of a clip in reduced time and space to enable a quick overview of video clip relevance. This paper focuses on static summaries showing key frames from the video. The key frames are extracted by leveraging the...

chapter

Ego-Motion Classification for Driving Vehicle

Li Du, Wenhui Jiang, Zhicheng Zhao, Fei Su

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 276 - 279

Accurate prediction of vehicle ego-motion in real time is crucial for an autonomous driving system. In this paper, we formulate the problem of ego-motion classification as video event detection, and we propose an end-to-end deep model to address this problem. In this model, we utilize Convolutional Neural Networks (CNNs) to extract semantic visual feature of each video frame, and employ a Long Short...

chapter

Unsupervised Video Summaries Using Multiple Features and Image Quality

Tongling Hu, Zechao Li, Weiyang Su, Xing Mu, more

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 117 - 120

It is important to generate both interesting and representative video summary for massive videos. This work proposes a new method to generate dynamic video summary using multiple features and image quality without human's involvement in the whole procedure. Specifically, we first split a video into several video clips. Second, a set of features including visual attention, exposure of light, saturation,...

chapter

Video quality assessment: A review of full-referenced, reduced-referenced and no-referenced methods

Hoong-Cheng Soong, Phooi-Yee Lau

2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA) > 232 - 237

Whenever a video is being digitized, compressed and transmitted across the network, some degradation might be introduced that could affect the quality of the video received. Thus, it is essential to provide feedback system to the provider which could allow them the freedom to feedback to the system, if the quality of video being transmitted could be improved, in terms of video quality. This is an...

chapter

Research on video motion characteristics extraction and description based on human visual characteristics

Ying Zhou, Yongsheng Liang, Wei Liu, Lixia Zhao

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 936 - 940

With the rapid development of Internet and multimedia, there is new challenge on streaming media processing and transmission. On one hand, the demand for the quantity and quality of multimedia resources are getting higher, which requires the matching capabilities of data storage and transmission. On the other hand, restricted to the computer's processing power, heterogeneous network environment, network...
