The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The authors have been working on support of coastal fishing utilizing ICT since 2004. We have realized two visualizations for the sustainable fishery until now. One is visualization of resources for capture fishing using digital diary and the other is visualization of environment for culture fishing using small buoys. In this paper, we propose the visualization of technology for octopus drift fishing...
Omnidirectional, also referred to as 360º, visual content provides an immersive experience since it allows users to view a visual scene from different directions. The overall content typically covers a full sphere, and omnidirectional videos or images are processed to obtain a projection on a 2D plane of a fraction of the sphere (aka viewport), which is shown to the user. Therefore, users can look...
Stories can have tremendous power – not only useful for entertainment, they can activate our interests and mobilize our actions. The degree to which a story resonates with its audience may be in part reflected in the emotional journey it takes the audience upon. In this paper, we use machine learning methods to construct emotional arcs in movies, calculate families of arcs, and demonstrate the ability...
The aim of our proposed research work is to identify language of spoken utterance using visual speech recognition and include Marathi language in language identification (LID) system. In this paper we have focused on the task of identifying first three digits in Marathi language. For this first Lips are extracted from video frames of face images and then landmark points on the lips are detected. Then...
Emotions influence our perceptions and decisions and are often felt more strongly in situations related to healthcare. Therefore, it is important to understand how both providers and patients express their emotions in face-to-face scenarios. An ontology is a way to represent domain concepts and the relationships between them in a polyarchical manner. We have created an ontological model called the...
A near-light perspective shape from shading (SfS) technique applied to endoscopy for 3D visualizations of the gastrointestinal tract regions is presented. By utilizing an extensible reflectance model, we study a robust Huber regularization function based variational SfS model. A balancing parameter is used for weighting the irradiance ad smoothness/regularization terms. Experimental results on different...
We propose a novel measure of visual similarity for image retrieval that incorporates both structural and aesthetic (style) constraints. Our algorithm accepts a query as sketched shape, and a set of one or more contextual images specifying the desired visual aesthetic. A triplet network is used to learn a feature embedding capable of measuring style similarity independent of structure, delivering...
In this paper we describe how to use a Game-Based Learning (GBL) approach to help end-user programmers to develop Computational Thinking (CT) skills. CT represents a framework consisting of a set of skills such as: abstraction, decomposition and more in general problem solving expressed in a computational form (algorithmic thinking) that can be easily formalised into a computer program. Those skills...
Mixed reality (MR) technologies can virtually change the appearance of real objects in real time without changing the material attributes of the objects and the associated haptic stimuli. In this study, we use MR-based visuo—haptic to investigate the mechanisms by which vision and haptics interact. In contrast to MR, diminished reality (DR) can virtually erase a real object from our sight. Because...
In this study we provide our methodology and implementation strategy of Sketchnoting in Freshman Engineering and Technological Literacy classes. The objective is to improve students' learning, visualization, and communication proficiencies, as well as to foster advancement in knowledge retention, and critical thinking. This study provides the motivation, supporting research background, design, and...
In this paper, we propose a patched-based deep Boltzmann shape priors for visual tracking. The shape priors are generated from deep Boltzmann machine network. The network consists of three layers of hidden and visible units. The generated shapes not only maintain general shapes from a variety of poses, but also entail local modifications with high probability.
Humans infer rich knowledge of objects from both auditory and visual cues. Building a machine of such competency, however, is very challenging, due to the great difficulty in capturing large-scale, clean data of objects with both their appearance and the sound they make. In this paper, we present a novel, open-source pipeline that generates audiovisual data, purely from 3D object shapes and their...
Electrocorticographic (ECoG) electrode arrays with epidural placement allow establishing neural interfaces which promote investigation of complex distributed brain functions and development of neuroprosthetic systems while having reduced invasiveness in comparison to intracortical or subdural electrode placement. In this work, we present the design and fabrication of a flexible ECoG array with 202...
In visual question answering (VQA), an algorithm must answer text-based questions about images. While multiple datasets for VQA have been created since late 2014, they all have flaws in both their content and the way algorithms are evaluated on them. As a result, evaluation scores are inflated and predominantly determined by answering easier questions, making it difficult to compare different methods...
Face alignment has witnessed substantial progress in the last decade. One of the recent focuses has been aligning a dense 3D face shape to face images with large head poses. The dominant technology used is based on the cascade of regressors, e.g., CNNs, which has shown promising results. Nonetheless, the cascade of CNNs suffers from several drawbacks, e.g., lack of end-to-end training, handcrafted...
We consider 4D shape reconstructions in multi-view environments and investigate how to exploit temporal redundancy for precision refinement. In addition to being beneficial to many dynamic multi-view scenarios this also enables larger scenes where such increased precision can compensate for the reduced spatial resolution per image frame. With precision and scalability in mind, we propose a symmetric...
Two types of information exist in a stereo pair: correlation (matching) and decorrelation (half-occlusion). Vision science has shown that both types of information are used in the visual cortex, and that people can perceive depth even when correlation cues are absent or very weak, a capability that remains absent from most computational stereo systems. As a step toward stereo algorithms that are more...
This paper proposes a systematic framework for applying the Physics of Notations (PoN), a theory for the design of cognitively effective visual notations. The PoN consists of nine principles, but not all principles lend themselves equally to a clear and unambiguous operationalization. As a result, many visual notations designed according to the PoN apply it in different ways. The proposed framework...
This paper presents a serious game designed for children suffering from profound intellectual and multiple disabilities (PIMD) also know as multihandicap, for their evaluation and cognitive training. The specificities of these children must be taken into account for the choice of both the game feedbacks and interfaces.
Although point cloud data (PCD) are easily measured using a RGB-Depth sensor such as Kinect and Xtion, measured PCD have undesirable noise and fluctuation of values. In this paper, a curved surface fitting method using a raster-scanning window is proposed to smooth original organized PCD with noise. The method allows PCD to be fitted to numerous small quadratic curved surfaces and to be smoothed while...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.