The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Humans communicate through natural modalities such as speech, sketching, facial expressions and gestures. Even eye-gaze and forces felt through physical interaction supply subtle, but important, bits of information in human-human communication. However, our communication with computers is primarily over ancient hardware such as mice and keyboards. A new generation of user interfaces, called intelligent...
The interactive 3D technologies group at Microsoft Research Cambridge brings together research in vision, graphics, novel hardware and AR/VR. In this talk, Dr. Izadi will give a broad overview of his new group, highlighting a number of existing projects. The majority of the talk will focus on KinectFusion, a project that demonstrates the radically new user experiences that can be enabled when researching...
Copyright and Reprint Permission: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy beyond the limit of US copyright law for private use of patrons those articles in this volume that carry a code at the bottom of the first page, provided the per-copy fee indicated in the code is paid through Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923. For...
This tutorial will focus on programming with Kinect, Microsoft's motion sensor developed for Xbox 360 using Microsoft Kinect SDK. Sample codes will be given to read and process RGB images from RGB camera and depth maps from the range sensor. Moreover, user tracking will be explained and used in a demo that creates visual effects on video captured by Kinect in real-time. The machine learning algorithm...
The large-scale understanding of personal and social behavior from smartphone sensor data is an emerging trend in computing. Smartphones can constantly sense human location, motion, proximity, and communication, and represent one of the most accurate means of tracing human activities. All this data, as never before, is being generated at massive scales. I will present an overview of recent work in...
In this demo, we present a system which is designed to gather some significant face features by users. These collected features, then, will be used for sketch-photo or caricature-photo or montage portrait-photo matching. While presenting the system, we are also planning to collect data from volunteers.
This work introduces a real-time video-based open-set face recognition system. The system has been developed for the identification of people who stand in front of an interactive screen to communicate with a virtual application. The system uses Discrete Cosine Transform (DCT) features obtained from non-overlapping 20 blocks, and Support Vector Machines (SVM) based verifiers are employed for the classification...
Mobile devices, applications, cloud services and associated mobile experiences have seen exponential advancements in recent years. We have recently seen major improvements in user interfaces such as multi-touch, mobile compute and storage capabilities, wireless network bandwidth as provided by HSPA+ and LTE, sensors such as low-power accelerometers and gyroscopes, location estimation and associated...
This talk is a whirlwind tour through human auditory perception. First, there is a mention of the actual acoustical cues in a performance venue, and then the ear's effects are discussed. In this, some discussion of loudness comes first, along with a bit of the presumed mechanisms. This will be followed by discussion of binaural auditory cues such as ITD, ILD, and HRTF's, and then a bit of discussion...
This tutorial will present a brief overview of MIMO fundamentals, including information theory basics, space-time coding, spatial multiplexing, multiuser MIMO, and MIMO interference alignment. Owing to their great promise, MIMO systems have found their way in several standards for future wireless communications systems, especially for wireless local area networks and cellular networks. Examples of...
The communications theory is a well established field whose origin could be traced back to approximately the early 20th century. This rich theory has found a very successful application since the 1980's - the mobile telephony through wireless cellular networks. Since the standardization process for the 4G technologies (LTE, LTE-Advanced) is in its final stage, it is time for the research community...
This paper presents a grammar-based Turkish interactive voice response(IVR) system used for call centers. In this work, a sample IVR of a telecommunication company call center is realized.
In this demo session, a real-time automatic face detection and recognition system will be demonstrated. The system, which is implemented as a desktop application with a user interface, detects the faces in the images that are grabbed from a web camera using a cascaded classifier consisting of Modified Census Transform features. Then, using the same method, it locates the eyes and the mouth on each...
Forensic image processing software which includes customized filter solutions, is developed based upon the experience of the expert analyst in the field. The legal point of view and the flexible use of the software are both considered. The image processing filters are plug-ins to the forensic investigation software, and thus uprising requests can be satisfied by any third party developer team.
Purpose of Diver Detection Sonar is to perform reconnaissance in order to provide detection and protection for economically and strategically important facilities located on coastal areas, such as harbours, oil refining plants, offshore platforms, oil loading terminals, military facilities, ammunition stores and ships against sneak attacks which can be carried out by divers and autonomous underwater...
Real-time 3D motion capture for the human hand opens many avenues for HCI. This work describes our framework for fitting a 3D skeleton to the human hand using depth images. We represent a human hand by a 3D skeleton with 15 joints. Using this model, various synthetic depth images are generated. Random Decision Forests (RDF) are trained and used to assign each pixel to a hand part. A mean-shift method...
Within Ottoman Text Archive Project a web interface to aid in uploading, binarization, line and word segmentation, labeling, recognition and testing of the Ottoman Turkish texts has been developed. It became possible to retrieve expert knowledge of scholars working with Ottoman archives through this interface, and apply this knowledge in developing further technologies in transliteration of historical...
In this work, the Cumulative Distribution Function (CDF) and the Probability Density Function (PDF) are examined for a data set of finite elements. The CDF and the PDF are valid only for the theoretical asymptotes when the number of elements in the set approaches infinity. The equivalent functions defined for a finite set are currently unknown. In various fields, especially in signal processing, data...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.