Search results

chapter

Patient information extraction in noisy tele-health texts

Mi-Young Kim, Ying Xu, Osmar Zaiane, Randy Goebel

2013 IEEE International Conference on Bioinformatics and Biomedicine > 326 - 329

2013 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

We explore methods for effectively extracting information from clinical narratives, which are captured in a public health consulting phone service called HealthLink. The currently available data consists of dialogues constructed by nurses while consulting patients on the phone. Since the data are interviews transcribed by nurses during phone conversations, they include a significant volume and variety...

chapter

Cartification: A Neighborhood Preserving Transformation for Mining High Dimensional Data

Emin Aksehirli, Bart Goethals, Emmanuel Muller, Jilles Vreeken

2013 IEEE 13th International Conference on Data Mining > 937 - 942

2013 IEEE International Conference on Data Mining (ICDM)

The analysis of high dimensional data comes with many intrinsic challenges. In particular, cluster structures become increasingly hard to detect when the data includes dimensions irrelevant to the individual clusters. With increasing dimensionality, distances between pairs of objects become very similar, and hence, meaningless for knowledge discovery. In this paper we propose Cartification, a new...

chapter

A Parameter-Free Spatio-Temporal Pattern Mining Model to Catalog Global Ocean Dynamics

James H. Faghmous, Matthew Le, Muhammed Uluyol, Vipin Kumar, more

2013 IEEE 13th International Conference on Data Mining > 151 - 160

2013 IEEE International Conference on Data Mining (ICDM)

As spatio-temporal data have become ubiquitous, an increasing challenge facing computer scientists is that of identifying discrete patterns in continuous spatio-temporal fields. In this paper, we introduce a parameter-free pattern mining application that is able to identify dynamic anomalies in ocean data, known as ocean eddies. Despite ocean eddy monitoring being an active field of research, we provide...

chapter

Extraction of a weak co-channel interfering communication signal using complex Independent Component Analysis

Matthew E. Hagstette, Monique P. Fargues, Roberto Cristi

2013 Asilomar Conference on Signals, Systems and Computers > 1171 - 1175

2013 Asilomar Conference on Signals, Systems and Computers

Independent Component Analysis (ICA) algorithms taking advantage of the potential non-circular property of complex signals have been recently derived and shown to lead to improved performances. We investigate the performance of three ICA approaches to extract a weak co-channel interfering communications signal from a television broadcast signal over varied interference-to-noise ratios: complex maximization...

chapter

A Privacy-Preserving Data Obfuscation Scheme Used in Data Statistics and Data Mining

Pan Yang, Xiaolin Gui, Feng Tian, Jing Yao, more

2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing > 881 - 887

2013 IEEE International Conference on High Performance Computing and Communications (HPCC) & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Many applications are benefited from data sharing, especially data statistics and data mining. But as the shared data may contain private information of data owner, it has a high risk of revealing data owner's privacy. Data obfuscation is proposed to gain a balance between data privacy and data usability. But it is hard for the present obfuscation schemes to remain the usability of data in a fine-grained...

chapter

NCC-RANSAC: A fast plane extraction method for navigating a smart cane for the visually impaired

X. Qian, C. Ye

2013 IEEE International Conference on Automation Science and Engineering (CASE) > 261 - 267

2013 IEEE International Conference on Automation Science and Engineering (CASE 2013)

This paper presents a new RANSAC based method for extracting planes from 3D range data. The generic RANSAC Plane Extranction (PE) method may over-extract a plane. It may fail in the case of a multi-step scene where the RANSAC process results in multiple inlier patches that form a slant plane straddling the steps. The CC-RANSAC algorithm overcomes the latter limitation if the inlier patches are separate...

chapter

Privacy Preserving Frequent Pattern Mining on Multi-cloud Environment

Chih-Hua Tai, Jen-Wei Huang, Meng-Hao Chung

2013 International Symposium on Biometrics and Security Technologies > 235 - 240

2013 International Symposium on Biometrics and Security Technologies (ISBAST)

As the age of big data evolves, outsourcing of data mining tasks to multi-cloud environments has become a popular trend. To ensure the data privacy in outsourcing of mining tasks, the concept of support anonymity was proposed to hide sensitive information about patterns. Existing methods that tackle the privacy issues, however, do not address the related parallel mining techniques. To fill this gap,...

chapter

An Associated Extraction Method of Palmprint Principal Lines

Huang Peng-Di, Shi Jun-Sheng, Xu Guang-Hui

2013 Seventh International Conference on Image and Graphics > 448 - 452

2013 Seventh International Conference on Image and Graphics (ICIG)

The research of palm print features has attracted a lot of attention, and the principal lines which is one of the stable and important features in palm print images can provide effective information for application of palm print technology. Aimed to accurate and natural extraction, in this paper, an associated extraction method of palm print principal lines is presented based on the own properties...

chapter

News Web Text Extraction Based on the Maximum Subsequence Segmentation

Jianzhuo Yan, Hexin Duan, Liying Fang, Wang Ying

2013 International Conference on Computational and Information Sciences > 619 - 622

2013 Fifth International Conference on Computational and Information Sciences (ICCIS)

Many people use the web as the main information source in their daily lives. However, most web pages contain non-information components, such as site bars, footers and ads, etc., which make it complicated to extract text from the original HTML documents. Because of the high human intervention and the low results extraction quality, although the web text extraction techniques have been developed, the...

chapter

Efficient noise extraction algorithm and wideband noise measurement system from 0.3 GHz to 67 GHz

Hoang V. Nguyen, Neven Misljenovic, Bryan Hosein

81st ARFTG Microwave Measurement Conference > 1 - 2

2013 81st ARFTG Microwave Measurement Conference (ARFTG)

An efficient and accurate noise parameter statistical extraction algorithm is proposed and validated experimentally using a high performance Silicon MOSFET transistor. The proposed algorithm is applicable to most devices with high input reflection coefficients and operating over wide bandwidth. Measured data agree well with theoretical expectation.

chapter

Why so complicated? Simple term filtering and weighting for location-based bug report assignment recommendation

Ramin Shokripour, John Anvik, Zarinah M. Kasirun, Sima Zamani

2013 10th Working Conference on Mining Software Repositories (MSR) > 2 - 11

2013 10th IEEE Working Conference on Mining Software Repositories (MSR 2013)

Large software development projects receive many bug reports and each of these reports needs to be triaged. An important step in the triage process is the assignment of the report to a developer. Most previous efforts towards improving bug report assignment have focused on using an activity-based approach. We address some of the limitations of activity-based approaches by proposing a two-phased location-based...

chapter

Deciphering the story of software development through frequent pattern mining

Nicolas Bettenburg, Andrew Begel

2013 35th International Conference on Software Engineering (ICSE) > 1197 - 1200

2013 35th International Conference on Software Engineering (ICSE)

Software teams record their work progress in task repositories which often require them to encode their activities in a set of edits to field values in a form-based user interface. When others read the tasks, they must decode the schema used to write the activities down. We interviewed four software teams and found out how they used the task repository fields to record their work activities. However,...

chapter

Application of data mining for identifying topics at the document level

Marifa Farzin Reza, Rizwana Matin

2013 International Conference on Informatics, Electronics and Vision (ICIEV) > 1 - 6

2013 2nd International Conference on Informatics, Electronics and Vision (ICIEV)

Data mining techniques are very popular in modern days and are used in NLP (Natural Language Processing). It allows users to analyze data from many different perspectives, categorize it, and summarize the relationships identified. One of the techniques, clustering items to groups, has been very popular. We use this technique here to find different topics in a document. We aim to replicate previous...

chapter

Fault-tolerant mining algorithm of sampling data from dynamic system

Hu Shaolin, Li Ye, Zhang Dong

2013 25th Chinese Control and Decision Conference (CCDC) > 4927 - 4931

2013 25th Chinese Control and Decision Conference (CCDC)

Time series data mining is an useful tool for us to design data-driven condition monitoring as well as fault diagnosis system. Aiming at monitoring abnormal changes of dynamic process, a series of mining algorithms are built up to mine signal form, model structure of process and statistical properties of noise in sampling data series, the architecture of information mining system of sampling time...

chapter

Unsupervised multimodal VAD using sequential hierarchy

Rameez Ahmad, Syed Paymaan Raza, Hafiz Malik

2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM) > 174 - 177

2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)

In speech processing systems, the performance of the Voice Activity Detector (VAD) is a bottleneck to the whole system. Traditional VADs are solely based on acoustic features. Additional modality in form of visual information is used to make robust VADs. In this paper, we propose a multimodal VAD based on decision fusion between two modalities. Visual VAD (VVAD) decision vectors are interpolated so...

chapter

A principled approach to mining from noisy logs using Heuristics Miner

Philip Weber, Behzad Bordbar, Peter Tino

2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM) > 119 - 126

2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)

Noise is a challenge for process mining algorithms, but there is no standard definition of noise nor accepted way to quantify it. This means it is not possible to mine with confidence from event logs which may not record the underlying process correctly. We discuss one way of thinking about noise in process mining. We consider mining from a ‘noisy log’ as learning a probability distribution over traces,...

chapter

Identification of Anomalies in Processes of Database Alteration

Francesco Mercaldo

2013 IEEE Sixth International Conference on Software Testing, Verification and Validation > 513 - 514

2013 IEEE Sixth International Conference on Software Testing, Verification and Validation (ICST)

Data, especially in large item sets, hide a wealth of information on the processes that have created and modified them. Often, a data-field or a set of data-fields are not modified only through well-defined processes, but also through latent processes; without the knowledge of the second type of processes, testing cannot be considered exhaustive. As a matter of fact, changes in the data deriving from...

chapter

Points-of-Interest Mining from People's Photo-Taking Behavior

Ickjai Lee, Guochen Cai, Kyungmi Lee

2013 46th Hawaii International Conference on System Sciences > 3129 - 3136

2013 46th Hawaii International Conference on System Sciences (HICSS)

Millions of geo-tagged photos are becoming available due to the widespread of photo-sharing websites. These social medias capture attractive points-of-interest and contain interesting photo-taking patterns. Massive amount of these user-oriented data produces new challenges and understanding people's photo-taking behavior is of great importance for local tourism-related businesses. This paper analyzes...

chapter

Generating Diverse Realistic Data Sets for Episode Mining

Albrecht Zimmermann

2012 IEEE 12th International Conference on Data Mining Workshops > 611 - 618

2012 IEEE 12th International Conference on Data Mining Workshops

Frequent episode mining has been proposed as a data mining task with the goal of recovering sequential patterns from temporal data sequences. While several episode mining approaches have been proposed in the last fifteen years, most of the developed techniques have not been evaluated on a common benchmark data set, limiting the insights gained from experimental evaluations. In particular, it is unclear...

chapter

Logical Itemset Mining

Shailesh Kumar, Chandrashekar V., C.V. Jawahar

2012 IEEE 12th International Conference on Data Mining Workshops > 603 - 610

2012 IEEE 12th International Conference on Data Mining Workshops

Frequent Item set Mining (FISM) attempts to find large and frequent item sets in bag-of-items data such as retail market baskets. Such data has two properties that are not naturally addressed by FISM: (i) a market basket might contain items from more than one customer intent(mixture property) and (ii) only a subset of items related to a customer intent are present in most market baskets (projection...

INFONA - science communication portal

Search results

Patient information extraction in noisy tele-health texts

Cartification: A Neighborhood Preserving Transformation for Mining High Dimensional Data

A Parameter-Free Spatio-Temporal Pattern Mining Model to Catalog Global Ocean Dynamics

Extraction of a weak co-channel interfering communication signal using complex Independent Component Analysis

A Privacy-Preserving Data Obfuscation Scheme Used in Data Statistics and Data Mining

NCC-RANSAC: A fast plane extraction method for navigating a smart cane for the visually impaired

Privacy Preserving Frequent Pattern Mining on Multi-cloud Environment

An Associated Extraction Method of Palmprint Principal Lines

News Web Text Extraction Based on the Maximum Subsequence Segmentation

Efficient noise extraction algorithm and wideband noise measurement system from 0.3 GHz to 67 GHz

Why so complicated? Simple term filtering and weighting for location-based bug report assignment recommendation

Deciphering the story of software development through frequent pattern mining

Application of data mining for identifying topics at the document level

Fault-tolerant mining algorithm of sampling data from dynamic system

Unsupervised multimodal VAD using sequential hierarchy

A principled approach to mining from noisy logs using Heuristics Miner

Identification of Anomalies in Processes of Database Alteration

Points-of-Interest Mining from People's Photo-Taking Behavior

Generating Diverse Realistic Data Sets for Episode Mining

Logical Itemset Mining

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options