Search results

chapter

Discretizing Numerical Attributes in Decision Tree for Big Data Analysis

Yiqun Zhang, Yiu-Ming Cheung

2014 IEEE International Conference on Data Mining Workshop > 1150 - 1157

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

The decision tree induction learning is a typical machine learning approach which has been extensively applied for data mining and knowledge discovery. For numerical data and mixed data, discretization is an essential pre-processing step of decision tree learning. However, when coping with big data, most of the existing discretization approaches will not be quite efficient from the practical viewpoint...

chapter

Process and deviation exploration through Alpha-algorithm and Heuristic miner techniques

Parham Porouhan, Nipat Jongsawat, Wichian Premchaiswadi

2014 Twelfth International Conference on ICT and Knowledge Engineering > 83 - 89

2014 12th International Conference on ICT and Knowledge Engineering (ICT & Knowledge Engineering 2014)

In this paper, we applied two methods of process mining techniques (from Discovery class/approach) in order to extract knowledge from event logs recorded by an online information system. The event log was created via information received from an online proceedings review system in Thailand. Accordingly, Alpha and Heuristic algorithms were used with the objective of automatically visualizing the models...

chapter

A combined approach for ice sheet elevation extraction from lidar point clouds

Jie Yang, John Kerekes

2014 IEEE Western New York Image and Signal Processing Workshop (WNYISPW) > 15 - 18

2014 IEEE Western New York Image and Signal Processing Workshop (WNYISPW)

Understanding the total mass balance of Earth's polar cryosphere is a significant aspect of estimating sea level rise due to climate change. Measuring ice sheet elevation and ground height is essential for determining the total glacier mass balance. Satellite-based and airborne photon-counting lidar sensors, such as the upcoming Ice, Cloud and land Elevation Satellite 2 (ICESat-2), will provide accurate...

chapter

A Review: The Effects of Imperfect Data on Incremental Decision Tree

Hang Yang, Aidong Xu, Huajun Chen, Cai Yuan

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing > 34 - 41

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC)

Decision tree, as one of the most widely used methods in data mining, has been used in many realistic application. Incremental decision tree handles streaming data scenario that is applicable for big data analysis. However, imperfect data are unavoidable in real-world applications. Studying the state-of-art incremental decision tree induction using Hoeffding bound, we investigated the influence of...

chapter

An Asynchronous Periodic Sequential Patterns Mining Algorithm with Multiple Minimum Item Supports

Xiangzhan Yu, Haining Yu

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing > 274 - 281

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC)

Original sequential pattern mining model only considers occurrence frequentness of sequential patterns, disregards their occurrence periodicity. We propose the asynchronous periodic sequential pattern mining model to discover the sequential patterns which are not only occurring frequently, but also appearing periodically. For this mining model, we propose a pattern-growth mining algorithm to mine...

chapter

Considerations on the information and entropy of ordinal data

Iulian Petrila, Florina Ungureanu, Vasile Manta

2014 18th International Conference on System Theory, Control and Computing (ICSTCC) > 732 - 736

2014 18th International Conference on System Theory, Control and Computing (ICSTCC)

This study attempts to establish methods for characterizing the complexity of ordinal data through the information and entropy parameters. In this respect, there were examined the methods for measuring the complexity of data with similar statistical characteristics and the parameters that can make the difference between them were established. For this purpose, the analysis was applied to three data...

chapter

On new sequential hard c-means and its kernelization

Yukihiro Hamasuna, Yasunori Endo

2014 IEEE International Conference on Granular Computing (GrC) > 82 - 87

2014 IEEE International Conference on Granular Computing (GrC)

This paper presents a new sequential clustering algorithm based on sequential hard c-means clustering. The word sequential cluster extraction means that the algorithm extract one cluster at a time. The sequential hard c-means is one of the typical and conventional sequential clustering methods. The proposed new sequential clustering algorithm is based on Dave's noise clustering approach. A characteristic...

chapter

Palmprint principal lines extraction

Alessandro Bruno, Paolino Carminetti, Vito Gentile, Marco La Cascia, more

2014 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BIOMS) Proceedings > 50 - 56

2014 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BIOMS)

The palmprint recognition has become a focus in biological recognition and image processing fields. In this process, the features extraction (with particular attention to palmprint principal line extraction) is especially important. Although a lot of work has been reported, the representation of palmprint is still an open issue. In this paper we propose a simple, efficient, and accurate palmprint...

chapter

Hyperspectral images compression based on independent component analysis: ROI-based compression algorithm for hyperspectral images

Yu Yang, Bin Liu, Xiaoping Duan, Yongjian Nian

2014 7th International Congress on Image and Signal Processing > 771 - 777

2014 7th International Congress on Image and Signal Processing (CISP)

This paper addresses the problem of lossy compression for hyperspectral images and presents an efficient compression algorithm based on FastICA. Firstly, an efficient algorithm for segmentation of hyperspectral images is proposed. Secondly, based on the targets, a lossy compression based on ROI (Region of Interest) is proposed for hyperspectral compression, which employs KLT(Karhunen-Loève transform)...

chapter

Mining approximate multi-relational patterns

Eirini Spyropoulou, Tijl De Bie

2014 International Conference on Data Science and Advanced Analytics (DSAA) > 477 - 483

2014 International Conference on Data Science and Advanced Analytics (DSAA)

Three recent trends aim to make local pattern mining more directly suited for use on data as it presents itself in practice, namely in a multi-relational form and affected by noise. The first of these trends is the generalisation of local pattern syntaxes to approximate, noise-tolerant, variants (notably fault-tolerant itemset mining and community detection). The second of these trends is to develop...

chapter

Face recognition analysis for noise images based on combinational mirror-like odd and even features

Jianhua Zhao, Shunfang Wang, Hao Zhang

2014 7th International Congress on Image and Signal Processing > 275 - 280

2014 7th International Congress on Image and Signal Processing (CISP)

Since mirror-like odd and even features in face recognition reflect the symmetrical and asymmetrical image information, respectively, their proper combination can improve the recognition rates to some extent. However, the face imaging process can easily be affected by external factors and encounter the noise signal, which disturbs the effect of face recognition based on combinational mirror-like odd...

chapter

Analysis of visually guided tracking performance in Parkinson's disease

Yi Liu, Chonho Lee, Bu-Sung Lee, James K.R. Stevenson, more

2014 IEEE 16th International Conference on e-Health Networking, Applications and Services (Healthcom) > 164 - 169

2014 IEEE 16th International Conference on e-Health Networking, Applications and Services (Healthcom 2014)

Recent studies have suggested significant differences in motor performances of Parkinson's Disease (PD) patients who have L-dopa induced dyskinesias (LIDs), even when off of L-dopa medication. The pathophysiology of LIDs remains obscure, so applying data-mining techniques to the patients' motor performance may provide some heuristic insight. This paper investigated visually-guided tracking performance...

chapter

Overcoming the domain barrier in opinion extraction

Alexandru Cristian Cosma, Vlad-Vasile Itu, Darius Andrei Suciu, Mihaela Dinsoreanu, more

2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP) > 289 - 296

2014 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

Considering the wide spectrum of both practical and research applicability, opinion mining has attracted increased attention in recent years. This article focuses on breaking the domain-dependency barrier which occurs in supervised opinion mining strategies by using a semi-supervised approach, which ensures domain independence. Our work devises a generalized methodology by considering a set of grammar...

chapter

Structural analysis and regular expressions based noise elimination from web pages for web content mining

Amit Dutta, Sudipta Paria, Tanmoy Golui, Dipak K. Kole

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1445 - 1451

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Commercial websites usually contain noisy information blocks along with main content. Noisy information degrades the performance of web content mining. Web content mining is used for discovering useful knowledge or information from the web page. In this paper, we propose noise elimination method that uses tag based filtering followed by structural analysis of the web page. The proposed tag based filtering...

chapter

Examining Case Management Demand Using Event Log Complexity Metrics

Marian Benner-Wickner, Matthias Book, Tobias Bruckmann, Volker Gruhn

2014 IEEE 18th International Enterprise Distributed Object Computing Conference Workshops and Demonstrations > 108 - 115

2014 IEEE 18th International Enterprise Distributed Object Computing Conference Workshops and Demonstrations (EDOCW)

One of the main goals of process mining is to automatically discover meaningful process models from event logs. Since these logs are the essential source of information for discovery algorithms, their quality is of high importance. In recent years, many studies on the quality of resulting process models have been conducted. However, the analysis of event log quality prior to the generation of models...

chapter

A fast clustering method based on multi-splitting grid

Meng Fanyu, Xu Yajing, Gao Zhe, Lin Zhiqing

2014 4th IEEE International Conference on Network Infrastructure and Digital Content > 449 - 452

2014 4th IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC)

Clustering algorithms based on Grid are attractive for the task of data partition in spatial database. In the background of big data more and more research focuses on how to solve the conflict between efficiency and accuracy of clustering. Existing Grid-based clustering algorithms generally have a high time efficiency without considering the distribution of the data inside a grid. In this paper, a...

chapter

Web Data Extraction Based on Visual Information and Partial Tree Alignment

Siwu Fan, Xinjun Wang, Yongquan Dong

2014 11th Web Information System and Application Conference > 18 - 23

2014 11th Web Information System and Application Conference (WISA)

Web databases contain a huge amount of structured data which are easily obtained via their query interfaces only. The query results are presented in dynamically generated web pages, usually in the form of data records, for human use. The automatical web data extraction is critical in web integration. A number of approaches have been proposed. The early work are most based on the source code or the...

chapter

A Practical Approach on Cleaning-Up Large Data Sets

Marius Barat, Dumitru Bogdan Prelipcean, Dragos Teodor Gavrilut

2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing > 280 - 284

2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)

In this paper we propose a noise detection system based on similarities between instances. Having a data set with instances that belongs to multiple classes, a noise instance denotes a wrongly classified record. The similarity between different labeled instances is determined computing distances between them using several metrics among the standard ones. In order to ensure that this approach is computational...

chapter

Novel Proposal and Evaluation of Information Extraction Method from Artificial Fiber Pattern Using a Camera

Kitahiro Kaneda, Tomoki Inui, Keiichi Iwamura, Isao Echizen

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing > 502 - 506

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP)

We previously proposed the artificial fiber (AF) patterns in order to be able to hide information in printed documents. AF pattern uses the features of the medium (e.g., paper). It has features of rotational invariance, low visibility of the hidden information. But it still suffered extraction threshold instability when using a camera to extract the information. This problem has now been overcome...

chapter

Multi-party Conversation Summarization Based on Sentence Selection Using Verbal and Nonverbal Information

Yo Tokunaga, Kazutaka Shimada

2014 IIAI 3rd International Conference on Advanced Applied Informatics > 464 - 469

2014 IIAI 3rd International Conference on Advanced Applied Informatics (IIAIAAI)

In this paper, we propose a method for conversation summarization. For the method, we combine two approaches, a scoring method and a machine learning technique (SVMs). First we compare important utterance extraction by the scoring method and SVMs. In the machine learning technique, we introduce verbal features, such as relations between utterances and anaphora features, and nonverbal features. Next...

INFONA - science communication portal

Search results

Discretizing Numerical Attributes in Decision Tree for Big Data Analysis

Process and deviation exploration through Alpha-algorithm and Heuristic miner techniques

A combined approach for ice sheet elevation extraction from lidar point clouds

A Review: The Effects of Imperfect Data on Incremental Decision Tree

An Asynchronous Periodic Sequential Patterns Mining Algorithm with Multiple Minimum Item Supports

Considerations on the information and entropy of ordinal data

On new sequential hard c-means and its kernelization

Palmprint principal lines extraction

Hyperspectral images compression based on independent component analysis: ROI-based compression algorithm for hyperspectral images

Mining approximate multi-relational patterns

Face recognition analysis for noise images based on combinational mirror-like odd and even features

Analysis of visually guided tracking performance in Parkinson's disease

Overcoming the domain barrier in opinion extraction

Structural analysis and regular expressions based noise elimination from web pages for web content mining

Examining Case Management Demand Using Event Log Complexity Metrics

A fast clustering method based on multi-splitting grid

Web Data Extraction Based on Visual Information and Partial Tree Alignment

A Practical Approach on Cleaning-Up Large Data Sets

Novel Proposal and Evaluation of Information Extraction Method from Artificial Fiber Pattern Using a Camera

Multi-party Conversation Summarization Based on Sentence Selection Using Verbal and Nonverbal Information

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options