Search results

Items from 1 to 18 out of 18 results

chapter

Classification Uncertainty of Multiple Imputed Data

Tuomo Alasalmi, Heli Koskimaki, Jaakko Suutala, Juha Roning

2015 IEEE Symposium Series on Computational Intelligence > 151 - 158

2015 IEEE Symposium Series on Computational Intelligence (SSCI)

Every classification model contains uncertainty. This uncertainty can be distributed evenly or into certain areas of feature space. In regular classification tasks, the uncertainty can be estimated from posterior probabilities. On the other hand, if the data set contains missing values, not all classifiers can be used directly. Imputing missing values solves this problem but it suppresses variation...

chapter

A nonlinear hybrid fuzzy least-squares regression model

O Poleshchuk, E Komarov

2011 Annual Meeting of the North American Fuzzy Information Processing Society > 1 - 6

NAFIPS 2011 - 2011 Annual Meeting of the North American Fuzzy Information Processing Society

A method for quadratic hybrid fuzzy least-squares regression is developed in this paper. Input and output information is presented in the form of trapezoidal fuzzy numbers. The method of regressions creation is based on the transformation of the input and output fuzzy numbers into intervals, which are called weighted intervals. The proposed method extends a group of initial data membership functions...

chapter

A Comparison of Techniques for Handling Incomplete Input Data with a Focus on Attribute Relevance Influence

M Millán-Giraldo, J S Sánchez, V J Traver

2010 Ninth International Conference on Machine Learning and Applications > 819 - 822

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

This work presents a new approach based on support vector regression to deal with incomplete input (unseen) data and compares it to other existing techniques. The empirical analysis has been done over 18 real data sets and using five different classifiers, with the aim of foreseeing which technique can be deemed as more suitable for each classifier. Also, this study tries to devise how the relevance...

chapter

The Research on Chinese Coreference Resolution Based on Support Vector Machines

Yihao Zhang, Peng Jin

2010 Fourth International Conference on Genetic and Evolutionary Computing > 169 - 172

2010 Fourth International Conference on Genetic and Evolutionary Computing (ICGEC 2010)

Coreference is a common linguistic phenomenon in natural language understanding, it plays an important role in simplifying the expression and linking up the context. In this paper, the algorithm of support vector machines is applied to solve the problem of Chinese coreference, we consider fully the important characteristics which related to coreference and integrate them effectively to build model...

chapter

Aggregating Multiple Biological Measurements Per Patient

V B Zubek, F M Khan

2010 Ninth International Conference on Machine Learning and Applications > 788 - 792

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent...

chapter

Improving Kernel Methods through Complex Data Mapping

Hang Zhou, Fabio Ramos, Eric Nettleton

2010 IEEE International Conference on Data Mining > 669 - 678

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

This paper introduces a simple yet powerful data transformation strategy for kernel machines. Instead of adapting the parameters of the kernel function w.r.t. the given data (as in conventional methods), we adjust both the kernel hyper-parameters and the given data itself. Using this approach, the input data is transformed to be more representative of the assumptions encoded in the kernel function...

chapter

Statistics for characterizing data on the periphery

J Theiler, D Hush

2010 IEEE International Geoscience and Remote Sensing Symposium > 4764 - 4767

2010 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2010)

We introduce a class of statistics for characterizing the periphery of a distribution, and show that these statistics are particularly valuable for problems in target detection. Because so many detection algorithms are rooted in Gaussian statistics, we concentrate on ellipsoidal models of high-dimensional data distributions (that is to say: covariance matrices), but we recommend several alternatives...

chapter

Active learning support vector machines to classify imbalanced reservoir simulation data

Tina Yu

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 8

2010 International Joint Conference on Neural Networks (IJCNN 2010)

Reservoir modeling is an on-going activity during the production life of a reservoir. One challenge to constructing accurate reservoir models is the time required to carry out a large number of computer simulations. To address this issue, we have constructed surrogate models (proxies) for the computer simulator to reduce the simulation time. The quality of the proxies, however, relies on the quality...

chapter

Data reconciliation of nonlinear dynamic process based on LSSVM

Xiufang Zhang, Kechang Fu, Rong Zhou

2010 International Conference on Mechanic Automation and Control Engineering > 5714 - 5717

2010 International Conference on Mechanic Automation and Control Engineering (MACE)

A new data reconciliation algorithm based on least squares support vector machines (LSSVM) for nonlinear dynamic process is proposed in this work. Firstly, response of processes and training data is obtained by computation tools or simulation software. Secondly, the local models of processes are identified by LSSVM. Finally, process data reconciliation is transformed to nonlinear program problem with...

chapter

A New Robust Classification Method for CRM

Li Xiaoyu, He Changzheng, Panos Liatsis

2010 Asia-Pacific Conference on Wearable Computing Systems > 70 - 73

2010 Asia-Pacific Conference on Wearable Computing Systems (APWC 2010)

Customer classification is a key step in customer relationship management (CRM), and there are many methods used for it, such as Neural Net, association rules, SOM model, etc. However, most existing methods don't take noise which is very common in reality into consideration. In this paper, we combine Croup Method of Data Handling (GMDH) with Takagi and Sugeno fuzzy model (TS) to form a new classification...

chapter

A hybrid algorithm applied to classify unbalanced data

C Y Lee, M R Yang, L Y Chang, Z J Lee

The 6th International Conference on Networked Computing and Advanced Information Management > 618 - 621

2010 Sixth International Conference on Networked Computing and Advanced Information Management (NCM 2010)

Unbalanced data, minority classes with few samples, present in many applications. It is difficult to solve the problems of unbalanced data by traditional methods. In this paper, a hybrid algorithm based on random over-sampling, decision tree (DT), particle swarm optimization (PSO) and feature selection is proposed to classify unbalanced data. The proposed algorithm has the ability to select beneficial...

chapter

A New Support Vector Data Description with Fuzzy Constraints

M. GhasemiGol, M. Sabzekar, R. Monsefi, M. Naghibzadeh, more

2010 International Conference on Intelligent Systems, Modelling and Simulation > 10 - 14

UKSim/AMSS First International Conference on Intelligent Systems, Modelling and Simulation (ISMS 2010)

This paper presents a novel approach to eliminate the effect of noisy samples from the learning step of support vector data description (SVDD) method. SVDD is a popular kernel method which tries to fit a hypersphere around the target object and can obtain more flexible and more accurate data descriptions by using proper kernel functions. Nonetheless, the SVDD could sometimes generate such a loose...

chapter

Application of Genetic Programming on Temper Mill Datasets

M. Kommenda, G. Kronberger, S. Winkler, M. Affenzeller, more

2009 2nd International Symposium on Logistics and Industrial Informatics > 1 - 5

2009 2nd International Symposium on Logistics and Industrial Informatics (LINDI 2009)

Temper rolling is essential for the quality of steel sheets. The degree of temper rolling determines the mechanical properties of the steel sheet and is highly influenced by the rolling force or strip tension. Since mathematical models generate unsatisfactory results for the calculation of these two process parameters, other methods for the presetting of tempers mills must be used. The parameter presetting...

chapter

Using Anonymized Data for Classification

A. Inan, M. Kantarcioglu, E. Bertino

2009 IEEE 25th International Conference on Data Engineering > 429 - 440

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

In recent years, anonymization methods have emerged as an important tool to preserve individual privacy when releasing privacy sensitive data sets. This interest in anonymization techniques has resulted in a plethora of methods for anonymizing data under different privacy and utility assumptions. At the same time, there has been little research addressing how to effectively use the anonymized data...

chapter

Semi-supervised Learning from General Unlabeled Data

Kaizhu Huang, Zenglin Xu, I. King, M.R. Lyu

2008 Eighth IEEE International Conference on Data Mining > 273 - 282

ICDM 2008. Eighth IEEE International Conference on Data Mining

We consider the problem of semi-supervised learning (SSL) from general unlabeled data, which may contain irrelevant samples. Within the binary setting, our model manages to better utilize the information from unlabeled data by formulating them as a three-class (-1,+1, 0) mixture, where class 0 represents the irrelevant data. This distinguishes our work from the traditional SSL problem where unlabeled...

chapter

Relevant pattern selection for subspace learning

Jin Hee Na, Seok Min Yun, Minsoo Kim, Jin Young Choi

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

In this paper, we propose a scheme to improve the performance of subspace learning by using a pattern (data) selection method as preprocessing. Generally, a training set for subspace learning contains irrelevant or unreliable samples, and removing these samples can improve the learning performance. For this purpose, we use pattern selection preprocessing which discriminates decision boundary/non-boundary...

chapter

Simulation of Time Series Prediction Based on Hybrid Support Vector Regression

Ling Xiang, Gui-ji Tang, Chao Zhang

2008 Fourth International Conference on Natural Computation > 2 > 167 - 171

2008 Fourth International Conference on Natural Computation (ICNC)

The paper proposes a hybrid methodology that exploits the unique strength of the autoregressive integrated moving average model and the support vector machine model in forecasting time series. The simulation experiment results showed that the hybrid model is superior to the individual models for the test values of the turbo-generator vibration. Most of the individual models evaluated showed poor ability...

chapter

A data-driven classification framework for conflict and instability analysis

Kihoon Choi, K.R. Pattipati, V. Asal

2008 IEEE International Conference on Systems, Man and Cybernetics > 114 - 119

2008 IEEE International Conference on Systems, Man and Cybernetics (SMC 2008)

Is it possible to identify and even forecast well in advance (6-12 months) the relative stability of a state to enable policy makers to successfully intervene? How does one acquire that understanding? One technique is to model and understand the social factors, which summarize the background conditions, attributes and performance factors of the country over time. The purpose of this paper is to: (1)...

Filter options

Keywords:
DATA MODELS
DATA HANDLING

Publication date

Set your own date range

Keywords

TRAINING (6)
DATA MINING (5)
KERNEL (5)
CLASSIFICATION ALGORITHMS (4)
COMPUTATIONAL MODELING (4)
OPTIMIZATION (4)
PATTERN CLASSIFICATION (4)
REGRESSION ANALYSIS (4)
TRAINING DATA (4)
ANALYTICAL MODELS (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
MATHEMATICAL MODEL (3)
ACCURACY (2)
DIGITAL SIMULATION (2)
FORECASTING (2)
FUZZY SET THEORY (2)
GAUSSIAN PROCESSES (2)
LEAST SQUARES APPROXIMATIONS (2)
MACHINE LEARNING ALGORITHMS (2)
NOISE (2)
NOISE MEASUREMENT (2)
PATTERN RECOGNITION (2)
PROBABILITY DISTRIBUTION (2)
ACTIVE LEARNING SUPPORT VECTOR MACHINES (1)
AGGREGATING MULTIPLE MEASUREMENTS PER FEATURE (1)
ANOMALY DETECTION (1)
ANONYMIZATION METHODS (1)
ANONYMIZED DATA HANDLING (1)
ARGON (1)
ASSOCIATION RULES (1)
ATTRIBUTE RELEVANCE INFLUENCE (1)
AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODEL (1)
AUTOREGRESSIVE MOVING AVERAGE PROCESSES (1)
BAYES METHODS (1)
BAYESIAN FORMULATION (1)
BAYESIAN METHODS (1)
BENCHMARK CLASSIFIERS (1)
BIOLOGICAL SYSTEM MODELING (1)
BIOPSY (1)
BUILDINGS (1)
CANCER (1)
CHINESE COREFERENCE RESOLUTION (1)
CLASS INFORMATION (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHM (1)
CLASSIFIERS (1)
COMPLEX MAPPING (1)
COMPUTATION TOOLS (1)
COMPUTER SIMULATIONS (1)
COMPUTERS (1)
CONFLICT ANALYSIS (1)
CONFLICT FORECASTING (1)
COREFERENCE RESOLUTION (1)
CORRELATION (1)
COUNTRY INSTABILITY (1)
COVARIANCE MATRIX (1)
CRM (1)
CUSTOMER (1)
CUSTOMER RELATIONSHIP MANAGEMENT (1)
DAIRY PRODUCTS (1)
DATA AGGREGATION METHODS (1)
DATA DISTRIBUTIONS (1)
DATA HANDLING TECHNIQUES (1)
DATA IMPUTATION (1)
DATA MAPPING (1)
DATA PRIVACY (1)
DATA RECONCILIATION (1)
DATA SCALING TECHNIQUES (1)
DATA SELECTION METHOD (1)
DATA TRANSFORMATION (1)
DATA-DRIVEN (1)
DATA-DRIVEN CLASSIFICATION (1)
DECISION BOUNDARY PATTERN (1)
DECISION NONBOUNDARY PATTERN (1)
DECISION TREE (1)
DECISION TREES (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTION PERIPHERY (1)
ELLIPSOIDAL MODELS (1)
ELLIPSOIDS (1)
ENGINEERING COMMUNITY (1)
EVOLUTIONARY INSPIRED BASED MODELING TECHNIQUE (1)
EXPECTATION-MAXIMIZATION (1)
EXPERIMENT (1)
FEATURE EXTRACTION (1)
FEATURE SELECTION (1)
FEATURE SUBSET (1)
FORCE (1)
FORECASTING THEORY (1)
FREQUENCY DOMAIN (1)
FREQUENCY METRICS (1)
FUZZY CONSTRAINTS (1)
FUZZY CONTROL (1)
FUZZY SETS (1)
GAUSSIAN MIXTURE MODELS (1)
GAUSSIAN PROCESS (1)
GAUSSIAN PROCESS LEARNING (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options