Search results

Items from 1 to 13 out of 13 results

chapter

Classifying commit messages: A case study in resampling techniques

SeyedHamid Shekarforoush, Robert Green, Robert Dyer

2017 International Joint Conference on Neural Networks (IJCNN) > 1273 - 1280

In practice, many real-world datasets are imbalanced, with one of two classes dominating the data. These datasets are generally difficult to classify using machine learning algorithms, as the skewed nature of the data has a significant impact on the training process. In order to combat this difficulty, many methods of undersampling and oversampling have been proposed...

chapter

Applying Combinatorial Testing to Data Mining Algorithms

Jaganmohan Chandrasekaran, Huadong Feng, Yu Lei, D. Richard Kuhn, et al.

2017 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW) > 253 - 261

Data mining algorithms are used to analyze and discover useful information from data. This paper presents an experiment that applies Combinatorial Testing (CT) to five data mining algorithms implemented in an open-source data mining software called WEKA. For each algorithm, we first run the algorithm with 51 datasets to study the impact different datasets have on the test coverage. We select one dataset...

chapter

Auto-Tagging for Massive Online Selection Tests: Machine Learning to the Rescue

S. Krithivasan, S. Gupta, S. Shandilya, K. Arya, et al.

2016 IEEE Eighth International Conference on Technology for Education (T4E) > 204 - 207

The Difficulty Level of a question is relative to that of other questions in a test and also to the test takers; hence manually assigned Difficulty Level tags may not be accurate. There is a need to infer them from historical data pertaining to the performance of students in a test. The e-Yantra Robotics Competition (eYRC) is an annual competition having around 5000 teams (20,000 students) registering in...

chapter

Using personal preference in calculating rating scores for recommendations

Chie-Hong Lee, Yann-Yean Su, Pang-Ming Chu, Shie-Jue Lee

2016 IEEE International Conference on Information and Automation (ICIA) > 1149 - 1153

Online shopping is a common way to shop nowadays. Rating mechanisms exist on most shopping sites. Therefore, predicting which products a customer is going to buy next from the rating information becomes possible, making recommender systems important for online shopping. The success of an online shopping site can be dominated by the quality of the recommender system...

chapter

Edge-based depth gradient refinement for 2D to 3D learned prior conversion

Jose L. Herrera, Carlos R. del-Blanco, Narciso García

2015 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON) > 1 - 4

2015 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON 2015)

2D-to-3D conversion is an important task for reducing the current gap between the number of 3D displays and the available 3D content. Here, we present an automatic 2D-to-3D image conversion approach based on machine learning principles. Stemming from the hypothesis that images with a similar structure likely have a similar 3D structure, the depth of a query color image is estimated using a color plus...

chapter

A Practical Approach on Cleaning-Up Large Data Sets

Marius Barat, Dumitru Bogdan Prelipcean, Dragos Teodor Gavrilut

2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing > 280 - 284

2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)

In this paper we propose a noise detection system based on similarities between instances. Given a data set with instances that belong to multiple classes, a noise instance denotes a wrongly classified record. The similarity between differently labeled instances is determined by computing distances between them using several of the standard metrics. In order to ensure that this approach is computational...

chapter

Zebrafish Larva Locomotor Activity Analysis Using Machine Learning Techniques

Hao Zhang, Scott C. Lenaghan, Michelle H. Connolly, Lynne E. Parker

2013 12th International Conference on Machine Learning and Applications > 1 > 161 - 166

2013 12th International Conference on Machine Learning and Applications (ICMLA)

Zebrafish larvae have become a popular model organism to investigate genetic and environmental factors affecting behavior. However, difficulties exist in the analysis of complex behaviors from a large array of larvae. In this paper, we present a new application of machine learning techniques in bioinformatics to automatically detect and investigate the locomotor activities of zebrafish larvae...

chapter

A novel semi-supervised approach for network traffic clustering

Yu Wang, Yang Xiang, Jun Zhang, Shunzheng Yu

2011 5th International Conference on Network and System Security > 169 - 175

2011 5th International Conference on Network and System Security (NSS)

Network traffic classification is an essential component for network management and security systems. To address the limitations of traditional port-based and payload-based methods, recent studies have been focusing on alternative approaches. One promising direction is applying machine learning techniques to classify traffic flows based on packet and flow level statistics. In particular, previous...

chapter

Graph-Cut Based Iterative Constrained Clustering

Masayuki Okabe, Seiji Yamada

2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology > 3 > 126 - 129

2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)

This paper proposes a constrained clustering method that is based on a graph-cut problem formalized by SDP (Semi-Definite Programming). Our SDP approach has the advantage of convenient constraint utilization compared with conventional spectral clustering methods. The algorithm starts from a single cluster of a complete dataset and repeatedly selects the largest cluster, which it then divides into...

chapter

A new feature selection method based on clustering

Huawen Liu, Yuchang Mo, Jiyi Wang, Jianmin Zhao

2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 2 > 965 - 969

Feature selection is an effective technique for reducing the high dimensionality of data, which is prevalent in many application domains, such as text categorization and bioinformatics, and it can bring many advantages to learning algorithms, such as improving efficiency and avoiding over-fitting. Currently, much effort has been devoted to this field and various feature selection methods have been developed...

chapter

Learning a Combination of Dissimilarities from a Set of Equivalence Constraints

Manuel Martín-Merino

2010 IEEE International Conference on Data Mining Workshops > 41 - 48

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

Applications have emerged in recent years in which several dissimilarities and data sources provide complementary information about the problem. Therefore, metric learning algorithms should be developed that integrate all this information in order to better reflect what is similar for the user and the problem at hand. In this paper, we propose a semi-supervised algorithm to learn a linear combination...

chapter

Online topic detection and tracking of financial news based on hierarchical clustering

Xiang-Ying Dai, Qing-Cai Chen, Xiao-Long Wang, Jun Xu

2010 International Conference on Machine Learning and Cybernetics > 6 > 3341 - 3346

2010 International Conference on Machine Learning and Cybernetics (ICMLC 2010)

In this paper, we apply TDT technology to the vertical search engine in the financial field. The returned results are grouped into several topics with the stock as the unit. Then we show the topics to the users in time series order. As a result, users can easily learn about the important events which belong to a stock. Moreover, the causes and the effects of these events can also be found out easily...

chapter

Low-rank kernel learning for semi-supervised clustering

M Soleymani Baghshah, S Bagheri Shouraki

9th IEEE International Conference on Cognitive Informatics (ICCI'10) > 567 - 572

2010 9th IEEE International Conference on Cognitive Informatics (ICCI)

In the last decade, there has been a growing interest in distance function learning for semi-supervised clustering settings. In addition to the earlier methods that learn Mahalanobis metrics (or equivalently, linear transformations), some nonlinear metric learning methods have also been recently introduced. However, these methods either allow limited choice of distance metrics yielding limited flexibility...

Filter options

Data set: ieee
Keywords: CLUSTERING ALGORITHMS, MEASUREMENT, MACHINE LEARNING
Publication type: book

INFONA - science communication portal
