2015 IEEE International Conference on Information Reuse and Integration (IRI)

Items from 1 to 18 out of 18 results

chapter

Observing the Effect of the Choice of Classifier on Bioinformatics Data with Varying Levels of Data Quality and Class Balance

Alireza Fazelpour, Taghi M. Khoshgoftaar, David J. Dittman, Ahmad Abu Shanab

2015 IEEE International Conference on Information Reuse and Integration > 372 - 379

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Noise is a prominent challenge found in many bioinformatics datasets and it refers to erroneous or missing data. The presence of noise in gene expression datasets has adverse effects on machine-learning techniques, such as supervised classification algorithms and feature selection techniques. Additionally, the identification of noise and its quantification are challenging tasks that require a proper...

chapter

Alterations to the Bootstrapping Process within Random Forest: A Case Study on Imbalanced Bioinformatics Data

Taghi M. Khoshgoftaar, Alireza Fazelpour, David J. Dittman, Amri Napolitano

2015 IEEE International Conference on Information Reuse and Integration > 342 - 348

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Class imbalance is a significant challenge that practitioners in the field of bioinformatics are faced with on a daily basis. It is a phenomenon that occurs when number of instances of one class is much greater than number of instances of the other class(es) and it has adverse effects on the performance of classification models built on this skewed data. Random Forest as a robust classifier has been...

chapter

Fast Text Classification Using Randomized Explicit Semantic Analysis

Aibek Musaev, De Wang, Saajan Shridhar, Calton Pu

2015 IEEE International Conference on Information Reuse and Integration > 364 - 371

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Document classification or document categorization is one of the most studied areas in computer science due to its importance. The problem is to assign a document using its text to one or more classes or categories from a predefined set. We propose a new approach for fast text classification using randomized explicit semantic analysis (RS-ESA). It is based on a state of the art approach for word sense...

chapter

Using Ensemble Learners to Improve Classifier Performance on Tweet Sentiment Data

Joseph Prusa, Taghi M. Khoshgoftaar, Daivd J. Dittman

2015 IEEE International Conference on Information Reuse and Integration > 252 - 257

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Sentiment analysis of tweets requires the ability to reliably and accurately identify the emotional polarity (positive or negative) of instances. This can be challenging, particularly when the data quality is questionable due to noise or imbalance. Ensemble learning algorithms have been shown to offer superior performance compared to non-ensemble techniques in many domains, but have not been thoroughly...

chapter

Ensemble Learning from Imbalanced Data Set for Video Event Detection

Yimin Yang, Shu-Ching Chen

2015 IEEE International Conference on Information Reuse and Integration > 82 - 89

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Learning from imbalanced data sets is a hot and challenging research topic with many real world applications. Many studies have been conducted on integrating sampling-based techniques and ensemble learning for imbalanced data sets. However, most existing sampling methods suffer from the problems of information loss, over-fitting, and additional bias. Moreover, there is no single model that can be...

chapter

The Effect of Data Sampling When Using Random Forest on Imbalanced Bioinformatics Data

David J. Dittman, Taghi M. Khoshgoftaar, Amri Napolitano

2015 IEEE International Conference on Information Reuse and Integration > 457 - 463

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Ensemble learning is a powerful tool that has shown promise when applied towards bioinformatics datasets. In particular, the Random Forest classifier has been an effective and popular algorithm due to its relatively good classification performance and its ease of use. However, Random Forest does not account for class imbalance which is known for decreasing classification performance and increasing...

chapter

Improving Pharmacological Research of HIV-1 Integrase Inhibition Using Differential Evolution - Binary Particle Swarm Optimization and Nonlinear Adaptive Boosting Random Forest Regression

Richard Adrian Galvan, Ahmad Reza Hadaegh, Matinehalsadat Kashani Moghaddam

2015 IEEE International Conference on Information Reuse and Integration > 485 - 490

2015 IEEE International Conference on Information Reuse and Integration (IRI)

In this work, we present results produced from a nonlinear QSAR model developed and implemented using evolutionary computation and Random Forest Regression to study the effectiveness of dimeric Aryl ß-Diketo Acids on HIV-1 Integrase enzyme inhibition. Dimeric Aryl ß-Diketo Acids have been proven to be effective inhibitors of the biological mechanism of protein transfer known as HIV-integrase. This...

chapter

Neural Network-Based Vector Representation of Documents for Reader-Emotion Categorization

Yu-Lun Hsieh, Shih-Hung Liu, Yung-Chun Chang, Wen-Lian Hsu

2015 IEEE International Conference on Information Reuse and Integration > 569 - 573

2015 IEEE International Conference on Information Reuse and Integration (IRI)

In this paper, we propose a novel approach for reader-emotion categorization using word embedding learned from neural networks and an SVM classifier. The primary objective of such word embedding methods involves learning continuous distributed vector representations of words through neural networks. It can capture semantic context and syntactic cues, and subsequently be used to infer similarity measures...

chapter

PNA: Partial Network Alignment with Generic Stable Matching

Jiawei Zhang, Weixiang Shao, Senzhang Wang, Xiangnan Kong, more

2015 IEEE International Conference on Information Reuse and Integration > 166 - 173

2015 IEEE International Conference on Information Reuse and Integration (IRI)

To enjoy more social network services, users nowadays are usually involved in multiple online social networks simultaneously. The shared users between different networks are called anchor users, while the remaining unshared users are named as non-anchor users. Connections between accounts of anchor users in different networks are defined as anchor links and networks partially aligned by anchor links...

chapter

Gender Prediction in Random Chat Networks Using Topological Network Structures and Masked Content

Michael Crawford, Xingquan Zhu

2015 IEEE International Conference on Information Reuse and Integration > 174 - 181

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Social media is becoming a critical avenue for businesses today to target new customers and create brand loyalty. In order to target users effectively, companies need to know basic information about their users. However, in many cases, user profiles are either incomplete or completely wrong, and one of the most critical pieces of private information is gender. In this paper we examine the case of...

chapter

Visual and Textual Feature Fusion for Automatic Customs Tariff Classification

Bilgehan Turhan, Gozde B. Akar, Cigdem Turhan, Cihan Yukse

2015 IEEE International Conference on Information Reuse and Integration > 76 - 81

2015 IEEE International Conference on Information Reuse and Integration (IRI)

The Harmonized Tariff Schedule for the classification of goods is a major determinant of customs duties and taxes. The basic HS Code is 6 digits long but can be extended according to the needs of the countries such as application of custom duties based on details of the product. Finding the correct, consistent, legally defensible HS Code is at the heart of Import Compliance. However finding the best...

chapter

Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data

Joseph Prusa, Taghi M. Khoshgoftaar, David J. Dittman, Amri Napolitano

2015 IEEE International Conference on Information Reuse and Integration > 197 - 202

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Sentiment classification of tweets is used for a variety of social sensing tasks and provides a means of discerning public opinion on a wide range of topics. A potential concern when performing sentiment classification is that the training data may contain class imbalance, which can negatively affect classification performance. A classifier trained on imbalanced data may be biased in favor of the...

chapter

Enabling Linguistic Analysis of Scientific Metadata through Internationalizing NASA JPL's PODAAC

Lewis J. McGibbney, Kim D. Whitehall, Chris A. Mattmann, Phillip M. Carter

2015 IEEE International Conference on Information Reuse and Integration > 207 - 210

2015 IEEE International Conference on Information Reuse and Integration (IRI)

This paper describes the iPReS project, which provides a web service-based framework for i18n-type (internationalized) access to scientific data products and product metadata contained within the NASA Jet Propulsion Laboratory Physical Oceanography Distributed Active Archive Center, otherwise known as PO.DAAC. PO.DAAC is an element of the EOSDIS, which freely provides science data to the global community...

chapter

A Multi-topic Meta-classification Scheme for Analyzing Lobbying Disclosure Data

Xinpeng L. Liao, Chengcui Zhang, Ariel D. Smith, Grant T. Savage

2015 IEEE International Conference on Information Reuse and Integration > 349 - 356

2015 IEEE International Conference on Information Reuse and Integration (IRI)

For the functioning of American democracy, the Lobbying Disclosure Act (LDA), for the very first time, provides data to empirically research interest groups behaviors and their influence on congressional policymaking. One of the main research challenges is to automatically find the topic(s), by short & sparse text classification, in a large corpus of unorganized, semi-structured, and poorly...

chapter

Classifying Galaxy Images through Support Vector Machines

Kathy Applebaum, Du Zhang

2015 IEEE International Conference on Information Reuse and Integration > 357 - 363

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Galaxies in the universe are commonly classified by their morphology, or visual appearance. The morphology of a galaxy tells us about the history and physical make-up of the galaxy. With the fast pace at which digital galaxy images are captured and a slow and biased human pattern recognition process, finding an efficient way to automate the galaxy image classification process can help advance the...

chapter

Gaussian Mixture Model-Based Subspace Modeling for Semantic Concept Retrieval

Chao Chen, Mei-Ling Shyu, Shu-Ching Chen

2015 IEEE International Conference on Information Reuse and Integration > 258 - 265

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Data mining and machine learning methods have been playing an important role in searching and retrieving multimedia information from all kinds of multimedia repositories. Although some of these methods have been proven to be useful, it is still an interesting and active research area to effectively and efficiently retrieve multimedia information under difficult scenarios, i.e., detecting rare events...

chapter

Efficacy of Season Prediction for Geo-locations Using Geo-tagged Images

Shesha Sreenivasamurthy, Shayna Frank

2015 IEEE International Conference on Information Reuse and Integration > 476 - 484

2015 IEEE International Conference on Information Reuse and Integration (IRI)

Tens of thousands of pictures are taken at different locations throughout the year. People often visit places and take pictures to remember their visits. We believe that the seasonal travel patterns of people to specific locations will create a correlation between a location and the season of the images taken in that location. For example, fewer people visit Bear Valley, California during the summer...

chapter

A Low-Cost Haptic System for Wrist Rehabilitation

Daniela D'Auria, Fabio Persia, Bruno Siciliano

2015 IEEE International Conference on Information Reuse and Integration > 491 - 495

2015 IEEE International Conference on Information Reuse and Integration (IRI)

While the regular treatment for wrist stiffness is physical therapy or surgery, researchers are looking for an alternative, more efficient and automatic procedure by means of robotic applications. In this paper, we propose a low-cost system exploiting a haptic interface aided by a glove sensorized on the wrist allowing the identification of the wrist orientation, in this way, by using virtual reality,...

Filter options

Keywords:
TRAINING

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (5)
ACCURACY (4)
SUPPORT VECTOR MACHINES (4)
BIOINFORMATICS (3)
BIOLOGICAL SYSTEM MODELING (3)
CLASSIFICATION (3)
DATA MINING (3)
DATA MODELS (3)
RADIO FREQUENCY (3)
ROBUSTNESS (3)
SEMANTICS (3)
VEGETATION (3)
ANALYSIS OF VARIANCE (2)
CLASS IMBALANCE (2)
CONTEXT (2)
DATA SAMPLING (2)
DATABASES (2)
EVENT DETECTION (2)
MATHEMATICAL MODEL (2)
MEASUREMENT (2)
MEDIA (2)
METADATA (2)
PREDICTIVE MODELS (2)
RANDOM FOREST (2)
RANDOM FOREST (RF) (2)
SENTIMENT ANALYSIS (2)
STANDARDS (2)
TRAINING DATA (2)
TWEET MINING (2)
ACQUIRED IMMUNODEFICIENCY SYNDROME (AIDS) (1)
ADAPTATION MODELS (1)
ADAPTIVE BOOSTING (ADABOOST) (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AND RANDOM FOREST REGRESSION (RFR) (1)
APACHE TIKA (1)
BAGGING (1)
BIOCHEMISTRY (1)
BOOSTING (1)
BOOTSTRAP (1)
BRIDGES (1)
CLASS NOISE (1)
COMPUTATIONAL MODELING (1)
COMPUTER SCIENCE (1)
CONFERENCES (1)
DATA PRIVACY (1)
DATA QUALITY (1)
DATA-MINING (1)
DIFFERENTIAL EVOLUTION-BINARY PARTICLE SWARM OPTIMIZATION (DE-BPSO) (1)
DOCUMENT REPRESENTATION (1)
DRUGS (1)
EFFICACY OR PREDICTION (1)
ELECTRONIC PUBLISHING (1)
ENCODING (1)
ENCYCLOPEDIAS (1)
ENSEMBLE LEARNING (1)
EUROPE (1)
EXPLICIT SEMANTIC ANALYSIS (1)
FILTER-BASED SUBSET EVALUATOR (1)
FRNN (1)
GALAXY IMAGE CLASSIFICATION (1)
GALAXY ZOO 2 PROJECT (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN MIXTURE MODEL (GMM) (1)
GENDER PREDICTION (1)
GEO-LOCATION (1)
GEO-TAG (1)
GRAY-SCALE (1)
HAPTIC INTERFACE (1)
HAPTIC INTERFACES (1)
HAPTIC SYSTEM (1)
HS CODE (1)
IMAGE EDGE DETECTION (1)
IMBALANCED DATA (1)
IMBALANCED DATA SET (1)
INFORMATION FUSION (1)
INTERNATIONAL TRADE (1)
INTERNET (1)
JOINING PROCESSES (1)
KERNEL (1)
KNN (1)
LEARNING SYSTEMS (1)
MACHINE LEARNING (1)
MACHINE LEARNING ALGORITHMS (1)
MACHINE LEARNING APPLICATIONS (1)
MACHINE TRANSLATION (1)
MACHINE-LEARNING (1)
MASKED CONTENT (1)
MEDICAL TREATMENT (1)
META-CLASSIFIER (1)
METEOROLOGY (1)
MORPHOLOGY (1)
MULTI-CLASS &AMP; MULTI-LABEL CLASSIFICATION (1)
MULTIPLE CORRESPONDENCE ANALYSIS (MCA) (1)
MULTIPLE HETEROGENEOUS SOCIAL NETWORKS (1)
NASA (1)
NEURAL NETWORK (1)
NEURAL NETWORKS (1)
NOISE (1)
OCEANOGRAPHY (1)
more

INFONA - science communication portal

2015 IEEE International Conference on Information Reuse and Integration (IRI) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2015 IEEE International Conference on Information Reuse and Integration (IRI)