2015 IEEE International Conference on Information Reuse and Integration (IRI)

Book

2015 IEEE International Conference on Information Reuse and Integration

IEEE

Chapter

Author index

2015 IEEE International Conference on Information Reuse and Integration > 614 - 617

Presents an index of the authors whose articles are published in the conference proceedings record.

Chapter

Observing the Effect of the Choice of Classifier on Bioinformatics Data with Varying Levels of Data Quality and Class Balance

Alireza Fazelpour, Taghi M. Khoshgoftaar, David J. Dittman, Ahmad Abu Shanab

2015 IEEE International Conference on Information Reuse and Integration > 372 - 379

Noise is a prominent challenge found in many bioinformatics datasets and it refers to erroneous or missing data. The presence of noise in gene expression datasets has adverse effects on machine-learning techniques, such as supervised classification algorithms and feature selection techniques. Additionally, the identification of noise and its quantification are challenging tasks that require a proper...

Chapter

Alterations to the Bootstrapping Process within Random Forest: A Case Study on Imbalanced Bioinformatics Data

Taghi M. Khoshgoftaar, Alireza Fazelpour, David J. Dittman, Amri Napolitano

2015 IEEE International Conference on Information Reuse and Integration > 342 - 348

Class imbalance is a significant challenge that practitioners in the field of bioinformatics face on a daily basis. It is a phenomenon that occurs when the number of instances of one class is much greater than the number of instances of the other class(es), and it has adverse effects on the performance of classification models built on this skewed data. Random Forest as a robust classifier has been...

Chapter

Fast Text Classification Using Randomized Explicit Semantic Analysis

Aibek Musaev, De Wang, Saajan Shridhar, Calton Pu

2015 IEEE International Conference on Information Reuse and Integration > 364 - 371

Document classification, or document categorization, is one of the most studied areas in computer science due to its importance. The problem is to assign a document, using its text, to one or more classes or categories from a predefined set. We propose a new approach for fast text classification using randomized explicit semantic analysis (RS-ESA). It is based on a state-of-the-art approach for word sense...

Chapter

Test Reactive Systems with Buchi Automata: Acceptance Condition Coverage Criteria and Performance Evaluation

Bolong Zeng, Li Tan

2015 IEEE International Conference on Information Reuse and Integration > 380 - 387

Buchi automata have been used to specify and reason about linear temporal requirements of reactive systems. A reactive system interacts with its environment constantly, and its executions may be modeled as infinite words. A key question in testing a reactive system is how to make testing relevant to the system's requirement, that is, to focus testing on the required behaviors in terms of infinite words...

Chapter

Choosing an Appropriate Ensemble Classifier for Balanced Bioinformatics Data

Alireza Fazelpour, Taghi M. Khoshgoftaar, David J. Dittman, Amri Napolitano

2015 IEEE International Conference on Information Reuse and Integration > 17 - 24

Bioinformatics datasets have a number of characteristics, such as noisy data and difficult-to-learn class boundaries, which make it challenging to build effective predictive models. One option for improving results is the use of ensemble learning methods, which involve combining the results of multiple predictive models into a single decision. Since we do not rely on a single model, we reduce the...

Chapter

International Technical Program Committee

2015 IEEE International Conference on Information Reuse and Integration > xix - xx

Provides a listing of current committee members and society officers.

Chapter

[IEEE IRI 2015 Invited Industry Speakers - 3 abstracts]

2015 IEEE International Conference on Information Reuse and Integration > xxxiii - xxxv

Provides an abstract for each of the three keynote presentations and a brief professional biography of each presenter. The complete presentations were not made available for publication as part of the conference proceedings. The titles of the presentations are: "Data science enabled resiliency analytics and beyond;" "Multi-Layered Access Control with Oracle Database Vault;" and...

Chapter

Keynotes

2015 IEEE International Conference on Information Reuse and Integration > xxii - xxviii

Provides an abstract for each of the keynote presentations and a brief professional biography of each presenter. The complete presentations were not made available for publication as part of the conference proceedings.

Chapter

A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data

Aaron N. Richter, Taghi M. Khoshgoftaar, Sara Landset, Tawfiq Hasanin

2015 IEEE International Conference on Information Reuse and Integration > 1 - 8

Big data is a big business, and effective modeling of this data is key. This paper provides a comprehensive multidimensional analysis of various open source tools for machine learning with big data. An evaluation standard is proposed along with detailed comparisons of the frameworks discussed, with regard to algorithm availability, scalability, speed, and more. The major tools profiled are Mahout,...

Chapter

SDPA: Sensor Data Processing Architecture for Modeling Semantic Data from Sensor Steams

Seungmin Seo, Sejin Chun, Byungkook Oh, Kyong-Ho Lee

2015 IEEE International Conference on Information Reuse and Integration > 9 - 16

With the rapid deployment of a number of sensors, it is crucial to efficiently manage their data streams with heterogeneous properties. To achieve various sensor applications such as discovery and mashup, a method of retrieving meaningful information from raw sensor data is required. However, it is hard to analyze and represent the sensor data since sensors generate streaming data of different patterns...

Chapter

Steering Committee

2015 IEEE International Conference on Information Reuse and Integration > xxi

Provides a listing of current committee members and society officers.

Chapter

Link Analysis of Wikipedia Documents Using MapReduce

Vasa Hardik, Vasudevan Anirudh, Palanisamy Balaji

2015 IEEE International Conference on Information Reuse and Integration > 582 - 588

Wikipedia, a collaborative and user-driven encyclopedia, is considered to be the largest content thesaurus on the web, expanding into a massive database housing a huge amount of information. In this paper, we present the design and implementation of a MapReduce-based Wikipedia link analysis system that provides a hierarchical examination of document connectivity in Wikipedia and captures the semantic...

Chapter

Computational Cost of Querying for Related Entities in Different Ontologies

Chung Ming Cheung, Yinuo Zhang, Anand Panangadan, Viktor K. Prasanna

2015 IEEE International Conference on Information Reuse and Integration > 534 - 541

The computational cost of querying for similar entities across ontologies is high since, in the worst case, every pair of entities will have to be considered. Therefore, links discovered during ontology alignment have been used to speed up querying across ontologies by following relatedness links to discover similar entities. We derive the computational complexity of querying across ontologies using...

Chapter

Uncertainty Nonlinear Systems Modeling with Fuzzy Equations

Raheleh Jafari, Wen Yu

2015 IEEE International Conference on Information Reuse and Integration > 182 - 188

Many uncertain nonlinear systems can be modeled by linear-in-parameter models. The uncertainties can be regarded as parameter changes, which can be described as fuzzy numbers. These models are fuzzy equations. They are alternative models for uncertain nonlinear systems. The modeling of the uncertain nonlinear systems is to find the coefficients of the fuzzy equation. Since the coefficients are in...

Chapter

DCCSOA: A Dynamic Cloud Computing Service-Oriented Architecture

Mehdi Bahrami, Mukesh Singhal

2015 IEEE International Conference on Information Reuse and Integration > 158 - 165

The emerging field of Cloud Computing provides several advantages over traditional in-house IT services, such as access to elastic on-demand computing and storage over the Internet, and cost-effective pay-per-use subscription plans. However, according to the International Data Corporation (IDC), cloud computing has several issues, such as a lack of standardization, a lack of customization, and...

Chapter

Negative-Based Sampling for Multimedia Retrieval

Hsin-Yu Ha, Shu-Ching Chen, Mei-Ling Shyu

2015 IEEE International Conference on Information Reuse and Integration > 64 - 71

Nowadays, with today's high-tech lifestyle, a profusion of multimedia data is produced and propagated around the world. One of the major challenges in identifying meaningful semantic concepts from this large amount of data is the data imbalance problem. Data imbalance occurs when the number of positive instances (i.e., instances which contain the target concept) is much smaller than the number...

Chapter

State Estimation of a Distribution System Using WLS and EKF Techniques

Faridoon Shabani, Masoumeh Seyedyazdi, Mohammad Vaziri, Mahyar Zarghami

2015 IEEE International Conference on Information Reuse and Integration > 609 - 613

State estimation of a discernible active distribution network is researched. A system is discernible if the state of the network can be completely ascertained. An active distribution network is one that includes Distributed Generation (DG) units. A modified IEEE34-node test feeder including two DG units is analyzed. The forward-backward sweep technique provides the state estimation measurement data...
