Context: Recent studies have shown that performance of defect prediction models can be affected when data sampling approaches are applied to imbalanced training data for building defect prediction models. However, the magnitude (degree and power) of the effect of these sampling methods on the classification and prioritization performances of defect prediction models is still unknown. Goal: To investigate...
A campus card system generates a large amount of data during its operation, and the system itself cannot analyze these data. How to learn from these massive, accumulated data to assist decision-making in student management has become a very practical subject. This paper takes campus card transaction data as the research object and makes comprehensive use of data warehousing, online analysis...
Biological data is often represented as networks, as in the case of protein-protein interactions and metabolic pathways. Modeling, analyzing, and visualizing networks can help make sense of large volumes of data generated by high-throughput experiments. However, due to their size and complex structure, biological networks can be difficult to interpret without further processing. Cluster analysis is...
Many scientific experiments in Bioinformatics are executed as computational workflows. Frequently, it is necessary to re-run an experiment under the original circumstances in which it was run to recognize and validate it. Data provenance concerns the origin of data. Knowing the data source facilitates the understanding and analysis of the results, by detailing and documenting the history and the paths...
Heart failure (HF) has a highly variable annual mortality rate, and there is an urgent need to determine patient prognosis to enable informed decision-making about heart failure treatment strategies. Existing survival risk prediction models either require features that limit their applicability or pose difficulties for parameter estimation, as physicians have to use a limited set of variables with...
Computer science solutions for molecular biology problems are often presented in the form of workflows. There is a set of activities performed by different processing entities through managed tasks. Knowledge about the data trajectory throughout a given workflow enables reproducibility by data provenance. In order to reproduce an in silico bioinformatics experiment one must consider other aspects...
Deep learning algorithms have recently produced state-of-the-art accuracy in many classification tasks, but this success is typically dependent on access to many annotated training examples. For domains without such data, an attractive alternative is to train models with light, or distant supervision. In this paper, we introduce a deep neural network for the Learning from Label Proportion (LLP) setting,...
Social Media allows people to post widely and share the posted online-items. Such items gain their popularity by the amount of attention received. Thus, studies on modeling the arrival process of attention to an individual item have recently attracted a great deal of interest. In this paper, we propose, by combining a Dirichlet process with a Hawkes process in a novel way, a probabilistic model, called...
With the booming development of data mining in social computing systems (e.g., crowdsourcing systems), statistical inference from crowdsourced data serves as a powerful tool for providing diversified services. To support critical applications (e.g., recommendation), in this paper we focus on collaborative ranking problems and construct a system whose input is crowdsourced pairwise comparisons...
Forecasting models that utilize multiple predictors are gaining popularity in a variety of fields. In some cases they allow constructing more precise forecasting models, leveraging the predictive potential of many variables. Unfortunately, in practice we do not know which observed predictors have a direct impact on the target variable. Moreover, adding unrelated variables may diminish the quality...
Global aging brings new challenges to elderly health data management. Existing systems such as HIS and CIS focus on the storage and management of information; their key limitations are that they lack effective mining approaches and usually cannot handle large-scale health data. These drawbacks make it very hard for them to serve as a robust and lightweight system. In this paper, we develop a memory computing...
Randomized experiments have been critical tools of decision making for decades. However, in many important applications, subjects can show significant heterogeneity in response to treatments. Therefore it is not enough to simply know which treatment is optimal for the entire population. What we need is a model that correctly customizes treatment assignment based on subject characteristics. The problem...
User-generated mobile application reviews have become a gold mine for timely identification of functional defects in this type of software artifact. In this work, we develop a hidden structural SVM model for extracting detailed defect descriptions from user reviews at the sentence level. Structured features and constraints are introduced to reduce the demand for exhaustive manual annotation at the sentence...
To improve cancer survival rates and prognosis, one of the first steps is to improve our understanding of contributory factors associated with cancer survival. Prior research has suggested that cancer survival is influenced by multiple factors at multiple levels. Most existing analyses of cancer survival have used data from a single source. Nevertheless, there are key challenges in integrating variables...
The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as the direction with the lowest Kolmogorov complexity. This notion is very powerful, as it can detect any causal dependency that can be explained by a physical process. However, due to the halting problem, it is also not computable. In this paper we propose a computable...
Opinion mining and demographic attribute inference have many applications in social science. In this paper, we propose models to infer daily joint probabilities of multiple latent attributes from Twitter data, such as political sentiment and demographic attributes. Since it is costly and time-consuming to annotate data for traditional supervised classification, we instead propose scalable Learning...
Motivation: Next-generation sequencing (NGS) technologies using DNA, RNA, or methylation sequencing are prevailing tools in modern genome research. For DNA sequencing, whole genome sequencing (WGS) and whole exome sequencing (WES) are two typical applications with different preferences regarding the trade-off between sequencing depth and base coverage. Although sequencing costs have been greatly reduced,...
In today's era of big data, robust least-squares regression becomes a more challenging problem when considering adversarial corruption along with the explosive growth of datasets. Traditional robust methods can handle the noise but suffer from several challenges when applied to huge datasets, including 1) the computational infeasibility of handling an entire dataset at once, and 2) the existence of heterogeneously...
Generative models are used in an increasing number of applications that rely on large amounts of contextually rich information about individuals. Owing to possible privacy violations, however, publishing or sharing generative models is not always viable. In this paper, we introduce a novel solution for privately releasing generative models and entire high-dimensional datasets produced by these models...
We consider the fundamental problem of inferring the causal direction between two univariate numeric random variables X and Y from observational data. The two-variable case is especially difficult to solve since it is not possible to use standard conditional independence tests between the variables. To tackle this problem, we follow an information theoretic approach based on Kolmogorov complexity...