GitHub, one of the most popular social coding platforms, is the platform of reference when mining Open Source repositories to learn from past experiences. In recent years, a number of research papers have been published reporting findings based on data mined from GitHub. As the community deepens its understanding of software engineering through analyses performed on this platform,...
Exception handling is a technique that addresses exceptional conditions in applications, allowing the normal flow of execution to continue in the event of an exception and/or to report on such events. Although exception handling techniques, features and bad coding practices have been discussed both in developer communities and in the literature, there is a marked lack of empirical evidence on how...
LSB substitution steganography takes only the least significant bits of the carrier into account, which leads to low security and poor robustness. This paper proposes a self-contained steganography scheme combining MSB matching and LSB substitution. It contains two types of encoding rules to define the matching result between the secret-information binary stream and the two most significant bit...
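The baseline technique this abstract critiques can be made concrete with a minimal sketch of plain LSB substitution (not the paper's MSB-matching scheme); the sample values and bit stream below are illustrative assumptions:

```python
def embed_lsb(samples, bits):
    """Embed a bit list into the least significant bit of each sample."""
    out = list(samples)
    for i, b in enumerate(bits):
        out[i] = (out[i] & ~1) | b  # clear bit 0, then set it to the secret bit
    return out

def extract_lsb(samples, n_bits):
    """Recover the first n_bits embedded bits from the stego samples."""
    return [s & 1 for s in samples[:n_bits]]

carrier = [200, 13, 77, 64, 129, 54]
secret = [1, 0, 1, 1, 0, 0]
stego = embed_lsb(carrier, secret)
assert extract_lsb(stego, len(secret)) == secret
```

Because each sample changes by at most 1, the carrier is barely distorted; but because the embedding position is fixed and sequential, the hidden stream is trivial to locate, which is the weakness the abstract highlights.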
In order to simulate this feature and detect the salient region rapidly, we propose the Spatial-Temporal Feature in Compressed Domain (STFCD) model. Using the H.264 residual coding length and motion vector coding length respectively, we simulate the salient stimulus intensity and obtain video saliency features. Finally, we use a linear weighted fusion algorithm to produce the final video saliency maps...
LSB techniques generally embed data in the same LSB position of consecutive samples, which helps intruders extract secret information easily. This paper solves this problem by introducing a robust audio steganography technique where data is embedded in multiple LSB layers chosen randomly and in non-consecutive samples. The choice of random LSB layers and non-consecutive samples for embedding increases...
A+, a.k.a. Adjusted Anchored Neighborhood Regression, is a state-of-the-art method for exemplar-based single image super-resolution with low time complexity at both train and test time. By robustly training a clustered regression model over a low-resolution dictionary, its performance keeps improving with the dictionary size, even when using tens of thousands of regressors. However, this can pose a...
Convolutive non-negative matrix factorization (CNMF) is a promising method for extracting features from sequential multivariate data. Conventional algorithms for CNMF require that the structure, or the number of bases for expressing the data, be specified in advance. We are concerned with the issue of how we can select the best structure of CNMF from given data. We first introduce a framework of probabilistic...
HEVC (H.265) has brought significant improvements in coding efficiency. However, the reduction in bitrate comes with an increase in computational complexity. This paper presents a data mining approach to reducing the complexity of inter partition modes in HEVC. Determining CU splitting in inter partition modes requires substantial resources, so the goal of this work is to terminate...
Dynamic Time Warping (DTW) distance has been effectively used in mining time series data in a multitude of domains. However, in its original formulation, DTW is extremely inefficient at comparing long sparse time series containing mostly zeros and some unevenly spaced non-zero observations. The original DTW distance does not take advantage of this sparsity, leading to redundant calculations and a prohibitively...
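For reference, the original quadratic formulation the abstract refers to can be sketched as a standard dynamic program (this shows the baseline, not the paper's sparsity-aware variant):

```python
def dtw(a, b):
    """Classic O(len(a) * len(b)) dynamic time warping distance
    with absolute difference as the local cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    # D[i][j] = minimal warping cost aligning a[:i] with b[:j]
    D = [[inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # insertion
                                 D[i][j - 1],      # deletion
                                 D[i - 1][j - 1])  # match
    return D[n][m]

assert dtw([1, 2, 3], [1, 2, 3]) == 0.0
# A repeated zero can be absorbed by warping, so the distance stays 0:
assert dtw([0, 0, 1, 0], [0, 1, 0]) == 0.0
```

Every cell of the full `n × m` table is filled even when both series are almost entirely zeros, which is exactly the redundancy the abstract points out.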
Key foundational components of Big Data frameworks include efficient large-scale storage and high-performance linear algebra. This paper discusses efficient implementations that utilize compression techniques inspired by columnar relational databases for improving space and time profiles for vector and matrix operations. In addition, linear algebra operations are integrated with columnar relational...
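One way columnar-database compression can speed up linear algebra, as the abstract describes, is to run-length encode sparse vectors and operate on the runs directly. A minimal sketch under that assumption (the paper's actual encodings and operator set are not shown here):

```python
from itertools import groupby

def rle_encode(v):
    """Run-length encode a vector as (value, run_length) pairs."""
    return [(val, len(list(g))) for val, g in groupby(v)]

def rle_dot(enc_a, b):
    """Dot product of an RLE-compressed vector with a dense one.
    Zero runs are skipped entirely, so work scales with the number
    of runs rather than the vector length."""
    total, pos = 0, 0
    for val, run in enc_a:
        if val != 0:
            total += val * sum(b[pos:pos + run])
        pos += run
    return total

a = [0, 0, 0, 5, 5, 0, 0, 2]
b = [1, 2, 3, 4, 5, 6, 7, 8]
assert rle_dot(rle_encode(a), b) == sum(x * y for x, y in zip(a, b))
```

The same idea generalizes to matrix-vector products column by column, which is where integrating the operations with columnar storage pays off.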
In recent years, with the popularity of running and other sports applications, trajectory-based friend recommendation has gradually become a hot research topic. In this paper, the θ-ADBSCAN algorithm is used to mine hot trails and resident points from users' trajectories; a trajectory segmentation algorithm is then described, and the trajectory is replaced by the MTR, which is composed...
In the airline industry, a Passenger Name Record (PNR) stores the travel itinerary of an individual or group of passengers travelling together. A PNR always contains all the flight information regarding each segment of a journey, and may contain additional important information such as nationality, gender and age of the passengers. From a commercial point of view, these passenger attributes are of...
Record linkage (RL) is a task in data integration that aims to identify matching records that refer to the same entity from different databases. When records from more than two databases are to be linked, RL is significantly challenged by the intrinsic exponential growth in the number of potential record comparisons to be conducted. We propose a scalable meta blocking protocol to be used for Multi-Database...
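The comparison-explosion problem that blocking addresses can be illustrated with a plain single-key blocking sketch (standard blocking, not the meta-blocking protocol the abstract proposes; records and the blocking key are invented for illustration):

```python
from collections import defaultdict
from itertools import combinations

def block(records, key):
    """Group records by a blocking key so only records that share
    a block are ever compared."""
    blocks = defaultdict(list)
    for rid, rec in records.items():
        blocks[key(rec)].append(rid)
    return blocks

def candidate_pairs(blocks):
    """Candidate comparisons: all pairs within each block."""
    pairs = set()
    for ids in blocks.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs

records = {
    1: {"name": "John Smith", "zip": "90210"},
    2: {"name": "Jon Smith",  "zip": "90210"},
    3: {"name": "Ada Lovelace", "zip": "10115"},
}
pairs = candidate_pairs(block(records, key=lambda r: r["zip"]))
assert pairs == {(1, 2)}  # 1 comparison instead of the 3 exhaustive ones
```

With D databases the exhaustive comparison count grows roughly with the product of their sizes, which is why the abstract calls the growth exponential in the multi-database setting and why blocking must itself stay scalable.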
Generally, different websites have different web page structures, which heavily affects extraction quality when web content is collected automatically. Based on a statistical analysis of the content features and structural characteristics of news-domain web pages, this paper proposes a maximum continuous sum of text density (MCSTD) method to extract web content efficiently and effectively...
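One plausible reading of "maximum continuous sum of text density" is a maximum-subarray (Kadane-style) scan over per-line text densities; the sketch below follows that reading with an invented threshold and density values, and the paper's exact density definition may differ:

```python
def max_density_segment(densities, threshold):
    """Kadane's algorithm over (density - threshold): the contiguous
    run of lines with the maximum sum is taken as the main content."""
    best_sum, best = float("-inf"), (0, 0)
    cur_sum, start = 0.0, 0
    for i, d in enumerate(densities):
        cur_sum += d - threshold
        if cur_sum > best_sum:
            best_sum, best = cur_sum, (start, i)
        if cur_sum < 0:          # a negative prefix can never help;
            cur_sum, start = 0.0, i + 1  # restart after it
    return best  # inclusive (start, end) line indices

# Density = text characters per line; navigation and boilerplate
# lines are short, the article body is long.
densities = [2, 1, 0, 40, 55, 38, 60, 1, 0, 2]
assert max_density_segment(densities, threshold=10) == (3, 6)
```

The appeal of this formulation is that it needs no site-specific templates: the dense contiguous block is found in one linear pass regardless of the page's tag structure.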
The paper presents several variations of fuzzy extractors for generating cryptographic keys and passwords based on keystroke dynamics parameters. A series of simulation experiments was run to estimate the efficiency of these methods, and the best fuzzy extractor parameters were found. The best result was FRR = 0.061 and FAR = 0.023 with a key length of 192 bits.
We introduce a preferences-based itemset mining framework. Preferences are encoded by a penalty function over the transactions in a database. We define an itemset mining problem where we associate a penalty value with each transaction. This problem consists of generating the frequent itemsets under a maximum penalty threshold. We then provide a propositional-satisfiability-based encoding. We extend the...
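Under one reading of the problem statement — an itemset qualifies when its support meets a minimum and the summed penalty of its supporting transactions stays under a ceiling — a naive enumeration can be sketched as follows (the paper's exact semantics and its SAT encoding are not shown; the thresholds and data are illustrative):

```python
from itertools import combinations

def frequent_itemsets(db, penalties, min_support, max_penalty):
    """Naive enumeration: an itemset qualifies if enough transactions
    contain it AND the summed penalty of those transactions stays
    under the threshold."""
    items = sorted({i for t in db for i in t})
    result = {}
    for size in range(1, len(items) + 1):
        for cand in combinations(items, size):
            covered = [k for k, t in enumerate(db) if set(cand) <= t]
            if (len(covered) >= min_support
                    and sum(penalties[k] for k in covered) <= max_penalty):
                result[cand] = len(covered)
    return result

db = [{"a", "b"}, {"a", "b", "c"}, {"a", "c"}, {"b"}]
penalties = [5, 1, 1, 1]  # transaction 0 is strongly dis-preferred
out = frequent_itemsets(db, penalties, min_support=2, max_penalty=4)
assert ("a", "c") in out       # supported by two low-penalty transactions
assert ("a", "b") not in out   # frequent, but its support incurs penalty 6
```

Note how the penalty constraint can reject an itemset that is frequent in the classical sense, which is precisely what makes the problem a preference-aware variant of frequent itemset mining.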
Discovering useful patterns plays an essential role in data management and data mining. Frequent itemset mining in uncertain transaction databases semantically and computationally differs from traditional techniques applied on (standard) precise transaction databases. Uncertain transaction databases consist of sets of existentially uncertain items. The uncertainty of items in transactions makes traditional...
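The semantic difference the abstract mentions is usually handled through expected support: items carry existence probabilities, and an itemset's support becomes the sum over transactions of the probability that all its items exist. A minimal sketch assuming item independence (the uncertain database below is invented):

```python
from math import prod

def expected_support(itemset, uncertain_db):
    """Expected support under item independence: for each transaction
    (a dict mapping item -> existence probability), add the probability
    that every item in the itemset is present."""
    total = 0.0
    for trans in uncertain_db:
        if all(i in trans for i in itemset):
            total += prod(trans[i] for i in itemset)
    return total

db = [
    {"a": 0.9, "b": 0.5},
    {"a": 0.4},
    {"a": 1.0, "b": 0.8},
]
assert abs(expected_support({"a"}, db) - 2.3) < 1e-9
assert abs(expected_support({"a", "b"}, db) - 1.25) < 1e-9
```

Because support is now fractional and probabilistic, the anti-monotonicity tricks of classical Apriori-style miners must be re-justified, which is what makes the uncertain setting computationally distinct.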
In order to reduce the pressure of data storage and transmission on satellites, researchers have implemented a method for extracting object-region data from remote sensing images in orbit. This method stores and downloads the pixels of a region of interest via region-of-interest labeling. However, encoding data volume (EDV), hardware scale, and real-time property (RTP) are difficult to balance. To solve this...
Grounded theory is an approach that can be used to analyse qualitative data. It is a systematic approach to data collection, handling and analysis. The objective of this paper is to present an adapted grounded theory approach as a data analysis strategy for identifying value-based factors in software development. The grounded theory procedure started with data extraction and initial coding, memo writing and...
We are concerned with the issue of discovering behavioral patterns on the web. When a large amount of web access logs are given, we are interested in how they are categorized and how they are related to activities in real life. In order to conduct that analysis, we develop a novel algorithm for sparse non-negative matrix factorization (SNMF), which can discover patterns of web behaviors. Although...