The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Missing values becomes one of the problems that frequently occur in the data observation or data recording process. The needs of data completeness of the observation data for the uses of advanced analysis becomes important to be solved. Conventional method such as mean and mode imputation, deletion, and other methods are not good enough to handle missing values as those method can caused bias to the...
Open Source Software (OSS) is distributed and maintained collaboratively by developers all over the world. However, frequent personnel turnover and lack of organizational management makes it difficult to capture the actual development effort. Various OSS maintenance effort estimation approaches have been developed to provide a way to understand and estimate development effort. The goal of this study...
Identifying and detecting the unknown abnormal sparse signal has become an important issue for distributed networks. In this paper, we proposed a new detection scheme based on convex optimization for wireless sensor networks. Under the Neyman-Pearson testing framework, the detection scheme first estimates the unknown signal by employing the convex optimization at the fusion center. Then the sensor...
Constraint-based sequential pattern mining algorithms discover sequential patterns among from sequence data and the resultant sequential patterns satisfy a given constraint. For time stamped sequences duration and/or gap constraints can be applied to obtain corresponding constraint-based sequential patterns. One of the shortcomings of existing algorithms is the requirement to pre-specify a time window...
The research reported in this paper is related to an assistive technology that may be used by healthcare services for elderly and disabled people. A system for non-intrusive monitoring of the movements of such persons in their home environment, based on impulse-radar sensors, is adressed. A large family of new procedures for the estimation of position and walking velocity of a monitored person, on...
For multi-sensor data fusion applications the accurate alignment of different sensor data is essential for the proper combination of matching features. In food inspection system the boxing often is in a rectangular shape. This knowledge can be used to rectify the image data, an important step in the alignment stage. In case of low contrast between boxing and background, the detected contour may differ...
Determining similar temporal patterns and unearthing eccentric patterns require an efficient similarity measure and approach for association patterns support estimation. This research addresses the similarity measure for revealing similar temporal patterns using a similarity measure and approach for estimating support bounds of temporal patterns. A case study is demonstrated to show working of the...
In this paper we propose a new machine learning concept called randomized machine learning, in which model parameters are assumed random and data are assumed to contain random errors. Distinction of this approach from “classical” machine learning is that optimal estimation deals with the probability density functions of random parameters and the “worst” probability density of random data errors. As...
Anomaly detection is a hot research field in the area of machine learning and data mining. The current outlier mining approaches which are based on the distance or the nearest neighbor are resulted in too long operation time results when using for the high-dimensional and massive data. Many improvements have been proposed to improve the results of the algorithms, but not yet satisfy the demand of...
Changes in the left ventricle function produce alternans in the hemodynamic and electric behavior of the cardiovascular system. A total of 49 cardiomyopathy patients have been studied based on the blood pressure signal (BP), and were classified according to the left ventricular ejection fraction (LVEF) in low risk (LR: LVEF>35%, 17 patients) and high risk (HR: LVEF<35, 32 patients) groups. We...
Anomaly detection in bipartite graph is of great use in many real applications and therefore it attracts numerous research efforts. This work formulates the supervision on abnormal activities in vehicle inspection stations as an anomaly detection problem in weighted bipartite graph. Relevance scores and normality scores are computed for registration districts and inspection stations. The suspicion...
An important problem that remains in online data mining systems is how to accurately and efficiently detect changes in the underlying distribution of large data streams. The challenge for change detection methods is to maximise the accumulative effect of changing regions with unknown distribution, while at the same time providing sufficient information to describe the nature of the changes. In this...
Current research in the field of Wireless Acoustic Sensor Networks (WASN) is gradually introducing the use of sound spatial techniques in the field of binaural hearing aids, in which sound environment information must be extracted in order to tune up the main hearing aid algorithms. In binaural hearing aids, computational capability, memory and data transmission are strictly constrained, which makes...
Land damage resulting from mining activities, especially precious farmland, is an increasingly hot topic, and damaged land should be properly reused to meet human demand through land reclamation. Accurate estimation of land damage degree is of great importance in the reuse planning of damaged land resources. Generally, it is inaccurately analysis without considering original terrain factors. This...
Two extraction models applied in estimating forest height and above-ground biomass (AGB) were developed using the X-band Interferometric Synthetic Aperture Radar (InSAR) data, which was acquired from the China airborne SAR system in 2013 covering part of forested area in northeast China. The models using multi-passes InSAR data for the estimations of forest height and AGB were introduced respectively...
Mining spatial co-location pattern is one of the most important researches in the field of spatial data mining. In the past researches, many spatial co-location pattern mining algorithms and the expansions about these algorithms have been proposed. However, some of these methods often produce a large number of patterns which are difficult to use. If we want to use the subset of the prevalent co-location...
This paper presents a symbolic dynamic method for real-time estimation of battery state-of-charge (SOC). In the proposed method, symbol strings are generated by partitioning (finite-length) time windows of synchronized input-output (e.g., current-voltage) pairs in the respective two-dimensional space. Then, a special class of probabilistic finite state automata (PFSA), called D-Markov machine, is...
In this report, we discuss about a data mining method for multi-modal in-vehicle sensor signals. Though several methods were previously proposed for symbolization of the multi-modal in-vehicle signals, the effectiveness of such method were not discussed in detail. We adopt the Bag-of-System (BoS) technique, a variation of the Bag-of-Features for sensor signals used in motion analysis, and evaluate...
Estimation process for baseline of Serum Creatinine (SCr) is constructed for Acute Kidney Injury (AKI) where baseline is necessary for definition. In order to deal with missing value, the estimation process calculates the baseline values for different stable interval, which the definition process select the appropriate interval based on the number of days from the target test day. The estimation process...
Given a common dataset, two methods operating on that dataset and reported equal-error rate (EER) for each method, then we can estimate whether the two methods differ significantly at the threshold leading to the EER. This enables the calculation of a boundary on the significance for methods where the significance was not reported in the original paper or to compare new methods to older ones by evaluating...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.