The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
User reported experiences and opinions are used by peers to make decisions about where to go and what to buy. Unfortunately, not all users or opinions are honest. Many opinions are fabricated and may be submitted by automated systems or by people who are recruited by businesses and search engine optimizers to write good reviews. Such reviews and ratings are called spam reviews. These are misleading...
This paper presents iNNE (isolation using Nearest Neighbour Ensemble), an efficient nearest neighbour-based anomaly detection method by isolation. Inne runs significantly faster than existing nearest neighbour-based methods such as Local Outlier Factor, especially in data sets having thousands of dimensions or millions of instances. This is because the proposed method has linear time complexity and...
Labeled data, in real world, is quite scarce compared with unlabeled data. Manual annotation is usually expensive and inefficient. Active learning paradigm is used to handle this problem by identifying the most informative instances to annotate. In this paper, we proposed a new active learning algorithm based on nonparallel support vector machine. Numeric experiment shows the effective performance...
Since link prediction helps improve our understandings about the structure, functions, and evolution of networks, it has drawn much attention from both computer science and physical communities. Among many mainstream proposed algorithms, the common-neighbor based ones show prominent efficiency but neglect the influence of community structure. Based on the assumption that in the same communities common...
A land cover map that represents the land surface of the earth is based primarily on analysis of remotely sensed images. However, the rate of concordance of existing land cover maps is not high. This lack of concordance results from a difference in classification methods and observation conditions of remotely sensed images. Also, conducting field surveys around the world is unrealistic. Therefore,...
We explore the feasibility of measuring learner engagement and classifying the engagement level based on machine learning applied on data from 2D/3D camera sensors and eye trackers in a 1:1 learning setting. Our results are based on nine pilot sessions held in a local high school where we recorded features related to student engagement while consuming educational content. We label the collected data...
Recent surveys show that there is enormous increase of organizations intending to adopt cloud, but one of their major obstructions is the trustworthiness evaluation of cloud service candidates. Performing evaluations of cloud service candidates is expensive and time consuming, especially with the breadth of services available today. In this situation, this paper proposes a novel trustworthiness measurement...
A novel eye detection method based on template matching is proposed for glasses-free 3D device. Before matching, get the average eye template through a great quantity of eye images, splice several average templates into a chessboard template. Then locate the position of eyes by calculating the correlation coefficient between template and the candidate image. It has been testified that these algorithm...
In wireless localization problems, prior to the implementation of the sensor networks, it is important and valuable to know that, given the localization accuracy constraints, i.e. To ensure the localization error lower than e m at the confidence level of 1-c, then (1) how many location-known sensors (anchors) needed at least and at most? (2) how to select out optimal locations for these anchors from...
The paper presents Echo State Network (ESN) as classifier to diagnose the abnormalities in mammogram images. Abnormalities in mammograms can be of different types. An efficient system which can handle these abnormalities and draw correct diagnosis is vital. We experimented with wavelet and Local Energy based Shape Histogram (LESH) features combined with Echo State Network classifier. The suggested...
Effective machine-learning handles large datasets efficiently. One key feature of handling large data is the use of databases such as MySQL. The freeware fuzzy decision tree induction tool, FDT, is a scalable supervised-classification software tool implementing fuzzy decision trees. It is based on an optimized fuzzy ID3 (FID3) algorithm. FDT 2.0 improves upon FDT 1.0 by bridging the gap between data...
In this preliminary research we examine the suitability of hierarchical strategies of multi-class support vector machines for classification of induced pluripotent stem cell (iPSC) colony images. The iPSC technology gives incredible possibilities for safe and patient specific drug therapy without any ethical problems. However, growing of iPSCs is a sensitive process and abnormalities may occur during...
In order to utilize identification to the best extent, we need robust and fast algorithms and systems to process the data. Having palmprint as a reliable and unique characteristic of every person, we extract and use its features based on its geometry, lines and angles. There are countless ways to define measures for the recognition task. To analyze a new point of view, we extracted textural features...
In this paper, bootstrap percolation is introduced to control the information propagation for efficient cooperative positioning in wireless networks. Particularly, we obtain a novel linear least square (LLS) estimator for the localization of agent nodes. Exploiting the idea of bootstrap percolation, agent nodes sequentially get activated and estimate their positions with an adaptive location updating...
Triple negative breast cancers (TNBC) are clinically heterogeneous, an aggressive subtype with poor diagnosis and strong resistance to therapy. There is a need to identify novel robust biomarkers with high specificity for early detection and therapeutic intervention. Microarray gene expression-based studies have offered significant advances in molecular classification and identification of diagnostic/prognostic...
Many governments and institutions have guidelines for health-enhancing physical activity. Additionally, according to recent studies, the amount of time spent on sitting is a highly important determinant of health and wellbeing. In fact, sedentary lifestyle can lead to many diseases and, what is more, it is even found to be associated with increased mortality.
As one tool for structuring a massive volume of archived news videos based on their semantic contents, this paper proposes a method to detect scene duplicates from news videos. A scene duplicate is a pair of video segments taken at the same event from different viewpoints. Referring to the audio channel is effective to detect scene duplicates regardless of viewpoints, but it cannot be relied on when...
This paper presents an effective method for improving the accuracy of sleep detection using actigraphic algorithm. We have collected 2 sets of comparative data, wrist movement acceleration and brain wave analysis. In the first mode, we use the acceleration on x-axis as the input of actigraphic algorithm, in imitation of uni-axial accelerometer used in former study. In the second mode, we calculate...
Many real-world applications involve multi-label data streams, so effective concept drift detection methods should be able to consider the unique properties of multi-label stream data, such as label dependence. To deal with these challenges, we proposed an efficient and effective method to detect concept drift based on label grouping and entropy for multi-label data. Two methods are proposed to group...
Incomplete data clustering plays an important role in the big data analysis and processing. Existing algorithms for clustering incomplete high-dimensional big data have low performances in both efficiency and effectiveness. The paper proposes an incomplete high-dimensional big data clustering algorithm based on feature selection and partial distance strategy. First, a hierarchical clustering-based...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.