The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Because of the worldwide aging population, more and more elders suffer from dementia. Nowadays, it is inconvenient and time-consuming for doctors to diagnose whether elders who live independently have dementia because lots of diagnostic questions on a checklist must be asked first, and part of them even require a long-term observation. In order to help doctors and make this diagnostic process easier,...
Traditional vehicular routing protocols cannot accurately foresee future location of each vehicle for efficient packet forwarding. Recently, the data mining approach has been applied to analyze huge vehicle trajectory data. In this paper, we propose a novel trajectory-based routing (NTR) protocol to improve the packet replication efficiency of vehicles in the Vehicular Delay Tolerant Network (VDTN)...
The construction of knowledge graph of dangerous goods (KGDG) is with great significance of inferring relative information of dangerous goods, developing corresponding policy for its storage and transport, preventing disaster caused by dangerous goods(DG), and providing emergency plan when the disaster happens. Since distributed representation of natural language is an effective method for knowledge...
The random Fourier Features method has been found very effective in approximating the kernel functions. Our former studies show that through a mixing mechanism of the feature space formed by random Fourier features and certain linear algorithms, the fuzzy clustering results in the approximated feature space are comparable to or even exceed the classical kernel-based algorithms. To increase the robustness...
A symptom is the physical indication of an unstable state or the beginning of diseases. Symptom analysis is an essential factor in the medical area, where it is used for disease diagnosis, drug prescription, and the development of new pharmaceuticals. Commensurate with its importance, symptom analysis has been the subject of various studies in recent years. However, prior literature on this topic...
The paper proposes a data mining method to find the relationships between fan page users' sentiment and customers' purchase behavior in order to predict those customers' purchase intention in the future. The business companies create their own fan pages and post advertisements to prompt their products. The fan page users always post their opinions on the wall to tell the feelings about products. Since...
How to reduce the computation time and how to improve the quality of the clustering result are the two major research issues. Although several efficient and effective clustering algorithms have been presented, none of which is perfect. As such, an effective clustering algorithm, which is based on the prediction of searching information to determine the search directions at later iterations and employs...
In recent years, many online health communities (OHCs) are established to provide the patients with the services of disease prevention and self-management. Patients in those online health communities discuss their health conditions and share their experiences with other patients using narrative texts in the posts. Those posts contain a vast amount of patients' information, including drugs, symptoms,...
Safety-critical systems in domains such as aviation, railway, and automotive are often subject to a formal process of safety certification. The goal of this process is to ensure that these systems will operate safely without posing risks to the user, the public, or the environment [1]. It is typically expensive and time consuming for companies to certify their software. Therefore, any attempt to automate...
Different from full periodic patterns, partial periodic patterns could ignore the occurrence of some events in time positions. In this paper, we have presented a gradually pruning algorithm (GPA) for reducing the number of candidate patterns in the mining process. It is based on the two-phased periodic utility upper-bound (PUUB) model and could avoid information loss. Compared to the original approach...
Real-time applications are usually well-defined and operate based on a particular system model. However, in practical scenarios, the applications can perform differently because of the uncertainties in the environment. The system can use video streams to capture sequential real-time information of its surroundings. The system also needs to identify various constraints that have significant effects...
QuantMiner, proposed by Salleb-Aouissi et al., is one of the well-known systems for mining quantitative association rules using a genetic algorithm (GA). We have applied the GA-based methods of QuantMiner to multi-relational data mining (MRDM), where mining rules involves multiple relations from a relational database, and our preliminary experiments showed that a straightforward application of QuantMiner...
Spectral clustering is one of the most effective methods of data mining, in which the adjacency matrix is constructed by using the similarity matrix. In this paper, to extend spectral clustering method for uncertain data clustering, we propose a new spectral clustering method based on JS-divergence. In the proposed method, the JS-divergence is used to construct the adjacency matrix in the spectral...
In biology, text-mining is widely used to extract relationships between biological entities. Gene prioritization is also important to analyze diseases, because mutated or dysregulated genes play an important role in pathogenesis. Here, we propose a method to identify disease-related genes using seed genes and network analysis. We constructed an integrating gene network for lung cancer by combining...
In this paper, we propose to determine whether the viewer's behavior changes or not before, during and after watching a TV program. Are there any behaviors specific to each particular phase of viewing? Here, we propose a flexible and nonintrusive method based on the use of three categories of everyday connected objects (i.e. Smartphone, smartwatch and remote control). Data were collected during participants'...
The unstructured data, which volume grows exponentially, often hide important and even vital information for society and companies. It takes a lot of work to extract information such as the nature of consumption in a category of individuals, trends, etc. When it comes to statistical data, it is often very useful to synthesize this kind of information in the form of graphical representations. In this...
The purpose of this study is to clarify the applicability of data-driven approach in accounting area. As the first stage, focusing on the model comparison, this paper shows the effectiveness of model selection with data mining technique for the development of earnings prediction model based on financial statement data. In accounting area, researchers have not considered the characteristic of financial...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.