The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper presents a novel technique of document clustering based on frequent concepts. The proposed FCDC (Frequent Concepts based Document Clustering), a clustering algorithm works with frequent concepts rather than frequent itemsets used in traditional text mining techniques. Many well known clustering algorithms deal with documents as bag of words while they ignore the important relationship between...
Recently, bag-of-visual-words has been paid attention to as an image retrieval approach that uses the defining features of images. However, k-means clustering generally used in bag-of-visual-words has a drawback such that its result is affected by setting up initial points and their number. Additionally, the more keypoints increase, the more expensive processing becomes. We resolve the problem of...
Ontology alignment is a time consuming process, especially when the two ontologies to be aligned are large. A fast and accurate ontology similarity can help the user to avoid aligning ontologies without significant similarities. In this paper, we propose an Asymmetric Similarity Measure for Ontologies (ASMO) that measures how similar the source ontology is to the target ontology. Many efficient ontology...
With the rapid development of the Internet and communication technology, huge data is accumulated. Short text such as conversation in chatting room and email is common in such data. It is useful to cluster such short documents to get the structure of the data or to help building other data mining applications. But most of the current clustering algorithms can not get acceptable clustering accuracy...
In SOA (service oriented architecture) and RTE (real-time enterprise) environment, an assurance of data quality is important. Because we do not assure data accuracy among dynamic clustering data set. Traditional methodology for assuring data quality is data profiling and data auditing. However, that is needed lots of time and cost to analysis of metadata and business process for integrating system...
Gene expression profiling plays an important role in a broad range of areas in biology. The raw gene expression data, may contain missing values. It is an important preprocessing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis. Numerous methods have been developed to deal with missing values. In this paper,...
On the deep Web, a significant amount of information can only be accessed through query interfaces, so an important step is the integration of these interfaces. In this paper, we aim to construct automatically a query interface that integrates a set of interfaces in the same domain and permit users to access information uniformly from multiple sources. The integration of query interfaces can be divided...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.