The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Due to the explosive growth of information sources available on the World Wide Web, it has become increasingly necessary for users to utilize automated tools and design in finding the desired information resources. Web Mining can be broadly defined as extraction of interesting and potentially useful patterns and implicit information from artifacts or activity related to the World Wide Web. In this...
Signal-to-Noise Ratio (SNR) and t-statistics are widely used for gene ranking in the analysis of microarray gene expression data. By implementing these filtering techniques directly to the microarray data may give redundant features, as we may have redundant expression values of number of genes in the data set. By grouping the genes bearing similar expression values in a single cluster and then implementing...
Search engines generally return a large number of pages in response to user queries. To assist the users to navigate in the result list, ranking methods are applied on the search results. Most of the ranking algorithms proposed in the literature are either link or content oriented, which do not consider user usage trends. In this paper, a page ranking mechanism called Page Ranking based on Visits...
The FP-growth algorithm is currently one of the fastest approaches to frequent item set mining. Fuzzy logic provides a mathematical framework where the entire range of the data lies in between 0 and 1. The PSO algorithm was developed from observations of the social behavior of animals, including bird flocking and fish schooling. It is easier to implement than evolutionary algorithms because it only...
Cluster detection in Spatial Databases is an important task for discovery of knowledge in spatial databases and in this domain density based clustering algorithms are very effective. Density Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm effectively manages to detect clusters of arbitrary shape with noise, but it fails in detecting local clusters as well as clusters of different...
Information drives today's businesses and the internet is a powerhouse of information. So data integration gives the user with a unified view of all heterogeneous data sources. The basic service provided by data integration is query processing. But if we are considering a query that involves multiple domains, then we find that general purpose search engines fail to answer such queries and domain specific...
In this paper we are suggesting improvements over an existing C4.5 Algorithm. This is a very popular tree based classification algorithm, used to generate decision tree from a set of training examples. The heuristic function used in this algorithm is based on the concept of information entropy. We are proposing two new heuristic functions which are better than the one used by C4.5 Algorithm by some...
Present days humans are associated with many electronic gadgets which generate large amount of data on regular basis. The sole purpose of generated data was to meet the immediate needs and no attempt in organizing the data for later efficient retrieval was attempted. Over the period of time, the data generated became voluminous, this paper attempts to classify the huge data into different categories...
Many real world data sets have an imbalanced distribution of the instances. Learning from such data sets results in the classifier being biased towards the majority class, thereby tending to misclassify the minority class samples. In this paper, we provide a technique, SkewBoost which classifies the minority instances correctly without compromising much on the correct classification of the majority...
This paper presents an effective clustering method which can detect embedded and nested clusters over variable density space. The proposed method, VDSC uses a density based approach for detecting clusters of arbitrary shapes, sizes and densities. VDSC was compared with several other comparable algorithms and the experimental results show that our method could detect all clusters effectively.
A fuzzy logic based approach for the design of optimal automatic generation controllers of three area interconnected power system is proposed in this paper. A three area interconnected power system model consisting of identical power plants with reheat thermal turbines is considered as a test system. The HVDC link in parallel with EHV AC transmission line is incorporated as an area interconnection...
In this paper, we presents a new approach to document image retrieval based on signature. The database contains document images with English text combined with headlines, ruling lines, logo, trade mark and signature. In searching a repository of business documents, task of interest is that of using a query signature image to retrieve from a database. The signature retrieval task involves a two step...
Due to the semantic gap between low-level image features and high level concepts, content-Based image retrieval (CBIR) systems are incapable to provide the effective results to the user. To address this problem, we have presented a framework for effective image retrieval by proposing a novel idea of cumulative learning using Support Vector Machines (SVM). It creates a knowledge base model to increase...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.