The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The following topics are dealt with: computational linguistics; data mining; data warehousing; bioinformatics; distributed computing; information security; ad hoc networks; information management; wireless sensor networks; and digital image processing.
In this work we investigate several issues in order to improve the performance of decision trees. Firstly, we introduced or adopt a new composite splitting criterion aimed to improve classification accuracy. Secondly, we derive a new pruning technique using expert knowledge, which is able to significantly reduce the size of tree without degrading the classification accuracy. Finally, we implemented...
This paper mainly deals with design of an algorithm in which the customer transitions are characterized by more than one closed Markov chain. Generating functions are implemented to derive closed form of solutions and product form solution with the parameters such as stability, normalizations constant and marginal distributions. For such a system with `N' servers and `L' chains, the solutions are...
The Indian Material Database (IMDB) is a national project aiming to develop a database through compilation of materials property data available in different laboratories in India. Selecting the appropriate data modeling technique is crucial for the successful deployment of such a system. Dimensional modeling is a logical design technique that seeks to present the data in a standard, intuitive framework...
This paper presents a detail review and implementation issue of fetal ECG extraction and enhancement. The focus is also made on proper placement of electrodes for fetal ECG monitoring in twins and multi-fetal prenatal. Various extraction methods like correlation, subtraction, matched filtering, linear regression and independent component analysis are discussed. For enhancement neural networks, fuzzy...
This paper mainly deals with the selection of call center by using FUZZY VIKOR and FUZZY TOPSIS. The selection of call center location is a complex multi criteria task which includes both quantitative and qualitative factors are in conflict and also may be uncertain. The objective of the call center is to select the most appropriate location among the alternatives. The article proposes a fuzzy Multiple...
Communication through web is becoming increasingly popular thanks to wireless and cellular networks. As this awareness spreads far and wide in different countries, significant complexities arise in terms of language and communication means for extracting information on the web. This is particularly true in India where more than fifteen officially recognized language texts and more variations in local...
As the Unified Modeling Language (UML) becomes an industrial standard for object-oriented software development, many system models have been specified in UML notation. A system can be described in terms of the functional view through the use case model, the static view through the class model, and the dynamic view through activity or sequence model. In particular, activity model has more to do with...
Due to the widespread computerization and affordable storage facilities, there exists enormous amount of information in databases belonging to different enterprises. The ultimate intent of this massive data collection is the utilization of this to achieve competitive benefits, by determining formerly unidentified patterns in data that can direct the process of decision making. Data mining, the core...
Data mining has been defined as "The nontrivial extraction of implicit, previously unknown, and potentially useful information from data". Clustering is the automated search for group of related observations in a data set. The K-Means method is one of the most commonly used clustering techniques for a variety of applications. This paper proposes a method for making the K-Means algorithm...
Data mining has become an important topic in effective analysis of gene expression data due to its wide application in the biomedical industry. Within a gene expression matrix there are usually several particular macroscopic phenotypes of samples. Selection of genes most relevant and informative for certain phenotypes is an important aspect in gene expression analysis. Currently most of the research...
Record matching is an essential step in duplicate detection as it identifies records representing same real-world entity. Supervised record matching methods require users to provide training data and therefore cannot be applied for web databases where query results are generated on-the-fly. To overcome the problem, a new record matching method named Unsupervised Duplicate Elimination (UDE) is proposed...
Association rule mining is one of the most important problems in image mining. We propose a new approach to apply association rule on weather forecasting image. We build an image system to store weather forecasting images and retrieve them later for further research, for example, to predict future temperature, relative humidity, rainfall, wind speed and atmospheric pressure. Image retrieval technology...
Frequent pattern mining has been an emerging and active field in data mining research. There is a huge collection of data from the hi-tech communication equipments associated with moving objects. To find the frequent patterns existing in this data and derive knowledge out of it is of great concern. Frequently occurring paths of flying object trajectories are data dependent and cannot be derived before...
Top-k processing in Uncertain Databases is semantically and computationally different from traditional top-k processing. The interplay between score and uncertainty information makes traditional top-k processing techniques inapplicable to uncertain databases. The existing approaches are all based on the assumption that the underlying data are exact (or certain). We construct a framework that encapsulates...
Detection of outliers and relevant features are the most important process before classification. In this paper, a novel semi-supervised k-means clustering is proposed for outlier detection in mammogram classification. Initially the shape features are extracted from the digital mammograms, and k-means clustering is applied to cluster the features, the number of clusters is equal with the number of...
The field of Information Retrieval plays an important role in searching on the Internet. Most of the information retrieval systems are limited to the query processing based on keywords. In information retrieval system the matching of the query against a set of text record is the core of the system. Retrieval of the relevant natural language text document is of more challenge. Today's most search engines...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.