Data quality is one of the most important variables in data warehousing. Numerous data warehouse projects fail because of low data quality. It is often assumed that the issues can be fixed later, and hence a great deal of time is eventually spent correcting the errors. If low-quality data is fed into the data warehouse, the outcome will be...
The objective of this work is to perform usage analytics in Amrita-Virtual Interactive E-learning World (A-VIEW) using the technique of association rule mining. The Apriori algorithm is used in this work to analyze the usage of A-VIEW features together with users' voting patterns. A-VIEW synthetic data, which includes features such as document sharing and video along with their user ratings, is applied to evaluate the...
Currently there are many techniques based on information technology and communication aimed at assessing the performance of students. Data mining applied in the educational field (educational data mining) is one of the most popular techniques that are used to provide feedback with regard to the teaching-learning process. In recent years there have been a large number of open source applications in...
Criminal behavior is a disorder that results from a combination of social and economic factors. The crime rate has risen and the activities of criminals have broadened in the last few decades due to better communication systems and transport. Crimes cause terror and damage our community enormously in several ways. In cities and towns, crime trends rise due to fast developmental activities and...
The Internet provides a great deal of useful information that is only sometimes organized for its users, which makes it difficult to extract relevant information from various sources. This paper therefore proposes a hybrid Artificial Bee Colony and Improved K-means clustering algorithm, which covers all types of data in the data repository and has been very successful in disseminating information to users...
The abundance of information on the internet and in databases is common knowledge. It has to be viewed against the backdrop of the risk of disclosure of such information by a third party. Privacy Preserving Data Mining (PPDM) is used to maintain the privacy of individuals. Numerous up-to-date methods are available for the purpose. Evolutionary Algorithms (EAs) are able to provide effective...
Diagnosing liver disease is a challenging task for many public health physicians. In this study, we propose a framework to diagnose hepatitis. For this study, adaptive rule-based induction was formulated and the adaptive rules were implemented in combined Robust BoxCox Transformation (RBCT) and Neural Network (NN) methods. The performance of the proposed model is compared and results are...
Electronic commerce includes all business conducted through information and communication technology. The development of infrastructure, telecommunications, mobile technologies, the internet and social media in recent years has brought tremendous growth in business through e-commerce. E-commerce is now a vital part of economic development and helps with employment, FDI and GDP growth in the country. More...
Information retrieval and integration of web data is a recent trend in today's world of technology. A huge amount of data is available in online repositories, but most of it is hidden behind deep web interfaces. As the deep web is growing at a very fast rate, it is becoming difficult to efficiently locate deep-web interfaces and retrieve the required data. The large volume of web resources and the dynamic...
Kidney disease has become a common disease around the world. Predicting kidney disease is a highly complex task when handling a huge dataset. The kidney disease dataset contains patient information such as age, blood pressure levels, albumin, sugar, red blood cell counts, etc. The dataset may have missing values in some features, and those values may be important to predict kidney...
Patent documents provide a significant source of knowledge about future technologies. Many attempts have been made to mine important knowledge from patents to analyze new technology trends. In this paper, we analyze implicit knowledge derived from the patents dataset of the Big Data domain from KIPRIS. Keywords that occur in the titles of patents are classified into three categories: Approach,...
Pill identification is a serious concern for pharmacists due to the similarity of pill appearances. Pill imprints usually contain important information that can be used to add or search for pill information in existing pill databases. However, current techniques for extracting imprints often give results as vectors, which cannot be used with existing databases. Thus, this paper proposed an approach for...
Many studies analyze issue tracking repositories to understand and support software development. To facilitate the analyses, we share a Mozilla issue tracking dataset covering a 15-year history. The dataset includes three extracts and multiple levels for each extract. The three extracts were retrieved through two channels, a front-end (web user interface (UI)), and a back-end (official database dump)...
Stack Overflow is a popular question answering site that is focused on programming problems. Despite efforts to prevent asking questions that have already been answered, the site contains duplicate questions. This may cause developers to unnecessarily wait for a question to be answered when it has already been asked and answered. The site currently depends on its moderators and users with high reputation...
The study of the evolution of highly configurable systems requires a thorough understanding of three core ingredients of such systems: (1) the underlying variability model; (2) the assets that together implement the configurable features; and (3) the mapping from variable features to actual assets. Unfortunately, to date no systematic way to obtain such information at a sufficiently fine-grained level...
Databases have become one of the most important components in modern software systems. For example, web services, cloud computing systems, and online transaction processing systems all rely heavily on databases. To abstract the complexity of accessing a database, developers make use of Object-Relational Mapping (ORM) frameworks. ORM frameworks provide an abstraction layer between the application logic...
Many software development projects have introduced mandatory code review for every change to the code. This means that the project needs to devote a significant effort to review all proposed changes, and that their merging into the code base may get considerably delayed. Therefore, all those projects need to understand how code review is working, and the delays it is causing in time to merge. This is...
Frequent pattern mining discovers associations among different items in large sets of data. In many real-world applications, the presence of an object or a characteristic cannot be given exactly all the time. Instead, it can be better expressed in terms of probability; such data is called uncertain data. Mining frequent patterns from uncertain data is challenging due to the presence of existential...
Graph data analysis is one of the upcoming methodologies in various niches of computer science. Traditionally, for storing, retrieving and experimenting with test data, researchers start with a MySQL database, which is more approachable and makes it easier to build their test experimentation platform. These test-bed MySQL databases store data in the form of rows and columns, over which various SQL queries are...
Customer Relationship Management (CRM) is the overall process of building and retaining profitable customers for an organization, directed towards improving business relationships with customers. Analysis of customer data in the CRM database helps to create new approaches to guide business strategies. Analytical CRM helps to analyze customer data and interactions through various data mining...