Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
Intrusion detection in computer networks is a important topic in information security. Due to numerous cases of security breaches that caused economic and social losses in recent years, this topic has been the subject of several studies in order to mitigate problems related to network intrusion and computer attacks. Information security systems have been using different techniques for network intrusion...
NewsOne is a dedicated platform that aggregates all the latest news updates from multiple national and international resources and summarizes them to present in a short and crisp words. This online platform provides a service oriented interaction among the users from across web. The main motto of this application is to access the news fast. It will bring news directly without wasting any time for...
For a large sum of data collected and stored continually, it is more and more necessary to mine association rules from database, and the Apriori algorithm of association rules mining is the most classical algorithm of database mining and is widely used. However, Apriori algorithm has some disadvantages such as low efficiency of candidate item sets and scanning data frequently. Support and confidence...
The growing crime rate of any country is always one of the biggest obstacles to its growth and development. With more manpower it certainly helps to keep the crime rate at bay, but is manpower the ideal solution? No. This may come as a surprise that despite Big Data being the boom of the century is yet to take firm roots when it comes to helping solve a Criminal Investigation. With the aim of changing...
This paper gives details about web-based department automation system which will be implemented at educational institution level for maintaining faculty details and records. The proposed application aims at providing efficient and hassle-free working environment for faculty of the organization as it reduces the amount of paperwork involved. This system is based on the modern approach of data mining...
Many computer users lack awareness of the connections their computers make to hosts on the Internet. Some realtime tools exist for host-based network monitoring, but they are too complex for average computer users as they require expert knowledge to use and interpret. Connection Cartographer is a tool to let non-expert users visualize geographical information about their network connections in real...
In previous works, we presented Cross Motif Search (CMS), an algorithm designed to explore new techniques in the field of protein motif retrieval and identification. The novelty of the CMS approach is to look for geometrical similarities in the secondary structures of proteins, instead of homologous topology. Put in other words, while the connections among different secondary structures are still...
In today's scenario of data mining, there are so many upgraded versions of traditional Apriori has been launched in association due to its limitation of suffering from number of inefficiencies. Which have procreate other algorithms. The actual concept of this research topic is also one of them and it mainly focus on the description of the new version of hash based association using association rule...
Given a common dataset, two methods operating on that dataset and reported equal-error rate (EER) for each method, then we can estimate whether the two methods differ significantly at the threshold leading to the EER. This enables the calculation of a boundary on the significance for methods where the significance was not reported in the original paper or to compare new methods to older ones by evaluating...
Data Mining is the process of identifying new patterns, insights in data and knowledge discovery, and is at the intersection of multiple research areas, including Machine Learning, Statistics, Pattern Recognition, Databases, and Visualization. With the maturity of databases and constant improvements in computational speed, data mining algorithms that were too expensive to execute are now within reach...
This paper proposes a new system of categorization and classification using data mining techniques based on certain criteria/topics. We describe the design and implementation of proposed system that automatically categorizes a restaurant as being good or bad, using data mining techniques, based on users' reviews. For this study we took a data set consisting of approximately 9,000 reviews for 2,355...
Prefixspan algorithm with GRC constraints which generates sequential patterns by using prefix projected pattern growth approach is implemented. Other than frequency this algorithm also uses gap, compactness and recency constraints during sequential pattern mining process. The gap constraint applies limit on the separation of two consecutive transactions of discovered patterns, recency constraint makes...
One of the important approach in data mining is sequential pattern mining that is used for discovering behaviors of sequential databases. There are various challenges in sequential pattern mining such as efficiency and effectiveness. In this paper different sequential pattern mining algorithm are discussed such as GSP, FreeSpan, PrefixSpan, and CAI-PrefixSpan to improve performance to finding sequential...
In today's digital world scenario, digital data is coming in and going out faster than ever before. This data is of no use until we extract some useful content from it. But, it is impractical and inefficient to use traditional database management techniques on big data. That's why, big data technologies like Hadoop comes to existence. Hadoop is an open source framework, which can be used to process...
Organization of transactional data is one of the important steps in Knowledge Discovery. Compact Pattern Tree (CPTree) organization of the data is apt for the FP-Tree, CAN-Tree, CATS-Tree etc., Construction of CPTree has been dealt within two phase method. This paper exploits the transactional data representation in a structured form using one of the data structures for subsequent representation of...
This working paper argues that many data-mining projects in the humanities limit themselves by choosing words as their default unit of analysis. Some authors, problems, and forms are better illuminated by analysis of individual textual symbols, others by examination of multiword constructions. Insights about the nature of code from mathematical information theory, long but perhaps prematurely rejected...
Sequential pattern mining is valuable approach to uncover consumer buying behaviour from huge sequence database. Weather prediction, web log analysis, stock market analysis, scientific research, sales analysis, and so on are the application of sequential pattern mining. The pattern that is recent and profitable can't discover by conventional sequential pattern mining. So, RFM-based sequential pattern...
With the rapid development of Internet, online survey becomes an emerging industry. It is a very challenging task to get interesting knowledge from the large-scale behavioral data of respondents. This paper firstly makes reduction of user properties and behavior data from an online survey company, and based on which we construct an online survey user model, then, an improved generalized sequential...
Classification is one of the main issues of data mining. Knowledge hidden in the data can be discovered by induction of decision rules. However, with the increase in the size of the decision tables there is a need to decompose the problem. An appropriate solution to this problem may be hierarchical induction of decision rules. In this article the decomposition algorithm of decision tables containing...
There is huge growth of online text documents in the Internet today. We can easily find documents written in languages from all over part of the just from a single click. Increasing number of online text document in Internet makes the increased availability of information on the Internet. In fact that none in the world can understand all languages of the digital documents. Hence, there is a significant...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.