Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
Given a database of spatial trajectories reporting the movement of a set of objects in a time frame, the problem is to discover the groups of objects that stay in close proximity within a geographical area for a significant time. To deal with the problem, techniques for the discovery of collective patterns, e.g. the meeting pattern, have been proposed. Such techniques, however, impose stringent constraints...
Text segmentation is an important problem in document analysis related applications. We address the problem of classifying connected components of a document image as text or non-text. Inspired from previous works in the literature, besides common size and shape related features extracted from the components, we also consider component images, without and with context information, as inputs of the...
Density peak (DP) based clustering algorithm is a recently proposed clustering approach and has been shown to be with great potential. This algorithm is based on the simple assumption that cluster centers have high local density and they are relatively far from each other. This observation is used to isolate cluster centers from other data. By making use of the density relationship among neighboring...
With the advance of mobile electronic devices and the development of positioning technology, a large volume of spatio-temopral data are collected in the form of desultorily data streams, which contain a lot of potential information. In this study, we focus on discovering the composition relationships between observation moving objects in a long period. Such research can be widely used in military...
Clustering is a well-recognized data mining technique which enables the determination of underlying patterns in datasets. In electric power systems, it has been traditionally utilized for different purposes like defining customer load profiles, tariff designs and improving load forecasting. Some surveys summarized different clustering techniques which were traditionally used for customer segmentation...
Topological data analysis is a noble method to analyze high-dimensional qualitative data using a set of properties from topology. In this paper, we explore the feasibility of topological data analysis for mining social media data by investigating the problem of image popularity. We randomly crawl images from Instagram, convert their captions to 300 dimensional numerical vectors using Word2vec, calculate...
A curve fitting method based on automatically extracting subsection points is proposed to fit the wheel tread profile accurately. Firstly, the subsection points are determined by segmenting discrete points of wheel tread profile based on given errors and threshold. Secondly, the segmentation interval is determined with the respective segment point as the center, and the least squares curve fitting...
Social media has now become a pervasive global communication channel. Many applications and platforms have become available for users to post messages, follow friends and share experiences. Due to the high frequency with which users update their states, a large amount of data is being generated around the world every second. By analyzing this data, valuable patterns can be extracted such as the distribution...
Cluster analysis aims at classifying data elements into different categories according to their similarity. It is a common task in data mining and useful in various field including pattern recognition, machine learning, information retrieval and so on. As an extensive studied area, many clustering methods are proposed in literature. Among them, some methods are focused on mining clusters with arbitrary...
Time series similarity measure is an essential issue in time series data mining, which can be widely used in various applications. With an eye to the fact that most current measures neglect the shape characteristic of time series, this paper proposes a shape based similarity measure. By introducing a shape coefficient into the traditional weighted dynamic time warping algorithm, an improved version,...
A method for classifying objects into categories and indexing is proposed to implement object recognition. The relational measurements such as the distance between two points, color comparison is encoded by the attributed relational graph (ARG) representation to provide one-to-one correspondence between models and object features. If the contour is traversed counterclockwise, a sequence can be formed...
In order to classify the eaglewood, the work proposed a method of wood fiber segmentation and characteristic extraction based on the eaglewood micrographs. The active contour model was used to extract the contours of the eaglewood micrographs. After screening of wood fiber, the geometric features and shape factors were extracted to form characteristic vectors. After that, SVM was used to achieve the...
Fault classification in power systems is a challenging and complex task as the variety and variability of the electrical parameters of the various network components in spatial and temporal scales. The majority of machine learning methods for event detection require the labeled data sets or examples of previous events. However, the recorded event data happen in different locations, time and system...
A dynamic industrial design optimization requires high-quality optimization algorithms as well as adaptive representations to find the global solution for a given problem. For adapting the representation to changing environments or to new input we utilize the concept of evolvability, which in our interpretation consists of three criteria: variability, regularity, and improvement potential, where regularity...
In this paper, an overview on existing data mining techniques for time series modeling and analysis will be provided. Classification of available literature on time series data mining shows that the main research orientations can be divided into three subfields: Dimensionality Reduction (Time Series Representation), Similarity Measures and Data Mining Tasks.
We present Random Forest, Support Vector Machine and Feedforward Neural Network models to classify 2519 variable star light curves. These light curves are generated from a reduction of non-survey optimized observational images gathered by wide-field cameras mounted on the Liverpool Telescope. We extract 16 features found to be highly informative in previous studies and achieve an area under the curve...
Flatness is one of the most important specifications for strip products in cold rolling processes. Shape control of cold rolled product is often characterized as a complex process with multiple operation conditions, multi-variables, time-varying parameters, strong coupling and nonlinearity. Accurate online shape defect diagnosis is still a difficult task. This paper proposed a frequent pattern mining...
Attributes are human-annotated semantic descriptions of label classes. In zero-shot learning (ZSL), they are often used to construct a semantic embedding for knowledge transfer from known classes to new classes. While collecting all attributes for the new classes is criticized as expensive, a subset of these attributes are often easy to acquire. In this paper, we extend ZSL methods to handle this...
Clustering is a classical unsupervised learning task, which is aimed to divide a data set into several groups with similar objects. Clustering problem has been studied for many years, and many excellent clustering algorithms have been proposed. In this paper, we propose a novel clustering method based on density, which is simple but effective. The primary idea of the proposed method is given as follows...
Distributed Denial of Service (DDoS) attack is a congestion-based attack that makes both the network and host-based resources unavailable for legitimate users, sending flooding attack packets to the victim's resources. The non-existence of predefined rules to correctly identify the genuine network flow made the task of DDoS attack detection very difficult. In this paper, a combination of unsupervised...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.