The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Crime prediction plays a crucial role in addressing crime, violence, conflict and insecurity in cities to promote good governance, appropriate urban planning and management. Plenty efforts have been made on developing crime prediction models by leveraging demographic data, but they failed to capture the dynamic nature of crimes in urban. Recently, with the development of new techniques for collecting...
There has been a surge in research interest in learning feature representation of networks in recent times. Researchers, motivated by the recent successes of embeddings in natural language processing and advances in deep learning, have explored various means for network embedding. Network embedding is useful as it can exploit off-the-shelf machine learning algorithms for network mining tasks like...
Natural language processing methods are widely used to study the relationship between traditional Chinese medicine (TCM) prescriptions and diseases in textual data, and the results can discover the essence of TCM literature. In this paper, we get TCM treatment information from the abstract text at first by using the web crawlers. Second, the eigenvectors will be selected from the cleaned abstract...
Many of today's machine learning (ML) systems are composed by an array of primitive learning modules (PLMs). The heavy use of PLMs significantly simplifies and expedites the system development cycles. However, as most PLMs are contributed and maintained by third parties, their lack of standardization or regulation entails profound security implications. In this paper, for the first time, we demonstrate...
It is attractive to extract and determine the key features of traffic patterns for mitigating road congestion and predicting travel time of vehicles in traffic analysis. Based on previous works that is a scalable approach via Hadoop MapReduce programming model, and can extract maximal repeats from a huge amount of tagged sequences, this paper adapts that approach to extract significant patterns of...
Authorship analysis deals with the identification of authors which is a problem of text data mining and classification. There are numerous techniques and algorithms that have been published so far, in the field of stylometry. In this regard, the primary objective of the present review is to provide the status of the different studies carried out on authorship analysis based on the important research...
Recruiters evaluate and filter job seekers, ranking them on various criteria. This includes how much of the required and desired requirements are satisfied, ensuring the candidate is the “best match” to vacancy. However, most vacancies do not classify the set of skills as required and desired explicitly. Required skills are those skills a job seeker must have in order to be considered for the job...
Poor sitting postures influence one's health and can cause upper limb and neck disorder. Current solutions for siting posture recognition, however, are impractical due to intrusiveness, high cost or low generalization capability. Particularly, most of the existing solutions are chair-dependent, which are highly coupled with certain types of chairs. In this paper, we design Postureware, a smart cushion,...
Stack Overflow is one of the most popular question-and-answer sites for programmers. However, there are a great number of duplicate questions that are expected to be detected automatically in a short time. In this paper, we introduce two approaches to improve the detection accuracy: splitting body into different types of data and using word-embedding to treat word ambiguities that are not contained...
In this study, we focus on extraction of latent topic transition from POS data. POS analysis is conducted to obtain the frequent pattern of customer's behavior. The fundamental method for POS analysis is to conduct market basket analysis. By doing Market basket analysis, the sets of products that are often bought at the same time can be extracted. In market basket analysis, however, the effect of...
Understanding the value of a football player is a challenging problem. Player valuation is not only critical for scouting, bidding and negotiation processes but also attracts a large media and fan interest. Due to the complexities which arise from the fact that player pool is distributed over hundreds of different leagues and many different playing positions, many clubs hire domain experts (often...
According to the Merriam-Webster dictionary, satire is a trenchant wit, irony, or sarcasm used to expose and discredit vice or folly. Though it is an important language aspect used in everyday communication, the study of satire detection in natural text is often ignored. In this paper, we identify key value components and features for automatic satire detection. Our experiments have been carried out...
In order to generate effective results, it is essential for a recommender system to model the information about the user interests (user profiles). A profile usually contains preferences that reflect the recommendation technique, so collaborative systems represent a user with the ratings given to items, while content-based approaches assign a score to semantic/text-based features of the evaluated...
In the airline industry, a Passenger Name Record (PNR) stores the travel itinerary of an individual or group of passengers travelling together. A PNR always contains all the flight information regarding each segment of a journey, and may contain additional important information such as nationality, gender and age of the passengers. From a commercial point of view, these passenger attributes are of...
In today's world SNS i.e. social networking sites have become an integral part of our day to day life. SNS is the place where a person is free to express his views and opinions about others, share information with others. Millions of visitors daily share their views and opinions on websites like twitter, MySpace and Facebook which produces enormous data. It has become important to analyze this data...
In smart environments, the extraction of relevant information in large volumes of data collected from intelligent devices is a crucial issue. The extracted information can assist in automation of user activities and on daily chores, either suggesting or even changing the state of devices based on his/her routine. In this work, we propose a prediction architecture which combines an innovative preprocessing...
Cardiovascular risk prediction is a vital aspect of personalized health care. In this study, retinal vascular function is assessed in asymptomatic participants who are classified into risk groups based on Framingham Risk Score. Feature selection, oversampling and state-of-the-art classification methods are applied to provide a sound individual risk prediction based on Retinal Vessel Analysis (RVA)...
This dataset is used to detect undoing style in CSS code. In total, this dataset contains 41 subjects. Each subject has its own folder, which contains the captured states, a states.html file, is used to load all captured states in one document, and a folder called results, which contains the detected undoing styles, the refactored style sheets and the detected semantic changes. The file states.html...
In recent years the social network analysis has been increased, in this paper we focus mining the small network formed by social gathering or business meeting. The main objective is to discover the existence of correlation and influence among the group of people by analysing their social affinity and emotions, currently we are interested only in facial emotions. Whereas the approach for emotion detection...
Feature-based opinion mining for product review is the field of study that analyzes user's attitude towards product attributes, which has been witnessed a booming interest in the last one and half decades, due to its importance to business and society as a whole. This paper proposed a POS patterns matching method to identify feature words, opinion bearing words, as well as negative words based on...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.