The field of opinion mining is expanding rapidly with the widespread use of the internet for e-commerce and social interaction. One interesting use of opinion mining is in the online producer-consumer industry. The primary goal of the work presented in this paper is to perform a semi-automated sentiment classification on online product reviews for product evaluation using machine learning...
In this poster, we discuss ideas on how to support cross-language communication, which tends to give rise to cultural gaps even when trans-lingual interpretation is provided. Many people write articles (diaries, essays, etc.) about their daily experiences as UGC (User-Generated Content) on social media. Among them there may be situations of miscommunication caused by cultural gaps. Using text...
The Internet and social media are widely used by terrorist organizations to spread their ideas and recruit foreign fighters. The aim of the SAFFRON project is to build a system able to support early detection of foreign fighters' recruitment by terrorist groups in Europe. It consists of studying recruitment communication strategies on social media (e.g. narrations, argumentative tropes and myths used), and...
Streaming information flow allows identification of linguistic similarities between language pairs in real time, as it relies on pattern recognition of grammar rules, semantics and pronunciation, especially when analyzing so-called international terms, the syntax of the language family, and tense transitivity between the languages. Overall, it provides backbone translation knowledge for building...
Written texts have perhaps never been so widely used as they are in today's social media context, with people constantly writing, sharing, commenting, getting involved. At the same time, Linked Data is emerging as an increasingly important topic, and research in this area has resulted in massive amounts of structured linguistic data. In this climate, we intend to analyze how linked data can help to...
As the number of documents continues to increase steadily, shortening processing time has become an important issue in the field of natural language processing. In this paper, we describe a method to reduce the execution time of the Korean temporal information extraction module from a development perspective. While the rule-based approach is useful for finding time representations from natural...
Recently, there has been a surge in the amount of data collected from computational systems. This vast amount of data can be useful in many applications and fields, particularly in Big Data Analytics. However, with a large collection of data, it is difficult to discover important information. Automatic Document Summarization (ADS) systems are suitable for the task of outlining useful...
Sentiment analysis, also known as opinion mining, aims to recognise the attitude or emotion of people through natural language processing, text analysis and computational linguistics. In recent years, many studies have focused on sentiment classification in the context of machine learning, e.g. to identify whether a sentiment is positive or negative. In particular, the bag-of-words method has...
This paper presents a method to improve Thai-English word alignment in statistical machine translation (SMT) for interrogative sentences in a parallel corpus. We utilize Thai and English grammatical knowledge, i.e. tense, part of speech (POS), and question inversion pattern. The proposed method handles the differences between Thai and English interrogative sentences using sentence transformation, interrogative...
In recent years, fuzzy utility mining has become an area of interest due to the advancement of human reasoning. With regard to real applications, transactions in a database often involve attributes such as transaction time, stamp, and much more. It is also noted that not all products in a store are displayed on the shelf, especially the seasonal ones. This paper, therefore, addresses these issues by presenting...
Large amounts of data are created and stored in electronic media. Agriculture is no exception. Large volumes of unprocessed text are available on various government and other websites. Despite its large volume and availability, this data is underutilized. This data should be converted to an effective form so as to facilitate better information dissemination. Ontology is an efficient medium to carry out this...
As more and more on-premises services migrate to the cloud, user behavioral analysis is becoming popular as a data-driven way to administer large numbers of cloud service accounts. This paper proposes a novel rule-based approach, GMiner, for mining different types of Google cloud drive usage as an unsupervised account-management approach. Experimental results show that GMiner provides accurate, interpretable,...
According to the Merriam-Webster dictionary, satire is trenchant wit, irony, or sarcasm used to expose and discredit vice or folly. Though it is an important aspect of language used in everyday communication, satire detection in natural text is often ignored as a subject of study. In this paper, we identify key value components and features for automatic satire detection. Our experiments have been carried out...
Grammar teaching and learning have always been important and difficult parts of L2 Chinese. This paper demonstrates a method for automatically extracting and recommending Grammar Points to L2 Chinese teachers and learners. First, an L2 Chinese grammar syllabus is reconstructed based on a corpus of international Chinese teaching materials. Second, a regular expression-based learning algorithm is explored...
Increasing students' learning effectiveness and willingness to learn has become the most important issue for universities in Taiwan. Therefore, we must find the important factors in learning effectiveness to improve students' willingness to learn. However, it is not easy to measure learning effectiveness because the subjective judgment of evaluators and the attributes of factors are always...
Researchers must constantly produce academic publications to promote their contributions and findings. The Research Article (RA) genre consists of sub-genres based on the typical IMRAD (Introduction, Method, Result and Discussion) structure that is commonly used for research in science and engineering. Researchers still have difficulty learning systematically how to write RAs during university...
Pivot methods have been shown to be an effective solution to the problem of unavailable large bilingual corpora in statistical machine translation. The representative pivot approach is phrase pivot translation, which uses common pivot phrases to produce connections between source-pivot and pivot-target phrase tables. Nevertheless, this approach produces insufficient connections...
Over the past decade, the application of data science techniques to clinical data has allowed practitioners and researchers to develop a variety of analytical models. These models have traditionally relied on structured data drawn from Electronic Medical Records (EMR). Yet a large portion of EMR data remains unstructured, primarily held within clinical notes. While recent work has produced techniques...
Concept maps are resources for the representation and construction of knowledge. They show, through concepts and relationships, how knowledge about a subject is organized. Technological advances have boosted the development of approaches that support the automatic construction of a map, in order to make the benefits of that resource more broadly available. Because of...
There has been a growing need to automatically identify, extract and analyze risk-related statements from textual data. In this paper, we have exploited natural language processing research to develop a risk analytics framework that processes human-reported risk statements, analyzes enterprise risk description texts to classify them into valid and invalid risk categories, and performs analytics...