The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Fuzzy ontology is based on the concept that each index object is related to every other object in the ontology, with a degree of membership assigned to that relationship based on fuzzy set theory. This paper proposes use cases based on the related process of the terrorism event extraction using fuzzy ontology, especially the terrorism fuzzy ontology construction methodology. The related use cases...
The aim of this paper is to study and compare several machine learning methods for implementing a Thai terrorism event extraction system. The main function of the system is to extract information related to terrorism events found in Thai news articles. The terrorism events can then be classified and presented to intelligence officers who can further analyze and predict terrorism events. This paper...
A large-scale R&D project collaboration requires various areas of expertise, i.e, multidisciplinary, with multiple partners. Such R&D problems include global warming, emerging infectious diseases, and energy issues. One typical approach for identifying a group of expert candidates is to first come up with an initial expert and then use his/her referral to find additional experts. Hence the...
Finding relevant information from a long list of search results returned by general search engine can be difficult. The categorization technique is applied to solve this problem. One possible approach is by using some external resources such as Open Directory Project (ODP) to map search result's URLs into the ODP categories. However, the ODP can only map some part of all URLs that returned from search...
We propose a feature called category browsing to enhance the full-text search function of Thai-language news article search engine. The category browsing allows users to browse and filter search results based on some predefined categories. To implement the category browsing feature, we applied and compared among several text categorization algorithms including decision tree, Naive Bayes (NB) and Support...
Two important factors which indirectly influence the Internet shoppers to make some online purchases are the visual layout and the presentation of web page. In this paper, we propose an approach of web page layout analysis in order to assess the design of e-commerce Web sites. Firstly, our proposed method segments each web page into five different blocks: top, left, center, right and bottom. We study...
Science and technology (S&T) information presents a rich resource, vital for managing research and development (R&D) programs. Modern S&T electronic abstract databases such as Science Citation Index and INSPEC provide comprehensive information on research activities in many different domains. These databases mostly include English language publications. However for a country such as Thailand...
In this paper, we analyze and compare various approaches for Thai word segmentation. The word segmentation approaches could be classified into two distinct types, dictionary based (DCB) and machine learning based (MLB). The DCB approach relies on a set of terms for parsing and segmenting input texts. Whereas the MLB approach relies on a model trained from a corpus by using machine learning techniques...
Nowadays, Internet is widely used almost all over the world including Thailand. People can find Web site or information that they need by using search engines like Sansarn or Google. When users type words or phrases into the search box, sometimes they are not satisfied with the returned results. One of the most important problems is misspelled query due to typographical and cognitive errors. To address...
This paper presents a new algorithm called ldquoconcept-groupingrdquo that adapts an association rule mining technique to construct term thesaurus for data preprocessing purpose. Similar terms, which are written differently, can be grouped together into the same concept based on their associations before they are used for subsequent analysis. This data preprocessing is important since it has an impact...
In this paper, we propose a Thai language specific Web crawling as a method of selectively seek out Web pages written in Thai. The strategy is to follow a URL with the highest probability of leading to Thai Web pages. The probability score is calculated from the example set of Web pages using simple Naive Bayes approach. In addition, we also use a heuristic based method to bias the probable URLs whose...
Recent research in mining user access patterns for predicting Web page requests focuses only on consecutive sequential Web page accesses, i.e., pages which are accessed by following the hyperlinks. In this paper, we propose a new method for mining user access patterns that allows the prediction of multiple non-consecutive Web pages, i.e., any pages within the Web site. Our approach consists of two...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.