The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Point of interest (POI) categorization is the task of finding of categories of POIs within a document. Because the documents that possess POIs have clue words for identifying POI categories, the task can be solved as document classification. However, this approach misses two crucial factors for identifying the category of a POI. First, the approach pays no attention to onomastic information, even...
Machine translation systems have various problems although they have been developed continuously. Especially, in Korean-English translation system, zero pronoun problem is an important problem, since omitted subject or object Korean are must be restored in English. In order to solve this problem, various methods have been proposed. In this paper, we focus on the gender determination problem in Korean...
In this paper, we propose a method of Korean text chunk identification based on support vector machines (SVMs). Text chunking is a task that divides text into syntactically related non-overlapping groups of word. It is a useful preprocessing step for the reduced time and computational resource of sentence parsing. Especially, we select features for SVM by considering the linguistic typological characteristics...
In this paper, we present a hybrid method of support vector machine and k-nearest neighbor to improve the performance of automatic text classification. The proposed methods first classify a given document using SVM which shows the best performance in text classification, and then is reinforced by k-NN for the documents that are not confidently classified by SVM. According to the experimental results,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.