The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Automatic template extraction including event template has been studied intensively in recent years. Researchers study the topic in order to solve the problem of manually defining a template that is required in most information extraction systems. Several studies of event template extraction rely on the documents characteristics to discover the pattern. Although there exist some structured knowledge...
Semantic role labeling (SRL) is a task to assign semantic role labels to sentence elements. This paper describes the initial development of an Indonesian semantic role labeling system and its application to extract event information from Tweets. We compare two feature types when designing the SRL systems: Word-to-Word and Phrase-to-Phrase. Our experiments showed that the Word-to-Word feature approach...
The absence of manually annotated training data presents an obstacle for the development of machine-learning based NLP tools in Indonesia. Existing annotation tools lack a mobile-friendly interface which is a problem in Indonesia where most users access the internet using their smartphone. In this paper, we propose the first mobile collaborative data annotation tool and evaluate it in an experiment...
Automatic template extraction has been studied intensively in order to perform information extraction without predefined template. Several existing studies utilized the similar preprocessing techniques which are applied in Open Information Extraction (Open IE) paradigm system. We investigate the use of Open IE results to build the automatic event template extraction. In this study, we adapt the clustering...
Recent development of variety and volume of information circulating in the Internet has prompted the emergence of a new paradigm in information extraction, namely the Open Information Extraction (Open IE). An evaluation of several existing Open IE systems shows a good performance on precision. However, improvement is still needed to boost the recall. A relation between entity pair in simple sentence...
Plagiarism is a form of cheating that has been so much happen. One of prevention is to make the anti-plagiarism system. The system that must compare a query document with all documents in the database requires a very long time. The more irrelevant document in database compare with the query that will be matched will waste the time. This paper will discuss a system to detect plagiarism by using indexing...
Along with the growth of Islamic religion in Indonesia, the need for information of Hadits becomes very important. Hadith as the second source of law in Islam after Al-Quran has a high position in moslem life. But application related with Hadith Retrieval is still limited. This limitation especially found in non-arabic language environment. Existing Hadith Retrieval System in Indonesia execute input...
As Internet usage becomes very common nowadays, including in Indonesia, people do their activities by relying on information gathered from world wide web. One of the examples is gathering information before buying product. Customers will search the product reviews before deciding whether to buy the products or not. Product reviews can be found by searching through search engine, reading on personal...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.