The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Gender classification is a well-researched problem, and state-of-the-art implementations achieve an accuracy of over 85%. However, most previous work has focused on gender classification of texts written in the English language, and in many cases, the results cannot be transferred to different datasets since the features used to train the machine learning models are dependent on the data. In this...
Preventing juveniles from accessing pornographic web pages remains a problem in Vietnam. The existing tools have failed to block these Vietnamese sites automatically and rely only on configuring black list and white list. In fact, the Vietnamese and English are different in both syntax and semantic, therefore, applying methods used for English into Vietnamese will definitely be much less effective...
The article designed a Web software mining system, discussed the techniques what used in the system and raised the solutions for issues in system. A Web crawler software has been designed and implemented according to the feature of the World Wide Web. Base on the information present by Web pages, the article improved feature selection method and key words weighted algorithm using Web text mining techniques...
Arranging mass of data in related groups is an important way that helps us to decide about them better, clustering and classification are two efficient methods of grouping huge volume of data, most of clustering and classification methods that work on Web pages grouping problems, use fixed size vectors in their learning algorithm. In the real world of WWW this assumption is not reliable. In this paper...
There are various opinions on the Web, and analyzing them is an important task. Although many previous studies focused on analyzing subjective evaluative expressions, objective evaluative expressions which describe positive or negative facts are also informative information. In this paper, we study extraction and classification of subjective and objective evaluative expressions on Japanese Web documents...
We present a novel method for discovering missing cross-language links between English and Japanese Wikipedia articles. We collect candidates of missing cross-language links -- a pair of English and Japanese Wikipedia articles, which could be connected by cross-language links. Then we select the correct cross-language links among the candidates by using a classifier trained with various types of features...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.