The self-organizing map (SOM) algorithm has been widely applied to tasks such as data clustering and visualization. Two major deficiencies of the classical SOM are the need for a predefined map structure and the lack of hierarchy generation. Several approaches have been devised to tackle these deficiencies. One of our previous works, namely the topic-oriented self-organizing map (TOSOM), tries to remedy the...
The self-organizing map (SOM) model is a well-known neural network model with a wide range of applications. The SOM has two main characteristics: dimension reduction and topology preservation. Using SOM, a high-dimensional data space is mapped to a low-dimensional space while the topological relations among the data are preserved. With such characteristics, the SOM was...
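The mapping described in this abstract can be illustrated with a minimal sketch of a classical SOM in plain Python. This is not the authors' implementation: the function names (`best_matching_unit`, `train_som`), the linear decay schedules, the Gaussian neighborhood, and all default parameter values are assumptions chosen only for illustration.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the (row, col) of the map unit whose weight vector is closest to x."""
    best, best_d = None, float("inf")
    for r, row in enumerate(weights):
        for c, w in enumerate(row):
            d = sum((wi - xi) ** 2 for wi, xi in zip(w, x))  # squared Euclidean distance
            if d < best_d:
                best, best_d = (r, c), d
    return best


def train_som(data, rows, cols, epochs=50, lr0=0.5, radius0=None, seed=0):
    """Train a rows x cols map on data (a list of equal-length float vectors).

    Assumed schedule: learning rate and neighborhood radius decay linearly;
    the radius is floored at 0.3 so late updates mostly affect the winner.
    """
    rng = random.Random(seed)
    dim = len(data[0])
    # Random initial weights in [0, 1); inputs are assumed to be scaled similarly.
    weights = [[[rng.random() for _ in range(dim)] for _ in range(cols)]
               for _ in range(rows)]
    radius0 = radius0 or max(rows, cols) / 2
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1 - frac)
        radius = max(radius0 * (1 - frac), 0.3)
        for x in rng.sample(data, len(data)):  # visit the samples in random order
            br, bc = best_matching_unit(weights, x)
            for r in range(rows):
                for c in range(cols):
                    # Gaussian neighborhood measured on the 2-D grid, not in input space.
                    grid_d2 = (r - br) ** 2 + (c - bc) ** 2
                    h = math.exp(-grid_d2 / (2 * radius ** 2))
                    w = weights[r][c]
                    for i in range(dim):
                        w[i] += lr * h * (x[i] - w[i])
    return weights
```

Calling `best_matching_unit(weights, x)` on a trained map gives the low-dimensional grid coordinate of a high-dimensional vector `x`, which is the dimension reduction step. Because each update also pulls the winner's grid neighbors toward the sample, nearby inputs tend to land on nearby units, which is the sense in which topology is preserved.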
Social tags are annotations for Web pages collaboratively added by users. Such tags make it much easier to understand the meaning of Web pages and to classify them. The precision in retrieving Web pages may also increase when such tags are used. Nowadays social tags are mostly added manually by users via social bookmarking Web sites. Such a manual annotation process may produce diverse, redundant,...
Due to improvements in digital image technology and the increasing amount of digital image data, automatic image annotation technology and its applications are becoming more and more important. In order to retrieve images efficiently, it is important to extract and represent the semantics of images. Traditional image semantics extraction and representation schemes were commonly divided into two...
Web directories cluster Web pages into categories and usually organize them into hierarchies. Many users use them to browse for interesting Web pages in a coarse-to-fine manner. Nowadays most Web directories access monolingual Web pages and provide only a monolingual interface, which may limit the coverage and accessibility of Web pages for users familiar only with their native languages. Bilingual...
Multilingual information retrieval has attracted much attention in recent years due to the explosive increase in multilingual Web pages. It is not easy to retrieve documents written in languages other than that of the query unless the relationships among entities of different languages are found. In this work, we develop a method based on self-organizing maps to organize documents into hierarchy...
Web pages nowadays are written in various languages, including English, Chinese, and Spanish. There is an increasing need to search Web pages in different languages using a single query. This task is called multilingual information retrieval (MLIR). However, MLIR is difficult to achieve since it requires some method to find the associations between linguistic elements of different languages...
With the increasing amount of multilingual text on the Internet, multilingual text retrieval techniques have become an important research issue. However, the discovery of relationships between different languages remains an open problem. In this paper we propose a method, which applies the growing hierarchical self-organizing map (GHSOM) model, to discover knowledge from multilingual text documents...
Research on plagiarism detection methods for monolingual texts (e.g. English texts) has become well established in recent years. However, little attention has been paid to plagiarism detection in cross-lingual text collections (e.g. English and Chinese texts). In this paper we present a system platform for evaluating text similarity and relatedness in multilingual...