Traditional Uyghur search engines lack semantic information. To solve this problem, we propose a semantic retrieval model comprising resource collection, semantic annotation, query analysis and result ranking. Having established the relationship between ontology concepts and web documents, we implement the automatic semantic annotation of documents and the expansion of user's...
This paper presents a novel extractive approach to multi-document summarization that uses geodesic distance to compute sentence similarity. A text relationship map is constructed from the geodesic distance between every pair of sentences. Sentences with a higher degree in the map are selected and grouped into clusters. Finally, sentences with highest degree of each cluster are...
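As a rough illustration of the idea, and not the paper's actual procedure, geodesic distance can be read as shortest-path length over a neighbourhood graph of sentences. The sketch below assumes a precomputed matrix of direct sentence distances; the thresholds `eps` and `radius` and both function names are hypothetical.

```python
import math


def geodesic_distances(dist, eps):
    """All-pairs shortest-path lengths over a neighbourhood graph.

    dist: symmetric matrix of direct sentence distances (assumed given).
    eps:  hypothetical threshold; only pairs with dist <= eps are linked.
    """
    n = len(dist)
    # Start from direct links only; unlinked pairs are infinitely far apart.
    geo = [
        [0.0 if i == j else (dist[i][j] if dist[i][j] <= eps else math.inf)
         for j in range(n)]
        for i in range(n)
    ]
    # Floyd-Warshall relaxation: allow paths through each intermediate k.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if geo[i][k] + geo[k][j] < geo[i][j]:
                    geo[i][j] = geo[i][k] + geo[k][j]
    return geo


def degrees(geo, radius):
    """Degree of each sentence in a map linking pairs within `radius`."""
    n = len(geo)
    return [
        sum(1 for j in range(n) if j != i and geo[i][j] <= radius)
        for i in range(n)
    ]
```

Under these assumptions, a sentence that sits between two otherwise unrelated sentences ends up with the highest degree, which is the kind of sentence the abstract describes selecting.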
Query-focused multi-document summarization aims to produce a summary in response to a user query. We present an approach based on estimation of content-terms to address this task. In the process of estimating content-terms, we make full use of the relevant feature and the information richness feature for assigning importance to each of them. With summary content-terms being identified correctly, the...
Clustering Extensible Markup Language (XML) documents is useful for XML applications such as XML search engines. The element tags and their positions in the document's hierarchy provide valuable information for clustering XML documents. An XML path can represent both the element tags and their position information. Since common XPath represents only part of the XML structure, using common Xpath as XML structural...
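To make the notion of an XML path concrete, here is a minimal sketch that collects the root-to-element tag paths of a document using Python's standard `xml.etree.ElementTree`. The Jaccard comparison of path sets is an assumption added for illustration only; the abstract does not state which similarity measure the paper uses, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Return the set of root-to-element tag paths in an XML document."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        # Each path encodes the tag and its position in the hierarchy.
        path = prefix + "/" + node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def jaccard(a, b):
    """Jaccard similarity of two path sets (illustrative choice only)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


doc_a = "<book><title>t</title><author><name>n</name></author></book>"
doc_b = "<book><title>t</title><price>1</price></book>"
similarity = jaccard(tag_paths(doc_a), tag_paths(doc_b))
```

Here the two documents share the paths `/book` and `/book/title` out of five distinct paths, so a clustering step built on this measure would treat them as partially similar in structure.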
This paper describes a method of gender identification for Chinese e-mail documents. Features of e-mail documents, including linguistic, format and structure features, were analyzed. The support vector machine algorithm was selected as the classification algorithm. Experiments on a set of samples gave promising results, indicating that the method is feasible.