This article addresses the problem of extracting cultural terms and their corresponding English translations from the heterogeneous literature on the translation of ancient Chinese classics. As a text-processing tool, regular expressions can match patterned text. This research focuses on designing target-oriented regular expressions to fit the pattern...
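As a rough illustration of the kind of matching this abstract describes, the sketch below pulls out a Chinese term followed by an English gloss in parentheses. The pattern, the function name `extract_term_pairs`, and the sample sentence are assumptions made for illustration only; the article's actual expressions are not given in the abstract.

```python
import re

# Hypothetical pattern (not taken from the article): a run of CJK characters,
# optional whitespace, then an English gloss in half- or full-width parentheses.
TERM_PATTERN = re.compile(
    r"([\u4e00-\u9fff]+)\s*[(（]\s*([A-Za-z][A-Za-z' -]*?)\s*[)）]"
)

def extract_term_pairs(text):
    """Return (Chinese term, English translation) pairs found in text."""
    return [(m.group(1), m.group(2)) for m in TERM_PATTERN.finditer(text)]

# Illustrative input only.
sample = "The concept of 仁 (benevolence) and 君子（gentleman) recurs."
pairs = extract_term_pairs(sample)
```

Real translation literature varies far more in layout than this single pattern covers, which is presumably why the article speaks of target-oriented expressions designed per pattern.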
The algorithm is being developed with a view to reducing the time needed to go through an entire document. The tool will be able to summarize textual documents automatically using statistical as well as linguistic techniques. It will provide a much-needed method for creating concise yet precise documents.
Case-based reasoning technology has been widely applied in many areas. This study first clustered the case base hierarchically and, on that foundation, designed a retrieval strategy for a non-isomorphic case base; it analyzed the hierarchical clustering rules of the case base and mainly discussed the clustering-based case retrieval strategy. The results of the experiment show that this...
Based on an analysis of current component retrieval methods in web services, a method of retrieving software components based on domain ontology and user interest was studied and implemented. This paper emphasized the definition of the ontology feature domain model, the presentation of the component description model based on ontology features, and the retrieval method based on user interest. Based on these, it presented...
In text classification and information retrieval, synonymy between sentences gives rise to information redundancy and complex computation. To address this, this paper gives a calculation model for the semantic similarity of sentences: it starts with the semantic similarity of words and phrases, then analyzes the sentence category, and introduces semantic similarity to syntax, finally...
This paper proposes a novel hierarchical speaker identification method to reduce speaker identification and training time. First, a coarse decision is obtained by a fast scan of all registered speakers using a PCA classifier to find M possible target speakers; then a final decision is obtained by the proposed Multi-Reduced Support Vector Machine (MRSVM). The MRSVM has two reduction steps to reduce...
We postulate that, due to linguistic and cultural factors, there are differences in the Web structure and in the content of Web documents across languages. In this work, we design experiments to study the characteristics of the Chinese Web and compare them to those of the English Web. We also examine whether these differences in Web characteristics, if identified, have an impact on the effectiveness...
With the rapid development of the Internet, Web log mining, which is used to find useful information about users from Web log files, has become a hot research topic. The aim of association rule mining is to find interesting and useful patterns in a transaction database. This paper makes use of variable precision rough set theory to extract association rules from Web logs and applies the rules to...
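To make the notion of association rules over Web logs concrete, the sketch below derives page-to-page rules using plain support and confidence thresholds. This is the textbook formulation, not the variable precision rough set method the paper uses; the function name `pairwise_rules`, the thresholds, and the session data are all assumed for illustration.

```python
from itertools import combinations

def pairwise_rules(sessions, min_support=0.5, min_confidence=0.6):
    """Find rules lhs -> rhs between single pages visited in the same session.

    Plain support/confidence mining, NOT the paper's rough-set approach.
    Returns a list of (lhs, rhs, support, confidence) tuples.
    """
    n = len(sessions)
    item_count = {}
    pair_count = {}
    for session in sessions:
        pages = set(session)  # count each page once per session
        for page in pages:
            item_count[page] = item_count.get(page, 0) + 1
        for a, b in combinations(sorted(pages), 2):
            pair_count[(a, b)] = pair_count.get((a, b), 0) + 1

    rules = []
    for (a, b), count in pair_count.items():
        support = count / n  # fraction of sessions containing both pages
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_count[lhs]  # estimate of P(rhs | lhs)
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return rules

# Hypothetical sessions, each a list of pages requested by one user.
sessions = [
    ["/home", "/news"],
    ["/home", "/news", "/about"],
    ["/home"],
    ["/news", "/home"],
]
rules = pairwise_rules(sessions)
```

With these sessions only the `/home` and `/news` pair clears the support threshold, so the two rules between them are the only ones returned.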