The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Today, with the rapid increase in the use of the internet, thousands of resources can be reached about an information that is interested. However, it is difficult and time consuming to determine which of these sources is useful. Automatic document summarization is a dimension reduction process which remains the important parts of the text. In this study, the TextRank algorithm, which is a graph based...
Computational Intelligence is a dead-end attempt to recreate human-like intelligence in a computing machine. The goal is unattainable because the means chosen for its accomplishment are mutually inconsistent and contradictory: “Computational” implies data processing ability while “Intelligence” implies the ability to process information. In the research community, there is a lack of interest in data...
Treebank is one of important resources in the natural language processing. Compared with the rich and mature Chinese corpus, Vietnamese Syntactic Analysis is much more difficult. This paper presents a new approach which uses Chinese-Vietnamese bilingual word alignment corpus to build Vietnamese Dependency Treebank. Firstly, the aligned word processing was made by Chinese-Vietnamese sentence alignment;...
Nowadays, the data centric system has been playing an increasingly important role in blogs sharing, content delivery and news broadcasting, file-synchronization, and so on. Due to generated amount of data within the system, data backup and archiving has become a main challenging task. A main methods to solve the problem is Chunking based deduplication by eliminating redundant data and reducing the...
Mining the large volume textual data produced by microblogging services has attracted much attention in recent years. An important preprocessing step of microblog text mining is to convert natural language texts into proper numerical representations. Due to the short-length characteristic, finding proper representations of microblog texts is nontrivial. In this paper, we propose to build deep network-based...
Artificial Intelligence has the potential to change the world. The application of A.I. to robotics, control systems, text recognition, voice recognition, and autonomous vehicles will prove both revolutionary and useful. However, humans and machines process information in fundamentally different ways, and if those differences are not appreciated and exploited, a great deal of time and money will be...
The existing search engines retrieve information only based on the keywords. The incapability to search on the basis of the relation between the keywords and the user concepts, generates noise and hence, results in irrelevant retrieval. This leads to the idea of performing Semantic information processing by mapping the user's Concept and Context of the query with the retrieved results to filter (remove)...
Several studies on autism spectrum disorder (ASD) show that there exists significant heterogeneity in phenotype of the disorder. Additionally, many published findings also suggested that ASD is defined by atypical local/global processing. In this paper, we designed a puzzled-based intervention to examine the sensitiveness to the information of local /global processing on individuals with ASD. Additionally,...
The Gentzen system for the propositional logic is a deduction system which is logically equivalent to the corresponding axiomatic system. In the Gentzen system, the validity and provability of a sequence G?? are considered. The validity (¦) corresponds to the provability (+). In this paper, we propose the variant Gentzen system which is a dual-system of the Gentzen system. A co-sequence G|? is introduced,...
Given the importance of mastering the large vocabularies in English study, 3P strategy based on vocabulary memorization model is investigated in this paper to help students enlarge the vocabulary information retention and promote their vocabulary memorization. Students' study process has been divided into three stages according to 3P strategy: plural stimulus, visual and audial, is the first stage...
Cross-Language Information Retrieval (CLIR) is a sub field of Information Retrieval (IR). Like IR, in CLIR for a particular information need, we have to find relevant information or documents containing such information. In CLIR multiple tools must be developed to match terms containing same meaning in different languages. The usual solution is to translate the query and/or the documents before performing...
We present a framework for identifying the most representative sentence patterns from semantically and syntactically-annotated corpora via a Semantic Frame Generation (SFG). One of the difficulties to find out similar concepts from a text is because of the variations in linguistic expressions. SFG uses linguistic units as backbones to generate the most prominent patterns from various Chinese DE phrases.
This paper researches and discusses the English-based WordNet database of semantic relations and proposes a method for constructing a Uyghur language, distributed-system, semantic lexicon. In our work, we first took a modern Uyghur languageannotated dictionary containing 60,000 terms, and processed these terms using WordNet's conceptual relations and thereby extracted from this local language material...
Cognitive scientists believe that humans memorize and understand the real world through "event". A large number of narrative class texts contain various events and people can extract important events from the texts to support various event-based information processing. In this paper, we firstly research event annotation and build the Chinese Emergency Corpus. Then, we consider the event...
Tibetan information processing technology has made some advances. However it still does not keep up with the development of today's information age. The semantic study based on the Tibetan Ontology is important and valuable. In this paper, we propose an approach of Tibetan concept similarity computation based on Ontology. It considers four aspects such as the semantic coincidence degree, the semantic...
Mongolian homographs disambiguation problem is one of the difficulties of the Mongolian information processing. This paper puts forward a method for eliminating homonyms ambiguity based on Mongolian nouns Semantic network, achieving the design and implementation of the homograph disambiguation algorithm. Finally, the design process and experimental results of the corpus of homograph disambiguation...
The essence of map symbols is a psychological unity which contains geographical concepts and visual graphics. Several problems that exist in current researches on map symbols were analyzed: focusing on the visual graphics but paying little attention to the semantics, constructing the structure model of map symbols without unifying the concepts and visual graphics, and not fully investigating the relationship...
Spoken language recognition refers to the automatic process through which we determine or verify the identity of the language spoken in a speech sample. We study a computational framework that allows such a decision to be made in a quantitative manner. In recent decades, we have made tremendous progress in spoken language recognition, which benefited from technological breakthroughs in related areas,...
Automatic speech recognition (ASR) is a central and common component of voice-driven information processing systems in human language technology, including spoken language translation (SLT), spoken language understanding (SLU), voice search, spoken document retrieval, and so on. Interfacing ASR with its downstream text-based processing tasks of translation, understanding, and information retrieval...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.