Computer text-to-speech can enhance human learning. Because people differ in their ability to visualize, receiving information as speech can make it easier to absorb. The objective of this research is to develop computer software that converts Thai text to speech (TTTS). The TTTS consists of four modules, which...
This paper describes an isolated word recognition method based on distinctive phonetic features (DPFs). The method comprises two multilayer neural networks (MLNs). The first MLN, MLNLF-DPF, maps local features (LFs) of an input speech signal into discrete DPFs, and the second MLN, MLNDyn, constrains the dynamics of the DPFs output by MLNLF-DPF. In the experiments on the Tohokudai Isolated Spoken-Word Database...
A paraphrase is an alternative expression that does not change the meaning of the original statement. This paper proposes a Chinese text information hiding algorithm that builds on automatic paraphrasing techniques for Chinese sentences. The algorithm gives prominence to template rules and the matching process, which also serve as the basis for realizing information hiding. Then we analyze the algorithm...
The amount of time teachers spend grading essays has increased over the past decade, prompting the development of systems that are able to lighten the workload. Many systems have thus far used linear regression or semi-supervised methods towards this objective. This paper discusses some of the main Automated Essay Grading systems, highlighting some of their strengths and weaknesses, in addition to...
A screen reader is a form of assistive technology that helps visually impaired people use a computer and access the Internet. So far, it has remained expensive and within the domain of English (and some foreign) language computing. For Indian languages, this development is limited by: the availability of Text-to-Speech (TTS) systems in Indian languages, support for reading glyph-based font-encoded text,...
Nowadays, the concatenative method is used in most modern TTS systems to produce artificial speech. The most important challenge in this method is choosing an appropriate unit for building the database. This unit must guarantee smooth, high-quality speech, and building a database for it must be inexpensive and require only reasonable resources. Syllable, phoneme, allophone, and diphone are usually...
The objective of this study was to determine whether one perceptually dominant channel in carrying emotional cues could be determined among speech, textual content and facial expression. To this end a Wizard-Of-Oz type scenario was used to elicit a corpus of emotional speech and facial expressions from five female speakers. Excerpts from this corpus were then presented to 48 listeners in the various...
As the amount of online text on the World Wide Web grows, the development of methods for automatically summarizing this text becomes more important. The primary goal of this research is to create an efficient tool that is able to summarize large documents automatically. We propose an Evolving Connectionist System, an adaptive, incremental learning and knowledge representation system...
In this paper, an ANN-based spectrum-progression model (SPM) is proposed. This model is intended to improve the fluency of synthetic Mandarin speech when only a small training corpus is available. In constructing this model, each target syllable is first matched with its reference syllable by using DTW. Then, each warped path, i.e. spectrum-progression path, is time normalized...
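The DTW matching step mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each frame is a single number (real systems would compare spectral feature vectors), it uses absolute difference as the frame distance, and the helper names `dtw_path` and `normalize_path` are hypothetical. The normalization shown is a simple rescaling of both axes to [0, 1], which is only one plausible reading of "time normalized".

```python
def dtw_path(a, b):
    """Align sequences a and b with dynamic time warping.

    Returns (total_cost, path), where path is a list of (i, j) index pairs.
    Assumption: frames are scalars and the local distance is |a[i] - b[j]|.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])

    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # diagonal move
            (cost[i - 1][j], i - 1, j),          # advance in a only
            (cost[i][j - 1], i, j - 1),          # advance in b only
        ]
        _, i, j = min(steps)
    path.reverse()
    return cost[n][m], path


def normalize_path(path, n, m):
    """Rescale path indices to [0, 1] on both axes (assumed normalization)."""
    return [
        (i / (n - 1) if n > 1 else 0.0, j / (m - 1) if m > 1 else 0.0)
        for i, j in path
    ]


target = [1.0, 2.0, 3.0]
reference = [1.0, 2.0, 2.0, 3.0]
total, path = dtw_path(target, reference)
normalized = normalize_path(path, len(target), len(reference))
```

Here the second target frame is matched to both repeated reference frames, so the two sequences align with zero cost.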
Jawi is an older script for writing the Malay language that needs to be preserved. Therefore, it is important to develop tools for teaching children Jawi characters, and a speech-to-text (STT) application can serve this purpose well. Unlike English, Jawi uses special characters similar to Arabic characters. However, they are pronounced in Malay. This uniqueness makes STT development a challenging...
We introduce a model for extractive meeting summarization based on the hypothesis that utterances convey bits of information, or concepts. Using keyphrases as concepts weighted by frequency, and an integer linear program to determine the best set of utterances, that is, covering as many concepts as possible while satisfying a length constraint, we achieve ROUGE scores at least as good as a ROUGE-based...
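The concept-coverage objective described in this abstract can be sketched on a toy example. This is an illustration under stated assumptions, not the authors' model: an exhaustive search over utterance subsets stands in for the integer linear program (it optimizes the same objective but only scales to a handful of utterances), and the function name `best_summary`, the concept labels, weights, and lengths are all made up for the example.

```python
from itertools import combinations


def best_summary(utterances, weights, max_len):
    """Pick the subset of utterances that maximizes covered concept weight.

    utterances: list of (length, set_of_concepts) pairs (hypothetical data).
    weights:    dict mapping each concept to its weight, e.g. its frequency.
    max_len:    upper bound on the total length of the selected utterances.

    Each concept counts once, however many selected utterances contain it.
    Brute force replaces the ILP solver here, so this is exponential in the
    number of utterances.
    """
    best_score, best_set = 0, ()
    indices = range(len(utterances))
    for k in range(1, len(utterances) + 1):
        for subset in combinations(indices, k):
            total_len = sum(utterances[i][0] for i in subset)
            if total_len > max_len:
                continue  # violates the length constraint
            covered = set().union(*(utterances[i][1] for i in subset))
            score = sum(weights[c] for c in covered)
            if score > best_score:
                best_score, best_set = score, subset
    return best_score, best_set


# Toy meeting: four utterances with lengths and the concepts they mention.
utterances = [
    (5, {"budget", "deadline"}),
    (4, {"budget"}),
    (3, {"design"}),
    (6, {"deadline", "design", "remote"}),
]
weights = {"budget": 3, "deadline": 2, "design": 2, "remote": 1}

score, chosen = best_summary(utterances, weights, max_len=10)
```

With a length budget of 10, the search selects utterances 1 and 3, which together cover all four concepts for a score of 8.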