Search results

chapter

An annotated corpus for Turkish sentiment analysis at sentence level

Sevinc Ilhan Omurca, Ekin Ekinci, Hazal Turkmen

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 5

2017 International Artificial Intelligence and Data Processing Symposium (IDAP)

With the rapid growth of unstructured data accessible via the web, managing these data and finding undiscovered information in huge datasets has become a necessary task. Consequently, text mining, which can be defined as gleaning important information from natural language text, has emerged. In this study, in order to facilitate information management for aspect-based sentiment analysis studies, a Turkish sentiment...

chapter

Implementation of telugu speech synthesis system

Gangala Ramya, Nenavath Srinivas Naik

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1151 - 1154

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Speech synthesis is the computer generation of human voice. Speech synthesis systems are often called text-to-speech (TTS) systems because of their ability to convert text into speech: a TTS synthesis system converts written orthographic text into corresponding artificial speech signals. In multi-lingual cultural settings,...

chapter

Northern Thai Dialect Text to Speech

Pannakorn Chao-angthong, Atiwong Suchato, Proadpran Punyabukkana

2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE) > 1 - 6

2017 14th International Joint Conference on Computer Science and Software Engineering (JCSSE)

Each dialect of the Thai language has a distinct identity associated with its accent. Conversation between native speakers of different dialects, despite their common origin in the standard language, cannot be avoided when visiting each region. The need to communicate with people who understand only the Northern Thai Dialect (NTD) led us to the idea of inventing the Northern Thai Dialect Text to Speech...

chapter

Syllabification: An effective approach for a TTS system for Konkani

Nilesh FalDessai, Jyoti Pawar, Gaurav Naik

2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT) > 161 - 167

2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT)

A speech synthesis system converts written text to speech. To build a natural-sounding speech synthesis system, it is essential that the text processing component produce an appropriate sequence of units. The syllable preserves co-articulation effects within the sound unit. In our current work, a concatenative method is used to develop a synthesis system using the syllable as the basic unit, which includes Jodhakshars,...

chapter

Unstructured data treatment for big data solutions

Shintaro Sato, Akihiro Kayahara, Shin-ichi Imai

2016 International Symposium on Semiconductor Manufacturing (ISSM) > 1 - 4

2016 International Symposium on Semiconductor Manufacturing (ISSM)

We constructed a system infrastructure capable of processing unstructured data, with the aim of practical application of the system for document data analysis in the manufacturing industry. Using past ISSM research paper data, papers were classified and verified. Using morphological analysis, the extracted parts of speech were used as feature quantities, and machine learning was executed. Since effective...

chapter

Document similarity analysis in Slovak language

Vladimir Hanusniak, Vladimir Smatanik, Milan Straka, Michal Zabovsky

2016 International Conference on Information Management and Technology (ICIMTech) > 281 - 285

2016 International Conference on Information Management and Technology (ICIMTech)

Examining data for similar items is one of the fundamental data-mining problems. Methods for similarity search could be useful for detecting plagiarism or near-duplicate web pages. The computerized methods developed in recent years are mainly focused on the English language. However, the Slovak language has several specific attributes, and using these methods may not be precise enough. Our...

chapter

Syntactic text analysis without a dictionary

I. A. Bessmertny, A.V. Platonov, E.A. Poleschuk, Ma Pengyu

2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT) > 1 - 3

2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT)

Syntactic text analysis is a very important step of automatic text processing. The key problem is that all existing approaches are dictionary-dependent. An entire sentence can be impossible to analyze because one word is missing from the dictionary. However, even the presence of a dictionary does not resolve ambiguity in phrase interpretation. At the same time, fusional languages contain enough...

chapter

The design and implementation of HMM-based Dai speech synthesis

Zhan Wang, Jian Yang, Xin Yang

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

More than 1.2 million Dai people in Yunnan province use the Dai language, so research on Dai speech synthesis is of great significance for advancing the informatization of Dai. This paper focuses on the implementation of Dai speech synthesis using the HMM speech synthesis framework and the STRAIGHT synthesizer. The methods of collection and selection of Dai...

chapter

Research on text analysis for Tibetan statistical parametric speech synthesis

Zhenye Gan, Xinjie Kong, Shuai Zhang

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 877 - 882

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

Text analysis is the front end of a TTS system and has a great influence on the naturalness of the back-end speech synthesis. Statistical parametric speech synthesis is now commonly applied and is gradually becoming an important method of speech synthesis; however, research on front-end text analysis is often overlooked in the process of current Tibetan...

chapter

A comprehensive text analysis for Bengali TTS using unicode

Sheikh Abujar, Mahmudul Hasan

2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) > 547 - 551

2016 International Conference on Informatics, Electronics and Vision (ICIEV)

Communication is a very natural characteristic of every creature. Sometimes we use different symbols or various formed languages to communicate with each other. Every language we use supports both oral and written communication. Writing symbols is a way to express our intentions using any physical material. As we have oral communication capability too, which we could use exactly as we want to speak...

chapter

Audio salient event detection and summarization using audio and text modalities

Athanasia Zlatintsi, Elias Iosif, Petros Maragos, Alexandros Potamianos

2015 23rd European Signal Processing Conference (EUSIPCO) > 2311 - 2315

2015 23rd European Signal Processing Conference (EUSIPCO)

This paper investigates the problem of audio event detection and summarization, building on previous work [1,2] on the detection of perceptually important audio events based on saliency models. We take a synergistic approach to audio summarization where saliency computation of audio streams is assisted by using the text modality as well. Auditory saliency is assessed by auditory and perceptual cues...

chapter

A framework for Bangla text to speech synthesis

K. M. Azharul Hasan, Muhammad Hozaifa, Sanjoy Dutta, Rafsan Zani Rabbi

16th Int'l Conf. Computer and Information Technology > 60 - 64

2013 16th International Conference on Computer and Information Technology (ICCIT)

We describe a basic framework and methodology to convert Bangla text to speech. The methodology automatically produces articulated words from Bangla input text, based on the basic pronunciation of the Bangla words. Single-tone syllables are considered the fundamental units for analysis. The methodology selects phonetic units from uttered vocabulary and then combines the appropriate diphones...

chapter

An Active Contour Model for Speech Balloon Detection in Comics

Christophe Rigaud, Jean-Christophe Burie, Jean-Marc Ogier, Dimosthenis Karatzas, et al.

2013 12th International Conference on Document Analysis and Recognition > 1240 - 1244

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

Comic books constitute an important cultural heritage asset in many countries. Digitization combined with subsequent comic book understanding would enable a variety of new applications, including content-based retrieval and content retargeting. Document understanding in this domain is challenging as comics are semi-structured documents, combining semantically important graphical and textual parts...

chapter

Top Management Team Attention and International Strategy: A Case Study

Jianzu Wu, Yusheng Bi

2012 Fifth International Conference on Business Intelligence and Financial Engineering > 625 - 628

2012 Fifth International Conference on Business Intelligence and Financial Engineering (BIFE)

In this paper, we use a case study to investigate how top management team (TMT) attention distribution affects a firm's choice of international expansion strategy. With the help of an automated text analysis method, we analyze the CEO's public speeches and annual reports of Huawei, and measure TMT attention by counting sentences relating to technology seeking, global brand building, and target market positioning,...

chapter

Combining text and prosodic analysis for prominent word detection

Jitendra Ajmera, Om D Deshmukh

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1534 - 1537

2012 21st International Conference on Pattern Recognition (ICPR)

This paper presents an approach that considers both corpus-level (global) information and localized acoustic patterns to discover prominent words in audio conversations. The global information is extracted using text analysis techniques, in particular latent Dirichlet allocation (LDA), which extracts domain-specific prominent words and also arranges them in a set of topics. The domain...

chapter

Extracting Semantic Role Information from Unstructured Texts

Diana Trandabat, Alexandru Trandabat

2011 Sixth International Workshop on Semantic Media Adaptation and Personalization > 62 - 67

2011 6th International Workshop on Semantic Media Adaptation and Personalization (SMAP)

Shallow semantic parsing is an important component in all kinds of NLP applications, and Semantic Role Labeling in particular is an active research topic. This paper describes a rule-based Semantic Role Labeling system aimed at extracting semantic information from texts. The input text is processed by exploiting part-of-speech information and syntactic dependencies in...

chapter

Text Normalization and Phonetic Analysis Modules for Macedonian TTS synthesis

Branislav Gerazov, Zoran Ivanovski

2011 19th Telecommunications Forum (TELFOR) Proceedings of Papers > 671 - 674

2011 19th Telecommunications Forum Telfor (TELFOR)

The paper presents the Text Normalization and Phonetic Analysis Modules that are part of the frontend of the text-to-speech (TTS) system “Speak Macedonian”. First, the architecture of the frontend of the TTS system “Speak Macedonian” is briefly presented, followed by a detailed look into the two modules. For each of the modules a short summary is given of the tasks and developed solutions, found in...

chapter

Development of Hindi mobile communication text and speech corpus

Shweta Sinha, S.S. Agrawal, Jesper Olsen

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA) > 30 - 35

2011 Oriental COCOSDA 2011 - International Conference on Speech Database and Assessments

This paper describes the collection of a text and audio corpus for mobile personal communication in Hindi. Hindi is the largest of the Indian languages, and is the first language for more than 200 million people who use it not only for spoken mobile communication but also for sending text messages to each other. The main script for Hindi is Devanagari, but it is not well supported by the current generation...

chapter

A Hybrid Approach for Part-of-Speech Tagging of Burmese Texts

Cynthia Myint

2011 International Conference on Computer and Management (CAMAN) > 1 - 4

2011 International Conference on Computer and Management (CAMAN 2011)

In a Myanmar-to-English translation system, producing a meaningful sentence in one language from another is a non-trivial task. POS tagging is used as an early stage of linguistic text analysis in many applications. POS tagging is the process of assigning the correct syntactic category to each word. Tagsets and word disambiguation rules are fundamental parts of any POS tagger. This paper...

chapter

Determining Writing Genre: Towards a Rubric-based Approach to Automated Essay Grading

Hon Wai Lam, Tharam Dillon, Elizabeth Chang

2011 IEEE International Conference on Advanced Information Networking and Applications > 270 - 274

2011 IEEE 25th International Conference on Advanced Information Networking and Applications (AINA 2011)

A writing genre can be thought of as the style in which the writer chooses to present textual content to the reader. We distinguish four main essay genres, namely Narrative, Persuasive, Descriptive and Expository. An essay's writing genre can be identified by searching for salient features present within those genres using various Natural Language Processing tools such as Named Entity Recognition,...

INFONA - science communication portal
