Search results

Items from 1 to 14 out of 14 results

chapter

Semi-supervised probabilistics approach for normalising informal short text messages

Abiodun Modupe, Turgay Celik, Vukosi Marivate, Melvin Diale

2017 Conference on Information Communication Technology and Society (ICTAS) > 1 - 8

2017 Conference on Information Communication Technology and Society (ICTAS)

The growing use of informal social text messages on Twitter is one of the known sources of big data. These type of messages are noisy and frequently rife with acronyms, slangs, grammatical errors and non-standard words causing grief for natural language processing (NLP) techniques. In this study, our contribution is to target non-standard words in the short text and propose a method to which the given...

chapter

Automatic text summarization using fuzzy inference

Mehdi Jafari, Jing Wang, Yongrui Qin, Mehdi Gheisari, more

2016 22nd International Conference on Automation and Computing (ICAC) > 256 - 260

2016 22nd International Conference on Automation and Computing (ICAC)

Due to the high volume of information and electronic documents on the Web, it is almost impossible for a human to study, research and analyze this volume of text. Summarizing the main idea and the major concept of the context enables the humans to read the summary of a large volume of text quickly and decide whether to further dig into details. Most of the existing summarization approaches have applied...

article

Sentiment Embeddings with Applications to Sentiment Analysis

Duyu Tang, Furu Wei, Bing Qin, Nan Yang, more

IEEE Transactions on Knowledge and Data Engineering > 2016 > 28 > 2 > 496 - 509

We propose learning sentiment-specific word embeddings dubbed sentiment embeddings in this paper. Existing word embedding learning algorithms typically only use the contexts of words but ignore the sentiment of texts. It is problematic for sentiment analysis because the words with similar contexts but opposite sentiment polarity, such as good and bad, are mapped to neighboring word vectors. We address...

chapter

Using topic models in domain adaptation

Samira Tofighi Zahabi, Somayeh Bakhshaei, Shahram Khadivi

7'th International Symposium on Telecommunications (IST'2014) > 539 - 543

2014 7th International Symposium on Telecommunications (IST)

An important factor of a corpus is its domain, usually the quality of a SMT system trained on an in-domain corpus increases by adding out-of-domain sentences to its training corpus. In this paper we have shown out-of-domain corpora may also contains sentences which are proper for improving the quality of in-domain corpus. These sentences have words and phrases that occur in indomain corpora so, their...

chapter

Transitivity in semantic relation learning

Francesca Fallucchi, Fabio Massimo Zanzotto

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 8

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

Text understanding models exploit semantic networks of words as basic components. Automatically enriching and expanding these resources is then an important challenge for NLP. Existing models for enriching semantic resources based on lexical-syntactic patterns make little use of structural properties of target semantic relations. In this paper, we propose a novel approach to include transitivity in...

chapter

Design and Simulation of Human Conversational Model for Distributed Systems

S Khaddaj, B Makoond

2010 Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science > 106 - 112

2010 Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Human conversation is the best example of loosely coupled distributed communication known to man. This study specifically considers those parts of the human conversation that show evidence of dynamism and attempts to model these parts with the objective of translating the knowledge of conversational dynamics into the domain of computerised distributed systems. The discipline of metaphorical modelling...

chapter

Generation of F0 contours for Vietnamese speech synthesis

Do Dat Tran, Eric Castelli

International Conference on Communications and Electronics 2010 > 158 - 162

2010 Third International Conference on Communications and Electronics (ICCE 2010)

This paper describes the analysis results about the influence of coarticulation effect and syllable duration on variations of Vietnamese tones in continuous speech. Based on these results a new method for generating F0 contours is proposed. Firstly the contour of tonal register, which is calculated from relative register ratios between two adjacent tones, is produced. And then the tone patterns are...

chapter

Removing fillers to induce semantic classes for a Chinese dialogue system

Yali Li, Xuemin Zhao, Yonghong Yan

2010 2nd IEEE International Conference on Information Management and Engineering > 512 - 516

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

In this paper, we introduced an unsupervised method to remove fillers in spoken dialogues semi-automatically based on their probability distribution and the effect of removing fillers to induce semantic classes. We conduct the unigram and bigram distribution of fillers on our Chinese voice search data and find that only using these distributions, fillers are in the first 1% of all words. We also test...

chapter

Removing fillers to induce semantic classes for a Chinese dialogue system

Yali Li, Yonghong Yan

2010 2nd International Conference on Advanced Computer Control > 4 > 163 - 166

2010 2nd International Conference on Advanced Computer Control (ICACC 2010)

In this paper, we introduced an unsupervised method to remove fillers in spoken dialogues semi-automatically based on their probability distribution. Disfluencies such as fillers, repairs often make the sentence ill-formed, longer and hard to process. Fillers were emphasized instead of repairs in this paper. We conduct the unigram and bigram distribution of fillers on our Chinese voice search data...

chapter

A method of Chinese document expression for extracting formal context

Yinghui Huang, Guanyu Li, Dongyan Wang

2010 3rd International Conference on Biomedical Engineering and Informatics > 7 > 3020 - 3023

2010 3rd International Conference on Biomedical Engineering and Informatics (BMEI 2010)

As a kind of data model, a formal context must be extracted from some actual data sources such as documents. For case of unstructured Chinese document, it is the first question to decide how to express the document. Vector space model (VSM) which is the dominant model of document expression now takes a single word as a feature item, so that neglects the lexical semantic relationship between words...

chapter

Vietnamese Final Stop Consonants /p, t, k/ Described in Terms of Formant Transition Slopes

V.S. Nguyen, E. Castelli, R. Carre

2009 International Conference on Asian Language Processing > 86 - 90

2009 International Conference on Asian Language Processing (IALP 2009)

It is well known that bursts and voiced formant transitions serve as separate cues to the place of articulation of initial stop consonants. The Vietnamese presents three final voiceless stop consonants /p, t, k/ without bursts. It is an opportunity to study these final stop consonants and to compare their characteristics with those of the corresponding initial stop consonants. As final consonants...

chapter

Recognizing sentence emotions based on polynomial kernel method using Ren-CECps

Changqin Quan, F. Ren

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 7

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Emotion recognition on text has wide applications. In this study we propose a method of emotion recognition at sentence level based on a relative large emotion annotation corpus (Ren-CECps). From this corpus, we get the emotion lexicons for the eight basic emotions (expect, joy, love, surprise, anxiety, sorrow, angry and hate). Statistics show that the emotion lexicons derived from Ren-CECps are used...

chapter

A New Approach To Accent Restoration Of Vietnamese Texts Using Dynamic Programming Combined With Co-Occurrence Graph

Hoang Trong Nghia, Do Phuc

2009 IEEE-RIVF International Conference on Computing and Communication Technologies > 1 - 4

2009 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF). Research, Innovation and Vision for the Future

In this paper, we would like to introduce a new approach to recover Vietnamese text's accents. Given a Vietnamese text in which accents are lost, our goal is to seek for a recovered text that yields a best lexical probability. Using a dynamic programming approach, we first build a model of language for Vietnamese as a lexical database which gives lexical probabilities to Vietnamese sentences. Second,...

chapter

An Anaphora Based Information Retrieval Model Extension

F. Santiago do Carmo Pereira, H. Seibel Junior, S.A.A. de Freitas

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 330 - 334

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Classical information retrieval models are based on representation of document terms without considering linguistic elements. This article presents a model based on the Discourse Nominal Structure; which lets us take linguistic characteristics of text into account. The model presented is evaluated in comparison with the vector space model. Based on observations during the experimentation we propose...

Filter options

Data set:
ieee
Keywords:
CONTEXT
NATURAL LANGUAGE PROCESSING
MATHEMATICAL MODEL

Publication date

Set your own date range

Publication type

book (13)
article (1)

Keywords

EQUATIONS (6)
DATA MINING (4)
SEMANTICS (4)
TEXT ANALYSIS (4)
COMPUTATIONAL MODELING (3)
CONTEXT MODELING (3)
PROBABILITY (3)
SPEECH (3)
SPEECH PROCESSING (3)
CHINESE DIALOGUE SYSTEM (2)
CHINESE VOICE SEARCH DATA (2)
FEATURE EXTRACTION (2)
FILLERS DETECTION (2)
FILLERS DISTRIBUTION (2)
LANGUAGE TRANSLATION (2)
MAINTENANCE ENGINEERING (2)
PROBABILITY DISTRIBUTION (2)
SPOKEN DIALOGUES (2)
TRAINING (2)
VECTOR SPACE MODEL (2)
ACCENT RESTORATION (1)
ACCURACY (1)
ACTUAL DATA SOURCES (1)
ADAPTATION MODELS (1)
AFFECTIVE COMPUTING (1)
ANALYSIS OF VARIANCE (1)
ANALYTICAL MODELS (1)
ANAPHORA RESOLUTION (1)
ANIMALS (1)
BIGRAM DISTRIBUTION (1)
BLEU SCORE (1)
BLOGS (1)
CHINESE DOCUMENT EXPRESSION (1)
CO-OCCURRENCE GRAPH (1)
COARTICULATION EFFECT (1)
CONSONANT-VOWEL-CONSONANT PRODUCTIONS (1)
CONTINUOUS SPEECH (1)
CONVERSATINAL DYNAMICS (1)
CONVERSATIONAL DYNAMICS (1)
DATA MODEL (1)
DATA MODELS (1)
DENTISTRY (1)
DISCOURSE NOMINAL STRUCTURE (1)
DISTRIBUTED PROCESSING (1)
DISTRIBUTED SYSTEMS (1)
DOCUMENT EXPRESSION (1)
DOCUMENT HANDLING (1)
DOCUMENT PROCESSING (1)
DOCUMENT TERM REPRESENTATION (1)
DOMAIN ADAPTATION (1)
DRIVER CIRCUITS (1)
DYNAMIC PROGRAMMING (1)
EMOTION LEXICONS (1)
EMOTION RECOGNITION (1)
F0 CONTOUR GENERATION (1)
FILLERS BIGRAM DISTRIBUTION (1)
FILLERS REMOVAL (1)
FILLERS UNIGRAM DISTRIBUTION (1)
FINAL STOP CONSONNANT (1)
FORMAL CONTEXT (1)
FORMAL CONTEXT EXTRACTION (1)
FORMANT TRANSITION SLOPES (1)
FUZZY LOGIC (1)
GRAPH THEORY (1)
GUIDELINES (1)
HOWNET (1)
HUMAN CONVERSATION (1)
HUMAN CONVERSATIONAL MODEL (1)
HUMAN-TO-COMPUTER CORPUS (1)
HUMAN-TO-COMPUTER DIALOGUES (1)
HUMAN-TO-HUMAN CORPUS (1)
HUMAN-TO-HUMAN DIALOGUES (1)
HUMANS (1)
IN-DOMAIN PARALLEL CORPUS (1)
INDEXES (1)
INFORMAL SHORT TEXT MESSAGES (1)
INFORMATION RETRIEVAL (1)
INFORMATION RETRIEVAL MODEL EXTENSION (1)
INTERACTIVE SYSTEMS (1)
KERNEL (1)
KNOWLEDGE BASED SYSTEMS (1)
KNOWLEDGE TRANSLATION (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEXICAL DATABASE (1)
LEXICAL PROBABILITY (1)
LEXICAL SEMANTIC RELATIONSHIP (1)
LEXICAL-SYNTACTIC PATTERNS (1)
LINGUISTIC CHARACTERISTICS (1)
LOCUS EQUATION SPACE (1)
MACHINE LEARNING PROBLEMS (1)
MEDIA (1)
METAPHORICAL MODELLING (1)
NATURAL DIALOGUE CORPUS (1)
NATURAL LANGUAGE (1)
NATURAL LANGUAGES (1)
NEURAL NETWORKS (1)
NLP (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options