Search results for: D. Inkpen

Items from 1 to 7 out of 7 results

article

A Machine Learning Approach for Identifying Disease-Treatment Relations in Short Texts

O Frunza, D Inkpen, T Tran

IEEE Transactions on Knowledge and Data Engineering > 2011 > 23 > 6 > 801 - 814

The Machine Learning (ML) field has gained its momentum in almost any domain of research and just recently has become a reliable tool in the medical domain. The empirical domain of automatic learning is used in tasks such as medical decision support, medical imaging, protein-protein interaction, extraction of medical knowledge, and for overall patient management care. ML is envisioned as a tool by...

chapter

An unsupervised approach to preposition error correction

A Islam, D Inkpen

Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) > 1 - 4

2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010)

In this work, an unsupervised statistical method for automatic correction of preposition errors using the Google n-gram data set is presented and compared to the state-of-the-art. We use the Google n-gram data set in a back-off fashion that increases the performance of the method. The method works automatically, does not require any human-annotated knowledge resources (e.g., ontologies) and can be...

chapter

Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction

A.H. Razavi, S. Matwin, D. Inkpen, A. Kouznetsov

2009 IEEE International Conference on Data Mining Workshops > 471 - 476

2009 IEEE International Conference on Data Mining Workshops (ICDMW 2009)

In this article, we present a novel statistical representation method for knowledge extraction from a corpus containing short texts. Then we introduce the contrast parameter which could be adjusted for targeting different conceptual levels in text mining and knowledge extraction. The method is based on second order co-occurrence vectors whose efficiency for representing meaning has been established...

chapter

Managing the Google Web 1T 5-gram data set

A. Islam, D. Inkpen

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 5

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

This paper describes how the Google Web 1T 5-gram data set, contributed by Google Inc., can be stored so that it can be used efficiently with respect to time. We present an efficient way of accessing all the 5-grams for a specific word of interest from the stored files. We measure the maximum access and processing efficiency achievable for any word of interest. We also compare results (access time...

chapter

Automatic generation of narrative content for digital games

M.F. Caropreso, D. Inkpen, S. Khan, F. Keshtkar

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 8

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

Interactive simulation games used for training usually require a large amount of coherent narrative content. An effective and efficient solution to the narrative content creation problem is to use Natural Language Generation (NLG) systems. The use of NLG systems, however, requires sophisticated linguistic and sometimes programming knowledge. For this reason, NLG systems are typically not accessible...

chapter

Using sentiment orientation features for mood classification in blogs

F. Keshtkar, D. Inkpen

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 6

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

In this paper we explore the task of mood classification for blog postings. We propose a novel approach that uses the hierarchy of possible moods to achieve better results than a standard machine learning approach. We also show that using sentiment orientation features improves the performance of classification. We used the Livejournal blog corpus as a dataset to train and evaluate our method.

chapter

Real-word spelling correction using Google Web 1T n-gram with backoff

A. Islam, D. Inkpen

2009 International Conference on Natural Language Processing and Knowledge Engineering > 1 - 8

2009 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

We present a method for correcting real-word spelling errors using the Google Web 1T n-gram data set and a normalized and modified version of the longest common subsequence (LCS) string matching algorithm. Our method is focused mainly on how to improve the correction recall (the fraction of errors corrected) while keeping the correction precision (the fraction of suggestions that are correct) as high...

Filter options

Publication date

Set your own date range

Publication type

book (6)
article (1)

Keywords

DATA MINING (5)
CONTEXT (4)
GOOGLE WEB 1T (3)
ACCURACY (2)
CLASSIFICATION ALGORITHMS (2)
INTERNET (2)
N-GRAM (2)
NATURAL LANGUAGE PROCESSING (2)
STATISTICAL ANALYSIS (2)
TEXT ANALYSIS (2)
WEB SITES (2)
5-GRAMS (1)
ABSTRACTS (1)
AUTOMATIC CORRECTION (1)
BEHAVIOURAL SCIENCES COMPUTING (1)
BIRDS (1)
BLOG (1)
BLOG POSTING (1)
BLOGS (1)
BOOK REVIEWS (1)
BRAIN MODELING (1)
CLASSIFICATION (1)
COMPUTER BASED SYSTEMS (1)
COMPUTER BASED TRAINING (1)
COMPUTER GAMES (1)
COMPUTERS (1)
CORRECTION PRECISION (1)
CORRECTION RECALL (1)
DATA HANDLING (1)
DIGITAL GAME (1)
DIGITAL SIMULATION (1)
DISEASE TREATMENT RELATION IDENTIFICATION (1)
DISEASES (1)
ELECTRONIC MAIL (1)
ENGLISH LANGUAGE TEXTS (1)
ENTROPY (1)
EQUATIONS (1)
ERROR CORRECTION (1)
FEATURE EXTRACTION (1)
GAMES (1)
GOOGLE (1)
GOOGLE INCORPORATED (1)
GOOGLE N-GRAM DATA SET (1)
GOOGLE WEB 1T 5-GRAM DATA SET (1)
GOOGLE WEB 1T N-GRAM DATA SET (1)
HEALTHCARE (1)
HIERARCHY (1)
HUMANS (1)
INTERACTIVE SIMULATION GAME (1)
INTERACTIVE SYSTEMS (1)
ISO STANDARDS (1)
JAVA (1)
KNOWLEDGE EXTRACTION (1)
LCS (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LINGUISTIC KNOWLEDGE (1)
LIVEJOURNAL BLOG (1)
LONGEST COMMON SUBSEQUENCE STRING MATCHING ALGORITHM (1)
MACHINE LEARNING (1)
MACHINE LEARNING APPROACH (1)
MATHEMATICAL MODEL (1)
MEDICAL COMPUTING (1)
MEDICAL DECISION SUPPORT (1)
MEDICAL DIAGNOSTIC IMAGING (1)
MEDICAL IMAGING (1)
MEMORY MANAGEMENT (1)
ML (1)
MOOD (1)
MOOD CLASSIFICATION (1)
N-GRAMS (1)
NARRATIVE CONTENT AUTOMATIC GENERATION (1)
NARRATIVE CONTENT CREATION PROBLEM (1)
NATURAL LANGUAGE GENERATION SYSTEM (1)
NATURAL LANGUAGE PROCESSING. (1)
NATURAL LANGUAGES (1)
NLG SYSTEM (1)
PARAMETERIZED CONTRAST (1)
PATIENT MANAGEMENT CARE (1)
PREPOSITION ERROR CORRECTION (1)
PREPOSITION ERRORS (1)
PROBABILITY (1)
PROBABILITY DENSITY FUNCTION (1)
PROGRAMMING (1)
PROTEIN-PROTEIN INTERACTION (1)
REAL-WORD (1)
REAL-WORD SPELLING ERROR CORRECTION (1)
SEARCH ENGINES (1)
SECOND ORDER CO-OCCURRENCE VECTORS (1)
SECOND ORDER SOFT CO-OCCURRENCES (1)
SEMANTICS (1)
SENTIMENT ORIENTATION (1)
SHORT TEXTS (1)
SPEECH (1)
SPELLING AIDS (1)
SPELLING CORRECTION (1)
STATISTICAL REPRESENTATION METHOD (1)
STRING MATCHING (1)
SUPPORT VECTOR MACHINES (1)
SYSTEMATICS (1)
TEXT MINING (1)
more

INFONA - science communication portal

Search results for: D. Inkpen

A Machine Learning Approach for Identifying Disease-Treatment Relations in Short Texts

An unsupervised approach to preposition error correction

Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction

Managing the Google Web 1T 5-gram data set

Automatic generation of narrative content for digital games

Using sentiment orientation features for mood classification in blogs

Real-word spelling correction using Google Web 1T n-gram with backoff

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options