Search results

Items from 1 to 20 out of 74 results

chapter

Data-Driven Application Maintenance: Experience from the Trenches

Janardan Misra, Shubhashis Sengupta, Divya Rawat, Milind Savagaonkar, more

2017 IEEE/ACM 4th International Workshop on Software Engineering Research and Industrial Practice (SER&IP) > 48 - 54

2017 IEEE/ACM 4th International Workshop on Software Engineering Research and Industrial Practice (SER&IP)

In this paper we present our experience during design, development, and pilot deployments of a data-driven machine learning based application maintenance solution. We implemented a proof of concept to address a spectrum of interrelated problems encountered in application maintenance projects including duplicate incident ticket identification, assignee recommendation, theme mining, and mapping of incidents...

chapter

A new approach for multi-pattern string matching in large text corpora

Ehsan Sherkat, Mojgan Farhoodi, Alireza Yari

7'th International Symposium on Telecommunications (IST'2014) > 72 - 77

2014 7th International Symposium on Telecommunications (IST)

Multi-pattern string matching with large set of patterns is nowadays a key issue in various text retrieval applications. Filtering undesirable URLs, Finding quotes from famous holy books texts, extracting specific patterns from DNA sequences, Antivirus scanning, intrusion detection or even music retrieval are some applications of multi-pattern string matching. As the size of corpora and the number...

chapter

A dynamic adjustment algorithm research of sentiment word weight based on context

Xu Ye-qiang, Zhu Yan-hui, Wang Wen-hua, Gao Li-chun

2011 3rd International Conference on Computer Research and Development > 3 > 19 - 22

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

The emotion tendency of sentiment word is divided into two types: static emotion tendency and dynamic emotion tendency. Basic semantic lexicon is static emotion tendency, in the real context, but it is different between static emotion tendency and dynamic emotion tendency. The paper proposes a method based on degree lexicon, negative lexicon and dependence relationship of sentence elements. The experimental...

chapter

SVM-based knowledge topic identification toward the autonomous knowledge acquisition

Keedong Yoo

2011 IEEE 9th International Symposium on Applied Machine Intelligence and Informatics (SAMI) > 149 - 154

2011 IEEE 9th International Symposium on Applied Machine Intelligence and Informatics (SAMI)

One of the most serious problems that conventional knowledge management (KM) encompasses has been pointed out tardy and ineffective acquisition of knowledge. To resolve this problem, knowledge must be autonomously acquired according to its context of use by applying the technique of keyword extraction in machine learning algorithm-based text mining. Once the topic of the given knowledge can be identified...

chapter

Co-occurrence based predictors for estimating query difficulty

H Imran, A Sharan

2010 IEEE International Conference on Data Mining Workshops > 867 - 874

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

Query difficulty prediction aims to identify, in advance, how reliably an information retrieval system will perform when faced with a particular user request. The prediction of query difficulty level is an interesting and important issue in Information Retrieval (IR) and is still an open research. In order to appreciate importance of query difficulty prediction we present an example., Information...

chapter

A Medical Knowledge Based Postprocessing Approach for Doctor's Handwriting Recognition

Qi Chen, Tianxia Gong, Linlin Li, Chew Lim Tan, more

2010 12th International Conference on Frontiers in Handwriting Recognition > 45 - 50

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

In this paper, we propose a novel post processing approach for on-line handwriting recognition. Differing from the existing linguistic knowledge-based methods, we make use of domain specific knowledge to improve the performance of recognition. Our system recognizes doctor's handwriting which often poses great challenges in readability, and then enhances the quality of recognized text by analyzing...

chapter

Exploring Social Contexts along the Time Dimension: Temporal Analysis of Named Entities

Brett Walenz, Robin Gandhi, William Mahoney, Quiming Zhu

2010 IEEE Second International Conference on Social Computing > 508 - 512

2010 IEEE Second International Conference on Social Computing (SocialCom 2010). the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

Exploring the evolution of social contexts with time can provide unique insights into human social dynamics. Several social contexts and relationships can be mined from unstructured text articles that describe social phenomena. In contrast to structured graphs of social networks, named entity recognition is a task that attempts to classify elements in unstructured textual items into predefined categories,...

chapter

Classification of Short Text Comments by Sentiment and Actionability for VoiceYourView

William Simm, Maria-Angela Ferrario, Scott Piao, Jon Whittle, more

2010 IEEE Second International Conference on Social Computing > 552 - 557

2010 IEEE Second International Conference on Social Computing (SocialCom 2010). the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

Much has been documented in the literature on sentiment analysis and document summarisation. Much of this applies to long structured text in the form of documents and blog posts. With a shift in social media towards short commentary (see Facebook status updates and twitter tweets), the difference in comment structure may affect the accuracy of sentiment analysis techniques. From our VoiceYourView...

chapter

Acronym extraction using SVM with Uneven Margins

Weijian Ni, Jun Xu, Yalou Huang, Tong Liu, more

2010 IEEE 2nd Symposium on Web Society > 132 - 138

2010 IEEE 2nd Symposium on Web Society (SWS 2010)

Extracting acronyms and their expansions from plain text is an important problem in text mining. Previous research shows that the problem can be solved via machine learning approaches. That is, converting the problem of acronym extraction to binary classification. We investigate the classification problem and find that the classes are highly unbalanced (the positive instances are very rare compared...

chapter

Semantic Hierarchical Document Signature for determining sentence similarity

Sukanya Manna, Tom Gedeon

International Conference on Fuzzy Systems > 1 - 8

2010 IEEE International Conference on Fuzzy Systems

In this paper, we present a new approach that incorporates semantic information from a document, in the form of Hierarchical Document Signature (HDS), to measure semantic similarity between sentences. Due to variability of expressions of natural language, it is very essential to exploit the semantic properties of a document to accurately identify semantically similar sentences since sentences conveying...

chapter

OSS developers context-specific Preferred Representational systems: A initial Neurolinguistic text analysis of the Apache mailing list

Methanias Colao Junior, Manoel Mendona, Mario Farias, Paulo Henrique

2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010) > 126 - 129

2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010)

Open Source Software (OSS) mailing lists are used by developers to discuss software engineering tasks performed in the project. In the last years, researchers have been conducting mailing lists linguistic analyses for understanding the intricacies of OSS development. An unpublished approach for that is to use NeuroLinguistic Theory (NT). NT postulates the use of a Preferred Representational cognitive...

chapter

Trend Ontology for Knowledge-Based Trend Mining in Textual Information

Olga Streibel, Malgorzata Mochol

2010 Seventh International Conference on Information Technology: New Generations > 1285 - 1288

Seventh International Conference on Information Technology: New Generations (ITNG 2010)

Providing ontologies for the automatic trend detection enhance the quality of trend predictions. However, in the case of dynamic and fuzzy expert knowledge like the knowledge used in trend detection, it is difficult to formalize knowledge unambiguously and in a static way. In this paper we report on our experiences in modeling and formalizing trend ontology for automatic knowledge-based trend detection...

chapter

Monitoring conceptual development with text mining technologies: CONSPECT

F Wild, D Haley, K Bulow

eChallenges e-2010 Conference > 1 - 8

2010 eChallenges e-2010

This paper evaluates CONSPECT, a service that analyses states in a learner's conceptual development. It combines two technologies - Latent Semantic Analysis to analyse text and Network Analysis (NA) to provide visualisations - into a technique called Meaningful Interaction Analysis (MIA). CONSPECT was designed to help both online learners and their tutors monitor their conceptual development. This...

chapter

Morpheme-based product features categorization in Chinese reviews mining

Shu Zhang, Wenjie Jia, Yingju Xia, Yao Meng, more

2010 6th International Conference on Advanced Information Management and Service (IMS) > 324 - 329

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

Pursuing on the analysis of product reviews, an unsupervised product features categorization method is proposed. Morphemes as smallest linguistic meaningful unit are induced in measuring the intra relationship among product features instead of words. Opinion words around product features are chosen to represent the inter relationship among product features instead of full context information. The...

chapter

A framework for opinion question answering

Peng Jiang, Hongping Fu, Chunxia Zhang, Zhendong Niu

2010 6th International Conference on Advanced Information Management and Service (IMS) > 424 - 427

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

Question answering is a useful task to help people seek the knowledge of what they want to know. Previous study mainly focuses on factoid question answering, which serves the needs to answer factual questions. Due to rapidly increasing scale of user generated contents on the Web, people are more interested in opinion questions that can reflect others' opinions. In this paper, we propose a framework...

chapter

Computing of related concepts in a context

V. Rockai

2010 IEEE 8th International Symposium on Applied Machine Intelligence and Informatics (SAMI) > 341 - 345

2010 IEEE 8th International Symposium on Applied Machine Intelligence and Informatics (SAMI 2010)

Word sense disambiguation is an opened issue in the text mining and natural language processing for some time. Automatic acquisition of all distinct senses for polysemy words is still a big problem in the computer science. This paper discusses an approach to generate related words for an input word in some context. The context is used for the filtering of the related words for their distinct sense...

chapter

Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction

A.H. Razavi, S. Matwin, D. Inkpen, A. Kouznetsov

2009 IEEE International Conference on Data Mining Workshops > 471 - 476

2009 IEEE International Conference on Data Mining Workshops (ICDMW 2009)

In this article, we present a novel statistical representation method for knowledge extraction from a corpus containing short texts. Then we introduce the contrast parameter which could be adjusted for targeting different conceptual levels in text mining and knowledge extraction. The method is based on second order co-occurrence vectors whose efficiency for representing meaning has been established...

chapter

Automatic extraction of events from Textual Requirements specification

S.K. Singh, R. Gupta, S. Sabharwal, J.P. Gupta

2009 World Congress on Nature&Biologically Inspired Computing (NaBIC) > 415 - 420

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC 2009)

Events give important information about the behavior of a system in a summarized form. In the past, events have played an important role in breaking the functional requirements of the system in the ??event partitioning approach??. Our previous work has shown that events can be a starting point in object-oriented analysis of requirements. Every event triggers a use case in the system, hence should...

chapter

Improving Topic Extraction in Chinese Documents Using Word Sense Disambiguation

Hongyan Song, Tianfang Yao

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC) > 1106 - 1109

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC 2009)

This paper reports experiments on topic extraction in Chinese documents using a feature set enriched with Word Sense Disambiguation (WSD) as semantic information. The results of these experiments suggest that incorporating WSD information into Chinese topic extraction tasks may yield improvements over models which do not use WSD information.

chapter

Unsupervised Relation Extraction by Massive Clustering

E. Gonzalez, J. Turmo

2009 Ninth IEEE International Conference on Data Mining > 782 - 787

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

The goal of Information Extraction is to automatically generate structured pieces of information from the relevant information contained in text documents. Machine Learning techniques have been applied to reduce the cost of Information Extraction system adaptation. However, elements of human supervision strongly bias the learning process. Unsupervised learning approaches can avoid these biases. In...

Data set:
ieee
Keywords:
DATA MINING
CONTEXT
TEXT ANALYSIS

Publication date

Set your own date range

INFONA - science communication portal

Search results

Data-Driven Application Maintenance: Experience from the Trenches

A new approach for multi-pattern string matching in large text corpora

A dynamic adjustment algorithm research of sentiment word weight based on context

SVM-based knowledge topic identification toward the autonomous knowledge acquisition

Co-occurrence based predictors for estimating query difficulty

A Medical Knowledge Based Postprocessing Approach for Doctor's Handwriting Recognition

Exploring Social Contexts along the Time Dimension: Temporal Analysis of Named Entities

Classification of Short Text Comments by Sentiment and Actionability for VoiceYourView

Acronym extraction using SVM with Uneven Margins

Semantic Hierarchical Document Signature for determining sentence similarity

OSS developers context-specific Preferred Representational systems: A initial Neurolinguistic text analysis of the Apache mailing list

Trend Ontology for Knowledge-Based Trend Mining in Textual Information

Monitoring conceptual development with text mining technologies: CONSPECT

Morpheme-based product features categorization in Chinese reviews mining

A framework for opinion question answering

Computing of related concepts in a context

Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction

Automatic extraction of events from Textual Requirements specification

Improving Topic Extraction in Chinese Documents Using Word Sense Disambiguation

Unsupervised Relation Extraction by Massive Clustering

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options