Knowledge Mining

chapter

Front Matter

Studies in Fuzziness and Soft Computing > Knowledge Mining > I-VIII

chapter

Knowledge Mining: A Quantitative Synthesis of Research Results and Findings

Penelope Markellou, Maria Rigou, Spiros Sirmakessis

Studies in Fuzziness and Soft Computing > Knowledge Mining > 1-11

Knowledge mining emerged as a rapidly growing interdisciplinary field that merges together databases, statistics, machine learning and related areas in order to extract valuable information and knowledge in large volumes of data. In this paper we present the key finding of the results achieved during the NEMIS Conference on “Knowledge Mining”.

chapter

An Evidential Approach to Classification Combination for Text Categorisation

D. A. Bell, J. W. Guan, Y. X. Bi

Studies in Fuzziness and Soft Computing > Knowledge Mining > 13-22

In this paper we look at a way of combining two or more different classification methods for text categorization. The specific methods we have been experimenting with in our group include the Support Vector Machine, kNN (nearest neighbours), kNN model-based approach (kNNM), and Rocchio methods. Then we describe our method for combining the classifiers. A previous study suggested that the combination...

chapter

Visualization Techniques for Non Symmetrical Relations

Simona Balbi, Michelangelo Misuraca

Studies in Fuzziness and Soft Computing > Knowledge Mining > 23-29

Many strategies of Text Retrieval are based on Latent Semantic Indexing and its variations, by considering different weighting systems for words and documents. Correspondence Analysis and L.S.I. share the basic algebraic tool, i.e. the Singular Value Decomposition and its generalisation, related to the use of a different way for measuring the importance of each element, both in determining and representing...

chapter

Understanding Text Mining: A Pragmatic Approach

Sergio Bolasco, Alessio Canzonetti, Federico M. Capo, Francesca Ratta-Rinaldi, more

Studies in Fuzziness and Soft Computing > Knowledge Mining > 31-50

In order to delineate the state of the art of the main TM applications a two-step strategy has been pursued: first of all, some of the main European and Italian companies offering TM solutions were contacted, in order to collect information on the characteristics of the applications; secondly, a detailed search on the web was made to collect further information about users or developers and applications...

chapter

Novel Approaches to Unsupervised Clustering Through k-Windows Algorithm

D. K. Tasoulis, M. N. Vrahatis

Studies in Fuzziness and Soft Computing > Knowledge Mining > 51-77

Summary The extraction of meaningful information from large collections of data is a fundamental issues in science. To this end, clustering algorithms are typically employed to identify groups (clusters) of similar objects. A critical issue for any clustering algorithm is the determination of the number of clusters present in a dataset. In this contribution we present a clustering algorithm that in...

chapter

Semiometric Approach, Qualitative Research and Text Mining Techniques for Modelling the Material Culture of Happiness

Furio Camillo, Melissa Tosi, Tiziana Traldi

Studies in Fuzziness and Soft Computing > Knowledge Mining > 79-92

Drawing from a recent ethnographic research on Happiness carried throughout 8 European countries in the 2003/4, Future Concept Lab will illustrate how the use of interactive digital material can be relevant to analyse qualitative and quantitative data in a participatory and creative manner. Our speech will focus on the additional value of presenting data in an interactive and flexible way by using...

chapter

Semantic Distances for Sets of Senses and Applications in Word Sense Disambiguation

Dimitrios Mavroeidis, George Tsatsaronis, Michalis Vazirgiannis

Studies in Fuzziness and Soft Computing > Knowledge Mining > 93-107

There has been an increasing interest both from the Information Retrieval community and the Data Mining community in investigating possible advantages of using Word Sense Disambiguation (WSD) for enhancing semantic information in the Information Retrieval and Data Mining process. Although contradictory results have been reported, there are strong indications that the use of WSD can contribute to the...

chapter

A Strategic Roadmap for Text Mining

Georgia Panagopoulou

Studies in Fuzziness and Soft Computing > Knowledge Mining > 109-122

A roadmap is typically a time-based plan that defines the present state, the state we want to reach and the way to achieve it. This includes identification of exact goals and the development of different routes for achieving them. In addition, it provides guidance to focus on the critical issues that are needed in order to meet these objectives. The roadmap of NEMIS aims at preparing the ground for...

chapter

Text Mining Applied to Multilingual Corpora

Federico Neri, Remo Raffaelli

Studies in Fuzziness and Soft Computing > Knowledge Mining > 123-131

Up to 80% of electronic data is textual and most valuable information is often encoded in pages which are neither structured, nor classified. Documents are — and will be — written in various native languages, but these documents are relevant even to non-native speakers. Nowadays everyone experiences a mounting frustration in the attempt of finding the information of interest, wading through thousands...

chapter

Content Annotation for the Semantic Web

Thierry Poibeau

Studies in Fuzziness and Soft Computing > Knowledge Mining > 133-145

This paper is intended to show how an Information extraction system can be recycled to produce RDF schemas for the semantic web [1]. We demonstrate that this kind of systems must respect operational constraints like the fact that the information produced must be highly relevant (high precision, possibly bad recall). The production of explicit structured data on the web will lead a better relevance...

chapter

An Open Platform for Collecting Domain Specific Web Pages and Extracting Information from Them

Vangelis Karkaletsis, Constantine D. Spyropoulos

Studies in Fuzziness and Soft Computing > Knowledge Mining > 147-157

The paper presents a platform that facilitates the use of tools for collecting domain specific web pages as well as for extracting information from them. It also supports the configuration of such tools to new domains and languages. The platform provides a user friendly interface through which the user can specify the domain specific resources (ontology, lexica, corpora for the training and testing...

chapter

Extraction of the Useful Words from a Decisional Corpus. Contribution of Correspondence Analysis

Mónica Bécue-Bertaut, Martin Rajman, Ludovic Lebart, Eric Gaussier

Studies in Fuzziness and Soft Computing > Knowledge Mining > 159-179

In the framework of the JuriSent case study, carried out within the European NEMIS thematic network, we analyze the contribution of text mining techniques to improve the consultation of jurisprudence textual databases. We mainly focus on correspondence analysis (CA) techniques, but also provide some insights on similar visualization techniques, such as self organizing maps (Kohonen maps), and review...

chapter

Collective SME Approach to Technology Watch and Competitive Intelligence: The Role of Intermediate Centers

Jorge Izquierdo, Sergio Larreina

Studies in Fuzziness and Soft Computing > Knowledge Mining > 181-189

It has been demonstrated that Technology Watch (TW) and Competitive Intelligence (CI) are important tools for the development of R&D activities and the enhancement of competitiveness in enterprises. TW activities are able to detect opportunities and threats at an early stage and facilitate the information in to decide and carry out the appropriate strategies. The base of TW is the process of search,...

chapter

New Challenges and Roles of Metadata in Text/Data Mining in Statistics

Dušan Šoltés

Studies in Fuzziness and Soft Computing > Knowledge Mining > 191-199

The paper deals with the new challenges and the roles of metada and metainformation in the area of text/data mining in the area of statistics. In the first part, the paper is presenting some basic characteristics of the contemporary statistical information systems from the point of view of the needs for utilization of metadata and data/text mining. As it is well known, modern statistical systems are...

chapter

Using Text Mining in Official Statistics

Alf Fyhrlund, Bert Fridlund, Bo Sundgren

Studies in Fuzziness and Soft Computing > Knowledge Mining > 201-211

There is a tremendous increase in the number of actors in the statistical arena in terms of producers, distributors, and users due to the new options of the web technology. These actors are not sufficiently informed about the technological progress made in the field of text mining and the ways in which they can benefit from these. The NEMIS project, and especially its Working Group 5, aims to identify...

chapter

Combining Text Mining and Information Retrieval Techniques for Enhanced Access to Statistical Data on the Web: A Preliminary Report

Martin Rajman, Martin Vesely

Studies in Fuzziness and Soft Computing > Knowledge Mining > 213-222

In this contribution, we present the StatSearch prototype, a search engine that enables an enhanced access to domain specific data available on the Web. The StatSearch engine proposes a hybrid search interface combining query-based search with automated navigation through a tree-like hierarchical structure. The goal of such an interface is to allow a more natural and intuitive control over the information...

chapter

Comparative Study of Text Mining Tools

Antoine Spinakis, Asanoula Chatzimakri

Studies in Fuzziness and Soft Computing > Knowledge Mining > 223-232

In this paper is presented the overall process and the basic conclusions of a comparison study, which was applied in the framework of NEMIS project regarding text mining tools. The basic stages of the overall comparison process are described, together with the specified evaluation criteria. Finally, the main conclusions of the particular study constitute the last chapter of the paper.

chapter

Some Industrial Applications of Text Mining

Bernd Drewes

Studies in Fuzziness and Soft Computing > Knowledge Mining > 233-238

Three industrial applications of text mining will be presented requiring different methodologies. The first application used a classification approach in order filter documents relevant for personal profiles from an underlying document collection. The second application combines cluster analysis with statistical trend analysis in order detect emerging issues in manufacturing. In the third application...

chapter

Using Text Mining Tools for Event Data Analysis

Theoni Stathopoulou

Studies in Fuzziness and Soft Computing > Knowledge Mining > 239-253

This paper concerns itself with the analysis of event data with text mining tools. The methodological approaches to event data analysis are presented, and an analysis is performed using SPAD Software and SAS Text Miner. Finally, some conclusions are drawn concerning the use of text mining tools for event data analysis.

INFONA - science communication portal

Knowledge Mining
Proceedings of the NEMIS 2004 Final Conference

Front Matter

Knowledge Mining: A Quantitative Synthesis of Research Results and Findings

An Evidential Approach to Classification Combination for Text Categorisation

Visualization Techniques for Non Symmetrical Relations

Understanding Text Mining: A Pragmatic Approach

Novel Approaches to Unsupervised Clustering Through k-Windows Algorithm

Semiometric Approach, Qualitative Research and Text Mining Techniques for Modelling the Material Culture of Happiness

Semantic Distances for Sets of Senses and Applications in Word Sense Disambiguation

A Strategic Roadmap for Text Mining

Text Mining Applied to Multilingual Corpora

Content Annotation for the Semantic Web

An Open Platform for Collecting Domain Specific Web Pages and Extracting Information from Them

Extraction of the Useful Words from a Decisional Corpus. Contribution of Correspondence Analysis

Collective SME Approach to Technology Watch and Competitive Intelligence: The Role of Intermediate Centers

New Challenges and Roles of Metadata in Text/Data Mining in Statistics

Using Text Mining in Official Statistics

Combining Text Mining and Information Retrieval Techniques for Enhanced Access to Statistical Data on the Web: A Preliminary Report

Comparative Study of Text Mining Tools

Some Industrial Applications of Text Mining

Using Text Mining Tools for Event Data Analysis

Filter options

Publication date

Publication language

INFONA - science communication portal

Knowledge Mining Proceedings of the NEMIS 2004 Final Conference $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication language

Reporting an error / abuse

Sending the report failed

Accessibility options

Knowledge Mining
Proceedings of the NEMIS 2004 Final Conference