With the rapid development of database technology, categorizing datasets has become very important for discovering information. Decision tree classification provides a rapid and effective method of categorizing datasets. Although many algorithmic methods exist for optimizing decision tree structure, they can be vulnerable to changes in the training dataset. In this paper, an evolutionary method is presented,...
Spoken dialog systems have been proposed that utilize a simple database consisting of example sentences and the corresponding reply sentences. However, preparing this database manually is costly. In the present study, we propose a framework in which both the example and reply sentences are automatically generated from a database description table that describes the minimum information for describing...
Wi-Fi fingerprinting is a technique that can provide location in GPS-denied environments, relying exclusively on Wi-Fi signals. It first requires the construction of a database of “fingerprints”, i.e. signal strengths from different access points (APs) at different reference points in the desired coverage area. The location of the device is then obtained by measuring the signal strengths at its location,...
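The matching step this abstract describes can be sketched as a nearest-neighbor search in signal space. The following is a minimal illustration only; the reference points, AP names, and RSSI values are invented for the example and do not come from the paper.

```python
import math

# Hypothetical offline fingerprint database: reference point -> RSSI (dBm) per AP.
# All coordinates and signal values below are illustrative assumptions.
FINGERPRINTS = {
    (0.0, 0.0): {"ap1": -40, "ap2": -70, "ap3": -62},
    (5.0, 0.0): {"ap1": -55, "ap2": -58, "ap3": -71},
    (0.0, 5.0): {"ap1": -68, "ap2": -45, "ap3": -60},
}

def locate(measured, db=FINGERPRINTS, missing=-100.0):
    """Return the reference point whose stored fingerprint is closest
    (Euclidean distance in RSSI space) to the measured signal strengths."""
    def distance(fp):
        aps = set(fp) | set(measured)  # APs seen in either measurement
        return math.sqrt(sum((fp.get(ap, missing) - measured.get(ap, missing)) ** 2
                             for ap in aps))
    return min(db, key=lambda point: distance(db[point]))

print(locate({"ap1": -42, "ap2": -69, "ap3": -64}))  # → (0.0, 0.0)
```

Real systems typically refine this with k-nearest-neighbor averaging or probabilistic matching, but the core idea is the same comparison of measured signal strengths against the stored database.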
Tabu Search is a meta-heuristic approach successfully used to address optimization problems in several contexts. This paper reports the results of an empirical study carried out to investigate the effectiveness of Tabu Search in estimating Web application development effort. The dataset employed in this investigation is part of the Tukutuku database. This database has been used in several studies...
Computational approaches have been applied in many different biology application domains. When such tools are based on conventional computation, they have shown limitations in approaching complex biological problems. In the present study, a computational evolutionary environment (CEE) is proposed as a tool to extract classification rules from biological datasets. The main goal of the proposed approach...
Condition-based maintenance (CBM) is a maintenance approach wherein equipment repair or replacement decisions are based on the current and projected health of the equipment, measured by periodic collection and analysis of data. In this context, the accuracy of data is vital. Unfortunately, missing and inaccurate data are recurring problems in many CBM databases. These problems can cause bias or lead...
In order to promote the automation and intelligence of e-government archiving, this article starts from the e-government archiving research background to propose the DURA model, and discusses its component parts and their interactions. It then analyzes an essential factor: the origin, connotation, and integrated development environment of business rules (R) and the work...
Discrimination of machine-printed and hand-written words is deemed a major problem in the recognition of mixed texts. The objective of this study is to present a new method for distinguishing machine-printed words from hand-written words using a novel statistical feature based on legibility and a discriminator threshold. Because of hand trembling, sudden uncontrollable movements of the hand, and...
To address the problem of missed and false retrievals in technical article retrieval platforms, this paper presents a new retrieval method for forestry technical articles. A specialized vocabulary database is constructed to store all the keywords for forestry journals in Chinese. The retrieval method first retrieves the keywords from the different word stocks and gets all their synonyms,...
Inferring PoP level maps is gaining interest due to its importance in many areas, e.g., tracking the Internet's evolution and studying its properties. In this paper we introduce a novel structural approach to automatically generate large-scale PoP level maps using traceroute measurements from multiple locations. The PoPs are first identified based on their structure, and then assigned a location...
Geographic location and Grid computing are two areas that have taken off in recent years, both receiving a lot of attention from the research community. Grid Resource Brokers, which try to find the best match between job requirements and the resources available on the Grid, can benefit from knowing the geographic location of clients, considerably improving their decision-making...
Recently, accidents caused by external corrosion of pipes have frequently occurred in chemical and petrochemical plants. It is very difficult to detect the points where corrosion has considerably developed because the pipes are usually covered with insulation and cannot be inspected directly. Moreover, the total length of the pipes exceeds 1000 km, so inspecting them requires enormous time and cost...
Result validation is an important phase of the guidance simulation VV&A process, whose main purpose is testing the consistency of simulation data with flight data or other standard data. Result validation mainly involves data analysis and calculation activities; therefore, developing a result validation tool (RVT) that uses computer computation instead of manual calculation can considerably increase...
In order to improve the efficiency of database management and the intelligence of database record classification, a classification method for network database records based on fuzzy theory is proposed in this paper. First, an automatic classification framework for the database is constructed, and then a standard record model, a special data record model, and a new record model on fuzzy sets are given. By calculating the...
In unit selection based text-to-speech (TTS) synthesis, the accurate position of the unit boundaries in the unit selection database is one of the factors that determine the quality of the synthesized speech. To ensure the accuracy of the boundary positions, developers often have to manually verify the speech boundaries that are generated by automatic speech recognition techniques. In order to reduce...
Biometric identification has lately attracted attention because of its high convenience: it does not require a user to enter a user ID. The identification accuracy, however, degrades as the number of enrollees increases. Although many multimodal biometric techniques have been proposed to improve identification accuracy, they require the user to input multiple biometric samples and make the...
Many organizations collect large amounts of data to support their business and decision-making processes. The data collected from various sources may have data quality problems. These issues become prominent when various databases are integrated. The integrated databases inherit the data quality problems that were present in the source databases. The data in the integrated systems need...
An improved K-medoids clustering algorithm (IKMC) is proposed in this paper to resolve the problem of detecting near-duplicate records. It treats every record in the database as a separate data object, uses the edit-distance method and attribute weights to compute similarity values among records, and then detects duplicate records by clustering these similarity values. This algorithm can automatically...
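The weighted edit-distance similarity this abstract describes can be sketched as follows. This is an illustration under assumptions: the records, attribute weights, and threshold are invented, difflib's ratio stands in for a normalized edit-distance similarity, and a simple pairwise threshold stands in for the paper's K-medoids clustering step.

```python
from difflib import SequenceMatcher

# Illustrative attribute weights (assumed, not from the paper).
WEIGHTS = {"name": 0.6, "city": 0.4}

def field_sim(a, b):
    # Normalized string similarity in [0, 1]; a stand-in for edit distance.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def record_sim(r1, r2, weights=WEIGHTS):
    """Attribute-weighted similarity of two records, in [0, 1]."""
    return sum(w * field_sim(r1[f], r2[f]) for f, w in weights.items())

def find_duplicates(records, threshold=0.85):
    """Return index pairs of records whose weighted similarity
    exceeds the threshold (pairwise stand-in for clustering)."""
    dupes = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            if record_sim(records[i], records[j]) >= threshold:
                dupes.append((i, j))
    return dupes

records = [
    {"name": "John Smith", "city": "Boston"},
    {"name": "Jon Smith",  "city": "Boston"},
    {"name": "Mary Jones", "city": "Denver"},
]
print(find_duplicates(records))  # → [(0, 1)]
```

Clustering the similarity values, as the paper does, scales better than exhaustive pairwise comparison on large databases; the thresholded pairwise form above only conveys the similarity measure itself.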
In this paper we present a report on the data quality of a medical database containing clinical and administrative data from hospitals and private clinics in the Bologna district area. In particular, we have analyzed the database according to several data quality dimensions, identifying a number of issues (e.g., inaccuracies and incompleteness) that we have systematized and described in this work.
The information about which modules in a software system's future version are potentially defective is a valuable aid for quality managers and testers. Defect prediction promises to indicate these defect-prone modules. Constructing effective defect prediction models in an industrial setting involves deciding from which data source the defect predictors should be derived. In this paper we compare...