Search results

Items from 1 to 9 out of 9 results

chapter

Encoding Arabic rhetorical structure: A methodology for the extraction of Arabic lexical information from TEI-encoded classical sources

Simona Olivieri, Ivana Pepe, Ilaria Cicola

2016 4th IEEE International Colloquium on Information Science and Technology (CiSt) > 390 - 394

Text encoding is considered the most functional starting point for storing and retrieving data, with trees of information and lists of concordances as its first immediate results, but a wide range of further results opens up once a complete encoding process is accomplished. The three case studies described in this paper are meant to give an overall view of the preliminary steps of a wider project...

chapter

A New Method of K-Means Clustering Algorithm with Events Based on Variable Time Granularity

Mengxing Huang, Hongjing Lin

2016 13th Web Information Systems and Applications Conference (WISA) > 41 - 44

Based on the characteristics of Weibo events, this paper analyzes the advantages and disadvantages of the traditional K-means algorithm and proposes a K-means clustering algorithm for events based on variable time granularity. The experiments show that the improved algorithm is better suited to clustering analysis of Weibo events, improves the efficiency of the clustering algorithm, and solves the...

chapter

Data File Layout Inference Using Content-Based Oracles

Reid A. Phillips, Wing-Ning Li, Craig Thompson, Wesley Deneke

2013 IEEE 16th International Conference on Computational Science and Engineering > 1029 - 1035

2013 IEEE 16th International Conference on Computational Science and Engineering (CSE)

Data file layout inference refers to the problem of identifying the organizational characteristics associated with a structured text file, where every record in a text file shares the same structural properties. These properties include: character encoding, record length, field length (indicated by delimiting characters or fixed length), field position, and field semantic content. Within this paper,...

chapter

Linked data for humanities research — The SPQR experiment

Tobias Blanke, Gabriel Bodard, Michael Bryant, Stuart Dunn, et al.

2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST) > 1 - 6

2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST 2012) - Complex Environment Engineering

Ancient texts represent a primary source for research in the classics. A substantial body of digital material has evolved enriching these texts. Unfortunately these data are often distributed across myriad locations, stored in diverse and incompatible formats and are either not available online or are made available only in isolation. This paper describes an investigation into using linked data principles...

chapter

An XML C Source Code Interchange Format for CASE Tools

Noritoshi Atsumi, Takashi Kobayashi, Shinichiro Yamamoto, Kiyoshi Agusa

2011 IEEE 35th Annual Computer Software and Applications Conference > 498 - 503

2011 IEEE 35th Annual Computer Software and Applications Conference - COMPSAC 2011

We propose an XML C source code representation to support developing CASE tools. Since source code is a main artifact of software development, most CASE tools have features related to source code, such as an editor, static analyzer, or profiler. To develop such tools, detailed information about the source code is needed. However, it is quite difficult to reuse program analysis features because they do...

chapter

Top-k keyword search over probabilistic XML data

Jianxin Li, Chengfei Liu, Rui Zhou, Wei Wang

2011 IEEE 27th International Conference on Data Engineering > 673 - 684

2011 27th IEEE International Conference on Data Engineering (ICDE 2011)

Despite the proliferation of work on XML keyword queries, supporting keyword queries over probabilistic XML data remains an open problem. Compared with traditional keyword search, answering a keyword query over probabilistic XML data is far more expensive because possible world semantics must be considered. In this paper, we first define the new problem of studying top-k keyword search over probabilistic...

chapter

Identification of opinions in Arabic newspapers

Farek Lazhar, Tlili Guiassa Yamina

2010 International Conference on Machine and Web Intelligence > 317 - 319

International Conference on Machine and Web Intelligence (ICMWI 2010)

Opinion identification is a set of techniques within natural language processing, particularly in the area of information retrieval. It consists of developing systems able to extract and explore the opinions present in corpora. The large volume of Arabic newspaper text available in electronic format requires a dedicated exploration technique. We intend to present in this...

chapter

A GML compression approach based on on-line semantic clustering

Qingting Wei, Jihong Guan

2010 18th International Conference on Geoinformatics > 1 - 7

Geography Markup Language (GML) has become a de facto international encoding standard for exchanging geospatial data among heterogeneous Geographic Information Systems (GIS). However, structurally redundant tags and textual data representation usually inflate the sizes of GML documents substantially, which makes storage and transport costly. In this paper, we propose an effective compression approach...

chapter

Supporting top-K keyword search in XML databases

Liang Jeff Chen, Yannis Papakonstantinou

2010 IEEE 26th International Conference on Data Engineering (ICDE 2010) > 689 - 700

Keyword search is considered to be an effective information discovery method for both structured and semi-structured data. In XML keyword search, query semantics is based on the concept of Lowest Common Ancestor (LCA). However, naive LCA-based semantics leads to exponential computation and result size. In the literature, LCA-based semantic variants (e.g., ELCA and SLCA) were proposed, which define...

Filter options

Keywords:
ENCODING
SEMANTICS

Keywords

DATA MINING (4)
KEYWORD SEARCH (3)
SYNTACTICS (3)
ALGORITHM DESIGN AND ANALYSIS (2)
DICTIONARIES (2)
TEXT ANALYSIS (2)
TOP-K KEYWORD SEARCH (2)
ANNOTATION (1)
ARABIC CORPUS LINGUISTICS (1)
ARABIC LANGUAGE (1)
ARABIC NEWSPAPERS (1)
BROWSERS (1)
CASE TOOL PROGRAM UNDERSTANDING (1)
CLUSTERING (1)
CLUSTERING ALGORITHMS (1)
CODING CHECKER (1)
COMBINATORIC APPROACH (1)
COMPUTATIONAL MODELING (1)
COMPUTER AIDED SOFTWARE ENGINEERING (1)
CONTAINERS (1)
CONTENT TYPE (1)
CONTEXT (1)
DATA COMPRESSION (1)
DATA INTEGRATION (1)
DATA MODELS (1)
DATA STRUCTURES (1)
DE FACTO INTERNATIONAL ENCODING STANDARD (1)
DELTA ENCODING (1)
DELTA-ENCODING GEOMETRIC COORDINATE DATA (1)
DICTIONARY-ENCODING STRUCTURES (1)
DIGITAL HUMANITIES (1)
DOMAIN-SPECIFIC SOFTWARE ARCHITECTURE (1)
EAGERTOPK ALGORITHM (1)
ELECTRONIC NEWSPAPER (1)
ELECTRONIC PUBLISHING (1)
EQUATIONS (1)
EXTRACT-TRANSFORM-LOAD (ETL) (1)
FILE LAYOUT INFERENCE (1)
FILE PROCESSING (1)
GENERAL TEXT COMPRESSION (1)
GEOGRAPHIC INFORMATION SYSTEMS (1)
GEOGRAPHY (1)
GEOGRAPHY MARKUP LANGUAGE COMPRESSION APPROACH (1)
GEOSPATIAL DATA (1)
GML COMPRESSION (1)
IDENTIFICATION (1)
INDEXES (1)
INFORMATION DISCOVERY METHOD (1)
INFORMATION RETRIEVAL (1)
JOIN BASED ALGORITHM (1)
K SLCA RESULTS (1)
K-MEANS (1)
KEYWORD QUERY EVALUATION (1)
LABELING (1)
LAYOUT (1)
LCA BASED SEMANTIC VARIANT (1)
LINKED DATA (1)
LOWEST COMMON ANCESTOR (1)
MARKET RESEARCH (1)
MATERIALS (1)
MATHEMATICAL MODEL (1)
META-DATA DISCOVERY (1)
MODEMS (1)
NATURAL LANGUAGE PROCESSING (1)
NEWSPAPERS (1)
ON-LINE SEMANTIC CLUSTERING (1)
ONTOLOGIES (1)
OPINIONS (1)
OPINIONS IDENTIFICATION (1)
PEDIATRICS (1)
PRAGMATICS (1)
PROBABILISTIC LOGIC (1)
PROBABILISTIC XML DATA (1)
PROBABILITY (1)
PROGRAM ANALYSIS (1)
PRSTACK ALGORITHM (1)
QUERY FORMULATION (1)
QUERY PROCESSING (1)
QUERY SEMANTICS (1)
REDUNDANT TAGS (1)
RELATIONAL DATABASES (1)
RESOURCE DESCRIPTION FRAMEWORK (1)
SAMPLING (1)
SEMANTIC EXPANSION (1)
SEMANTIC PRUNING (1)
SEMANTIC SIMILARITY (1)
SWITCHES (1)
TEXT ENCODING (1)
TEXTUAL DATA REPRESENTATION (1)
THE INITIAL CLUSTERING CENTERS (1)
TIME GRANULARITY (1)
WEIBO EVENTS (1)
XML DATABASES (1)
XML KEYWORD QUERY (1)

INFONA - science communication portal
