Search results

Items from 1 to 20 out of 28 results

article

NELasso: Group-Sparse Modeling for Characterizing Relations Among Named Entities in News Articles

Amara Tariq, Asim Karim, Hassan Foroosh

IEEE Transactions on Pattern Analysis and Machine Intelligence > 2017 > 39 > 10 > 2000 - 2014

Named entities such as people, locations, and organizations play a vital role in characterizing online content. They often reflect information of interest and are frequently used in search queries. Although named entities can be detected reliably from textual content, extracting relations among them is more challenging, yet useful in various applications (e.g., news recommending systems). In this...

chapter

Acquisition and clustering for affective semantic lexicon from web

Fang Tian, Xiao Sun, Benwang Sun

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 255 - 259

A simple method for extracting a semantic lexicon from the Baidu Chinese Network Encyclopedia is proposed, based on one hypothesis and three filtering rules. The acquired affective lexicon includes emotional words and their lexical semantic relations, including synonyms and antonyms. The acquisition method is a recursive algorithm that starts from seed words. The extracted affective lexicon is labeled with affective tendency...

chapter

Measuring cross-lingual semantic similarity across European languages

Lutfi Kerem Senel, Veysel Yucesoy, Aykut Koc, Tolga Cukur

2017 40th International Conference on Telecommunications and Signal Processing (TSP) > 359 - 363

This paper studies cross-lingual semantic similarity (CLSS) between five European languages (i.e. English, French, German, Spanish and Italian) via unsupervised word embeddings from a cross-lingual lexicon. The vocabulary in each language is projected onto a separate high-dimensional vector space, and these vector spaces are then compared using several different distance measures (i.e., correlation,...

chapter

WikiDocsAligner: An Off-the-Shelf Wikipedia Documents Alignment Tool

Motaz Saad, Basem O. Alijla

2017 Palestinian International Conference on Information and Communication Technology (PICICT) > 34 - 39

Wikipedia is an attractive source of comparable corpora in many languages. Most researchers develop their own scripts to perform the document alignment task, which requires effort and time. In this paper, we present WikiDocsAligner, a handy off-the-shelf tool for aligning Wikipedia articles. The implementation of WikiDocsAligner does not require the researchers to import/export of interlanguage...

chapter

Linked 'Big' Data: Towards a Manifold Increase in Big Data Value and Veracity

Jeremy Debattista, Christoph Lange, Simon Scerri, Soren Auer

2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC) > 92 - 98

The Web of Data is an increasingly rich source of information, which makes it useful for Big Data analysis. However, there is no guarantee that this Web of Data will provide the consumer with truthful and valuable information. Most research has focused on Big Data's Volume, Velocity, and Variety dimensions. Unfortunately, Veracity and Value, often regarded as the fourth and fifth dimensions, have...

chapter

Cross-Site Virtual Social Network Construction

Chenhao Xie, Deqing Yang, Jingrui He, Yanghua Xiao

2015 IEEE International Conference on Data Mining Workshop (ICDMW) > 1660 - 1663

Given the plethora of social networking sites, it can be difficult for users to browse too many sites and discover social friends. For example, for a new diabetes patient, how can s/he find the users with similar symptoms on different dedicated sites and form supporting groups with them? Since different sites may use different vocabularies, this problem is challenging to match users across different...

chapter

From text vocabularies to visual vocabularies what basis?

Jean Martinet

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 2 > 668 - 675

The popular “bag-of-visual-words” approach for representing and searching visual documents consists of describing images (or video keyframes) using a set of descriptors that correspond to quantized low-level features. Most existing approaches for visual words are inspired by work in text indexing, based on the implicit assumption that visual words can be handled the same way as text words....

chapter

Computing Terms Semantic Relatedness by Knowledge in Wikipedia

Dexin Zhao, Liangliang Qin, Pengjie Liu, Zhen Ma, et al.

2015 12th Web Information System and Application Conference (WISA) > 107 - 111

In recent years, many researchers have recognized Wikipedia as a huge, dynamic knowledge base. This paper provides a new approach for measuring the semantic relatedness of terms, which maps terms to relevant Wikipedia articles as background information for analysis. The proposed algorithm WLA focuses on the hyperlink structure and summary paragraph extracted from the topic pages...

chapter

Automatic Glossing Services for E-learning Cloud Environments

Ruth Cortez, Alexander Vazhenin, John Brine

2014 IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs > 128 - 131

2014 IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs (MCSoC)

In language learning scenarios, the use of glossing technique has a positive effect on incidental vocabulary acquisition as a by-product of reading. However, the preparation of materials that include glosses can be a time consuming task for the teacher. Automatic glossing tools have gained interest to help reduce such efforts, and to provide a better experience using electronic documents. Most glossing...

chapter

Developing text and speech databases for speech recognition of Vietnamese

Nguyen Thien Chuong, Josef Chaloupka

2013 IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems (IDAACS) > 1 > 163 - 166

2013 IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)

This paper describes our study on developing the text and speech databases for automatic speech recognition of Vietnamese using an available source of linguistic data: the Internet. First, a two-stage procedure is applied to extract a general text corpus which can be used for research on the Vietnamese language such as speech recognition, audio-visual speech recognition, and natural language processing...

chapter

As Simple as It Gets - A Sentence Simplifier for Different Learning Levels and Contexts

Bernardo Pereira Nunes, Ricardo Kawase, Patrick Siehndel, Marco A. Casanova, et al.

2013 IEEE 13th International Conference on Advanced Learning Technologies > 128 - 132

2013 IEEE 13th International Conference on Advanced Learning Technologies (ICALT)

This paper presents a text simplification method that transforms complex sentences into simplified forms. Our method uses NLP-techniques to simplify the text based on the target audience context, improving its overall understandability. We evaluate our approach in two aspects: grammatical structure and understandability. In both aspects, our approach achieved good results, showing its applicability...

chapter

Linked Data driven development

Jan Boznik, Vili Podgorelec, Marjan Hericko, Crt Gerlec

2011 7th Central and Eastern European Software Engineering Conference (CEE-SECR) > 1 - 6

2011 7th Central and Eastern European Software Engineering Conference in Russia (CEE-SECR 2011)

In this paper we introduce Linked Data driven development, a lightweight methodology for using Linked Data throughout the software life cycle. We explain the idea of Linked Data and how it plays an important role in the semantic web. Furthermore, we describe the necessary steps and approaches when using Linked Data for improving the software development process and give a discussion on the bonuses...

chapter

A Multidimensional Semantic Space for Data Model Independent Queries over RDF Data

Andre Freitas, Joao Gabriel Oliveira, Edward Curry, Seán O'Riain

2011 IEEE Fifth International Conference on Semantic Computing > 344 - 351

2011 IEEE Fifth International Conference on Semantic Computing (ICSC)

The vision of creating a Linked Data Web brings together the challenge of allowing queries across highly heterogeneous and distributed datasets. In order to query Linked Data on the Web today, end-users need to be aware of which datasets potentially contain the data and also which data model describes these datasets. The process of allowing users to expressively query relationships in RDF while abstracting...

chapter

HAMEX - A Handwritten and Audio Dataset of Mathematical Expressions

Solen Quiniou, Harold Mouchere, Sebastián Peña Saldarriaga, Christian Viard-Gaudin, et al.

2011 International Conference on Document Analysis and Recognition > 452 - 456

2011 International Conference on Document Analysis and Recognition (ICDAR)

In this paper, we present HAMEX, a new public dataset that contains mathematical expressions available in their on-line handwritten form and in their audio spoken form. We have designed this dataset so that, given a mathematical expression, its handwritten signal and its audio signal can be used jointly to design multimodal recognition systems. Here, we describe the different steps that allowed us...

chapter

Bridging Folksonomies and Domain Ontologies: Getting Out Non-taxonomic Relations

C Trabelsi, Aicha Ben Jrad, Sadok Ben Yahia

2010 IEEE International Conference on Data Mining Workshops > 369 - 379

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

Social bookmarking tools are rapidly emerging on the Web, as witnessed by the overwhelming number of participants. In such spaces, users annotate resources by means of any keyword or tag that they find relevant, giving rise to lightweight conceptual structures, also known as folksonomies. In this respect, needless to mention that ontologies can be of benefit for enhancing information retrieval metrics...

chapter

Enriching tagging systems with Google query tags

C Trattner, D Helic, S Maglajlic

Proceedings of the ITI 2010, 32nd International Conference on Information Technology Interfaces > 205 - 210

2010 32nd International Conference on Information Technology Interfaces (ITI 2010)

As recent research shows, efficient navigability of tagging systems is only possible if the number of tags grows hand in hand with the number of tagged resources. However, the number of resources grows typically faster than the number of tags. In this paper we analyze how enriching of user tags with tags generated from Google queries influences navigability in tagging systems. The analysis dataset...

chapter

Using Multiple Hybrid Strategies to Extract Chinese Synonyms from Encyclopedia Resource

Lu Yong, Zhang Chengzhi, Hou Hanqing

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC) > 1089 - 1093

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC 2009)

Automatic extraction of Chinese synonyms plays an important role in information retrieval and semantic resource construction. Based on an analysis and comparison of different synonym extraction technologies, this paper proposes a multi-strategy method, including a literal similarity algorithm, a pattern matching algorithm, and the PageRank algorithm, to extract Chinese synonyms from encyclopedia resource...

chapter

BEST Corpus Development and Analysis

M. Boriboon, K. Kriengket, P. Chootrakool, S. Phaholphinyo, et al.

2009 International Conference on Asian Language Processing > 322 - 327

2009 International Conference on Asian Language Processing (IALP 2009)

This document describes the development process of the BEST 2009 word-segmented corpus. It is the first corpus to benchmark Thai word segmentation software. The corpus is composed of four genres, namely, news, novels, encyclopedia, and academic articles. It contains 509 files. Its size is 64.1 MB. There are 5,036,229 tokens with 83,027 unique tokens. Common tokens appearing in all...

chapter

Using Hyperlink Texts to Improve Quality of Identifying Document Topics Based on Wikipedia

D.T. Huynh, T.H. Cao, P.H.T. Pham, T.N. Hoang

2009 International Conference on Knowledge and Systems Engineering > 249 - 254

2009 International Conference on Knowledge and Systems Engineering (KSE 2009)

This paper presents a method to identify the topics of documents based on the Wikipedia category network. It improves on the method previously proposed by Schonhofen by taking into account the weights of words in hyperlink texts in Wikipedia articles. Experiments in the computing and team sport domains show that the proposed method outperforms Schonhofen's.

chapter

I know what you did last summer: object-level auto-annotation of holiday snaps

Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool

2009 IEEE 12th International Conference on Computer Vision > 614 - 621

2009 IEEE 12th International Conference on Computer Vision (ICCV)

The state of the art in visual object retrieval from large databases allows searching millions of images at the object level. Recently, complementary works have proposed systems to crawl large object databases from community photo collections on the Internet. We combine these two lines of work into a large-scale system for auto-annotation of holiday snaps. The resulting method allows for automatic labeling...

Data set:
ieee
Keywords:
ENCYCLOPEDIAS
VOCABULARY

