The current web has limitations such as high recall, low precision, and search results that are highly sensitive to vocabulary; because of this, the next-generation web, i.e., the Semantic Web, is used. In the Semantic Web, information is given in a well-defined and meaningful manner. The proposed system takes advantage of the Semantic Web. In the proposed approach we used the Google Map API to create a map and used it as the front...
Comparable corpora contain significant quantities of useful data for Natural Language Processing tasks, especially in the area of Machine Translation. They are mainly a source of parallel text fragments. This paper investigates how to effectively extract bilingual texts from comparable corpora relying on a small-size parallel training corpus. We propose a new technique to filter out non-parallel articles...
Automatic classification of news articles is a relevant problem due to the large number of news items generated every day, so it is crucial that these items are classified to allow users to access information of interest quickly and effectively. On the one hand, traditional classification systems represent documents as bag-of-words (BoW), which is oblivious to two problems of language: synonymy and...
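As background for the abstract above, the bag-of-words representation it criticizes can be sketched in a few lines; the example below is purely illustrative (not the paper's system) and shows how synonymy makes two related headlines look unrelated under BoW:

```python
from collections import Counter

def bag_of_words(text):
    """Represent a document as unordered word counts (BoW)."""
    return Counter(text.lower().split())

# Synonymy problem: two headlines about the same event share no
# surface terms, so their BoW vectors have zero overlap.
a = bag_of_words("stocks climb sharply")
b = bag_of_words("shares rise steeply")
shared = set(a) & set(b)
print(sorted(shared))  # -> []
```

This zero overlap is exactly the failure mode that motivates semantics-aware document representations.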
Previous research in spam detection, especially in email spam filtering, mainly focused on learning a set of discriminative features that are often present in the spam contents. Nowadays, these commercially oriented spams are well detected; the real challenge lies in filtering rather vague spams that do not exhibit distinctive spam keywords. We investigate two ways of detecting such spams: 1) By comparing...
AttitudeBuzz is a system that analyzes and presents complex social attitudes based on geolocated social media data. The system uses a machine learning model to apply highly domain-specific sentiment analysis to such data, specifically Twitter, by learning modulators around a configurable lexicon central to the domain of inquiry. Training data are acquired from geographical areas where a specific attitude...
Nowadays, the massive amount of data and information (recently termed “Big Data”) causes accessibility and retrieval problems if poorly managed. This is due to their relational structure, which is more complicated and harder to explain or analyze with simple or traditional methods. The uniform display of these data and information is also difficult due to their diversified formats. Bag of Words...
The purpose of the study was to develop a machine learning based technique to detect the up-calls of North Atlantic Right Whales among all other noises, such as the calls of other sea creatures, so that ships plying the seas could be warned of the whales' presence in order to avoid a direct collision. What made the study quite difficult was the non-stationary component of the signals along...
We propose a heterogeneous information network mining algorithm: feature-enhanced Rank Class (F-Rank Class). F-Rank Class extends Rank Class to a unified classification framework that can be applied to binary or multiclass classification of unimodal or multimodal data. We experimented on a multimodal document dataset, 2008/9 Wikipedia Selection for Schools. For unimodal classification, F-Rank Class...
Bug reporting is essentially an uncoordinated process. The same bugs could be repeatedly reported because users or testers are unaware of previously reported bugs. As a result, extra time could be spent on bug triaging and fixing. In order to reduce redundant effort, it is important to provide bug reporters with the ability to search for previously reported bugs. The search functions provided by the...
With the development of social media websites, more and more users start to show their attitudes and emotions to each other. Some of these interactions can be represented as links with sign values (positive or negative). In this paper, a unified method is proposed for link prediction and feature analysis. This paper focuses on the data from social media websites and tries to find the features that...
This work investigates identifying social behaviors (adversarial behavior and influence) of participants in online discussion forums from their language use in English, Arabic, and Chinese. We describe the challenges of annotating implicit information signaled by subtle cues and present two styles of annotation -- one using professional annotators and the other Mechanical Turk. Our system,...
This paper proposes an unsupervised two-stage approach to automatically extract keywords from spoken documents. In the first stage, for each candidate term we compute a topic coherence and term significance measure (TCS) based on probabilistic latent semantic analysis (PLSA) models. In the second stage, we take the candidate terms with the highest and lowest TCS scores as positive and negative examples...
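The second-stage selection step described above (top-scored candidates as positive examples, bottom-scored as negative) can be sketched as follows; the function name and the TCS-like scores are illustrative assumptions, not the authors' code:

```python
def split_by_score(scores, k):
    """Take the k highest-scoring candidates as positive examples
    and the k lowest-scoring ones as negative examples."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k], ranked[-k:]

# Hypothetical TCS-like scores for candidate terms.
tcs = {"latent": 0.91, "semantic": 0.85, "the": 0.05, "a": 0.02, "model": 0.60}
pos, neg = split_by_score(tcs, 2)
print(pos, neg)  # -> ['latent', 'semantic'] ['the', 'a']
```

The middle-scored candidates are left unlabeled, which is what makes the selection useful for bootstrapping a classifier without supervision.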
Depending on the question, various answering methods and answer sources can be used. In this paper, we build a distributed QA system to handle different types of questions and web sources. When a user question is entered, the broker distributes the question over multiple sub-QAs according to question type. The selected sub-QAs find locally optimal candidate answers, which are then collected into the...
Existing Automatic Image Annotation (AIA) systems are typically developed, trained and tested using high quality, manually labelled images. The tremendous manual efforts required with an untested ability to scale and tolerate noise all have an impact on existing systems' applicability to real-world data. In this paper, we propose a novel AIA system which harnesses the collective intelligence on the...
Recognition of named entities (people, companies, locations, etc.) is an essential task of text analytics. We address a subproblem of this task, namely named entity classification. We propose a novel approach that constructs an effective fine-grained named entity classifier. Its key highlights are semi-automatic training-set construction from Wikipedia articles and additional feature selection....
There is a substantial body of work on the extraction of relations from texts, most of which is based on pattern matching or on applying tree kernel functions to syntactic structures. Whereas pattern application is usually more efficient, tree kernels can be superior when assessed by the F-measure. In this paper, we introduce a hybrid approach to extracting meronymy relations, which is based on both...
The use of domain knowledge is generally found to improve query efficiency in content filtering applications. In particular, tangible benefits have been achieved when using knowledge-based approaches within more specialized fields, such as medical free texts or legal documents. However, the problem is that sources of domain knowledge are time consuming to build and equally costly to maintain. As a...
With the advent of Web 2.0, billions of videos are now freely available online. Meanwhile, rich user-generated information for these videos, such as tags and online encyclopedia entries, offers us a chance to enhance existing video analysis technologies. In this paper, we propose a mash-up framework to realize video category recommendation by leveraging web information from different sources. Under this...
We target in this paper the challenge of extracting geospatial data from the article text of the English Wikipedia. We present the results of a Hidden Markov Model (HMM) based approach to identifying location-related named entities in our corpus of Wikipedia articles, which are primarily about battles and wars due to their high geospatial content. The HMM NER process drives a geocoding and resolution...
This paper addresses the challenge of extracting geospatial data from the article text of the English Wikipedia. In the first phase of our work, we create a training corpus and select a set of word-based features to train a Support Vector Machine (SVM) for the task of geospatial named entity recognition. We target for testing a corpus of Wikipedia articles about battles and wars, as these have a high...
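The word-based features mentioned in the SVM abstract above might resemble the following sketch; the specific feature choices here are illustrative assumptions, not the paper's actual feature set:

```python
def word_features(tokens, i):
    """Simple word-based features for token i, of the kind often fed
    to an SVM for named entity recognition (illustrative choices)."""
    w = tokens[i]
    return {
        "word": w.lower(),
        "is_capitalized": w[:1].isupper(),
        "prev": tokens[i - 1].lower() if i > 0 else "<s>",
        "next": tokens[i + 1].lower() if i + 1 < len(tokens) else "</s>",
        "suffix3": w[-3:].lower(),
    }

tokens = "The Battle of Hastings was fought in England".split()
f = word_features(tokens, 3)  # features for "Hastings"
print(f["is_capitalized"], f["prev"])  # -> True of
```

In practice, each token's feature dict would be vectorized and paired with a location/non-location label to train the classifier.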