Search results

Items from 41 to 60 out of 823 results

chapter

Web caching evaluation from Wikipedia request statistics

Gerhard Hasslinger, Mahmoud Kunbaz, Frank Hasslinger, Thomas Bauschert

2017 15th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt) > 1 - 6

2017 15th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)

Wikipedia is one of the most popular information platforms on the Internet. The user access pattern to Wikipedia pages depends on their relevance in the current worldwide social discourse. We use publically available statistics about the top-1000 most popular pages on each day to estimate the efficiency of caches for support of the platform. While the data volumes are moderate, the main goal of Wikipedia...

chapter

Wikipedia-based extraction of key information from resumes

Mohammad Ghufran, Nacera Bennacer, Gianluca Quercini

2017 11th International Conference on Research Challenges in Information Science (RCIS) > 135 - 145

2017 11th International Conference on Research Challenges in Information Science (RCIS)

There is a vast amount of information about individuals available on the Web that has potential uses in Human Resource Management (HRM) - both for recruiters and job seekers. Since people names are inherently ambiguous, finding information related to a specific person is challenging and a simple query by name will likely return web pages related to several different individuals who happen to share...

chapter

Automatic question generation for intelligent tutoring systems

Riken Shah, Deesha Shah, Lakshmi Kurup

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA) > 127 - 132

2017 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA)

In this paper, we present the system of automatic MCQs (Multiple Choice Questions) generation for any given input text along with a set of distractors. The system is trained on a Wikipedia-based dataset consisting of URLs of Wikipedia articles. The important words (keywords) which consist of both bigrams and unigrams are extracted and stored in a dictionary along with many other components of the...

chapter

User tracking using tweet segmentation and word

Mrudula Nimbarte, Mrunali Omprakash Thakare

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA) > 1 > 664 - 668

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)

Many organizations have been reported to create and monitoring targeted Twitter streams to collect a bunch of information and understand according to user's view. Targeted Twitter stream is main usually constructed by filtering tweets and that abused words with predefined selection criteria. Due to its invaluable business value of timely information from these tweets, it's a necessary to understand...

chapter

Source-LDA: Enhancing Probabilistic Topic Models Using Prior Knowledge Sources

Justin Wood, Patrick Tan, Wei Wang, Corey Arnold

2017 IEEE 33rd International Conference on Data Engineering (ICDE) > 411 - 422

2017 IEEE 33rd International Conference on Data Engineering (ICDE)

Topic modeling has increasingly attracted interests from researchers. Common methods of topic modeling usually produce a collection of unlabeled topics where each topic is depicted by a distribution of words. Associating semantic meaning with these word distributions is not always straightforward. Traditionally, this task is left to human interpretation. Manually labeling the topics is unfortunately...

chapter

Multi-Level Topical Text Categorization with Wikipedia

Nan Guo, Yuan He, Chungang Yan, Lu Liu, more

2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC) > 343 - 352

2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC)

This paper introduces an automatic categorical-marking model for text categorization. Traditional classification algorithms are generally applying labeled training set and call for a lot of manual work to tag classifications beforehand. Also due to the ambiguity and fuzziness of texts, the results of traditional text categorization algorithms may not be clear enough and abundant in content. This paper...

chapter

Towards applying OCR and Semantic Web to achieve optimal learning experience

Kiran Badwaik, Khalid Mahmood, Asif Raza

2017 IEEE 13th International Symposium on Autonomous Decentralized System (ISADS) > 262 - 267

2017 IEEE 13th International Symposium on Autonomous Decentralized System (ISADS)

As more and more learners are opting for onlinelearning, e-learning industry is working on improving learningexperience of online user by providing relevant content and lotof additional references. Since online learners mostly prefervideo tutorials, identifying major topics and subtopics coveredin video tutorial is a big challenge. Recently, for efficientknowledge sharing and interoperability over...

chapter

Cross-modality matching based on Fisher Vector with neural word embeddings and deep image features

Liang Han, Wenmin Wang, Mengdi Fan, Ronggang Wang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2921 - 2925

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Cross-modal retrieval, which aims to solve the problem that the query and the retrieved results are from different modality, becomes more and more essential with the development of the Internet. In this paper, we mainly focus on the exploration of high-level semantic representation of image and text for cross-modal matching. Deep convolutional image features and Fisher Vector with neural word embeddings...

chapter

Movie Summarization Based on Alignment of Plot and Shots

Xueshan Li, Takehito Utsuro, Hiroshi Uehara

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA) > 189 - 196

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA)

This paper proposes a method of assisting movie summarization using plotinformation. A plot of a movie available at Wikipedia contains a majorstory of the movie. From such a plot of a movie, we extract severalimportant sentences as the content of summary. For summarizing movie, the key work is finding the best alignment between sentences of plot andshots which are segmented from a movie. There are...

chapter

Three dimensional geo-tweet visualization system for spatio-temporal events

C. R. Athira, N. M. Dhanya

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT) > 1 - 6

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT)

Geo-tweet visualization help users know the events that is happening over the space and time from the tweets or wikipedia while they click on the specified location for a 3D based tag visualization. Normal events are detected by system which happens anywhere or anytime using machine learning algorithm and special events are also extracted by comparing current situation to normal regularities. Generally,...

chapter

Bag-of-Concepts Document Representation for Bayesian Text Classification

Marcos Mourino-Garcia, Roberto Perez-Rodriguez, Luis Anido-Rifon, Miguel Gomez-Carballa

2016 IEEE International Conference on Computer and Information Technology (CIT) > 281 - 288

2016 IEEE International Conference on Computer and Information Technology (CIT)

The classification of text documents into a number of pre-defined categories has many application scenarios, for example the classification of news items into thematic sections. Documents to be classified are commonly represented by a bag-of-words feature vector. The bag-of-words model cannot handle two language phenomena: synonymy and polysemy, besides, dimensions of feature vectors are orthogonal...

chapter

Android based educational Chatbot for visually impaired people

M Naveen Kumar, P C Linga Chandar, A Venkatesh Prasad, K Sumangali

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 4

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

The purpose of this android application is to provide educational based Chatbot for visually impaired people. It will give an answer to the educational based queries asked by the visually impaired people. They can easily launch the application with the help of google voice search. Once the application is open, it will give a voice instruction to use an application. Output will be provided in voice...

chapter

Question answering system for factoid based question

Prakash Ranjan, Rakesh Chandra Balabantaray

2016 2nd International Conference on Contemporary Computing and Informatics (IC3I) > 221 - 224

2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)

Objective of question answering system (QA) is to generate concise answer of arbitrary question asked in natural language. This kind of information retrieval is required with growth of digital information. Analysis of natural language is complex task. Previously QAS were developed for specific domain and have limited efficiency. Present QAS Target on types of question commonly asked by users, characteristics...

chapter

To link or not to link: Ranking hyperlinks in Wikipedia using collective attention

Philip Thruesen, Jaroslav Cechak, Blandine Seznec, Roel Castalio, more

2016 IEEE International Conference on Big Data (Big Data) > 1709 - 1718

2016 IEEE International Conference on Big Data (Big Data)

Wikipedia is one of the fastest growing websites and a primary source of knowledge on the Internet. Being a wiki, its content is crowd-sourced by the users. This has many benefits and it is one of the main reasons it has grown to reach more than 5 million articles in its English version. Nevertheless, this also raises issues, like the overlinking of articles, which are difficult to deal with by editors...

chapter

Parallel Text Identification Using Lexical and Corpus Features for the English-Maori Language Pair

Mahsa Mohaghegh, Abdolhossein Sarrafzadeh

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 910 - 915

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Comparable corpora contain significant quantities of useful data for Natural Language Processing tasks, especially in the area of Machine Translation. They are mainly the source of parallel text fragments. This paper investigates how to effectively extract bilingual texts from comparable corpora relying on a small-size parallel training corpus. We propose a new technique to filter non parallel articles...

chapter

A multilingual ontology based framework for wikipedia entry augmentation

Md. Tasnim Manzur Ankon, Sanjida Nasreen Tumpa, Muhammad Masroor Ali

2016 19th International Conference on Computer and Information Technology (ICCIT) > 541 - 545

2016 19th International Conference on Computer and Information Technology (ICCIT)

The domain of traditional web is gradually evolving with the adaptation of newer techniques, which includes semantic web. Integration of web content using ontologies in a language independent manner is a required feature in this process. For better utilization of the resources, it is necessary that the ontology, which is working as a central knowledge repository, to be language independent as well...

chapter

Lightweight system for NE-tagged news headlines corpus creation

Avinash Kumar, Dhaval Patel, Nikita Jain

2016 IEEE International Conference on Big Data (Big Data) > 3903 - 3912

2016 IEEE International Conference on Big Data (Big Data)

Named Entity Identification (NEI) is the task of identifying named entities from textual data. While NEI for English language can be done with considerable accuracy owing to tools like Stanford NER tagger, the accuracy in case of Indian languages like Hindi is comparatively poor. One of the reasons for this is the lack of sufficiently large annotated corpora in Indian languages on which NE-taggers...

chapter

A multichannel convolutional neural network for cross-language dialog state tracking

Hongjie Shi, Takashi Ushio, Mitsuru Endo, Katsuyoshi Yamagami, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 559 - 564

2016 IEEE Spoken Language Technology Workshop (SLT)

The fifth Dialog State Tracking Challenge (DSTC5) introduces a new cross-language dialog state tracking scenario, where the participants are asked to build their trackers based on the English training corpus, while evaluating them with the unlabeled Chinese corpus. Although the computer-generated translations for both English and Chinese corpus are provided in the dataset, these translations contain...

chapter

Topic Discovery for Short Texts Using Word Embeddings

Guangxu Xun, Vishrawas Gopalakrishnan, Fenglong Ma, Yaliang Li, more

2016 IEEE 16th International Conference on Data Mining (ICDM) > 1299 - 1304

2016 IEEE 16th International Conference on Data Mining (ICDM)

Discovering topics in short texts, such as news titles and tweets, has become an important task for many content analysis applications. However, due to the lack of rich context information in short texts, the performance of conventional topic models on short texts is usually unsatisfying. In this paper, we propose a novel topic model for short text corpus using word embeddings. Continuous space word...

chapter

Seasonal Fluctuations in Collective Mood Revealed by Wikipedia Searches and Twitter Posts

Fabon Dzogang, Thomas Lansdall-Welfare, Nello Cristianini

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW) > 931 - 937

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)

Understanding changes in the mood and mentalhealth of large populations is a challenge, with the need for largenumbers of samples to uncover any regular patterns within thedata. The use of data generated by online activities of healthyindividuals offers the opportunity to perform such observationson the large scales and for the long periods that are required. Various studies have previously examined...

Keywords:
INTERNET
ENCYCLOPEDIAS

Publication date

Set your own date range

Content availability

Available (819)
None (4)

Keywords

ELECTRONIC PUBLISHING (660)
WIKIPEDIA (197)
SEMANTICS (170)
DATA MINING (138)
WEB SITES (88)
ONTOLOGIES (87)
INFORMATION RETRIEVAL (78)
INFORMATION SERVICES (68)
SEARCH ENGINES (59)
CONTEXT (58)
NATURAL LANGUAGE PROCESSING (52)
COLLABORATION (48)
FEATURE EXTRACTION (45)
COMMUNITIES (41)
SOCIAL NETWORK SERVICES (39)
TRAINING (36)
VISUALIZATION (36)
TEXT ANALYSIS (35)
WEB PAGES (35)
DATABASES (33)
GOOGLE (33)
KNOWLEDGE BASED SYSTEMS (33)
DICTIONARIES (31)
EDUCATIONAL INSTITUTIONS (30)
ACCURACY (29)
SOFTWARE (29)
BLOGS (26)
HISTORY (26)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (24)
CLUSTERING ALGORITHMS (22)
EDUCATION (22)
ENCYCLOPAEDIAS (22)
ONTOLOGY (22)
SERVERS (22)
TWITTER (22)
COMPUTERS (21)
CORRELATION (21)
SOCIAL NETWORKING (ONLINE) (21)
TAGGING (20)
VECTORS (20)
COMPUTATIONAL MODELING (19)
MEDIA (19)
SEMANTIC WEB (19)
HUMANS (18)
ORGANIZATIONS (18)
ALGORITHM DESIGN AND ANALYSIS (17)
ELECTRONIC MAIL (17)
GROUPWARE (17)
INDEXES (16)
INFORMATION EXTRACTION (16)
MEASUREMENT (16)
QUERY PROCESSING (16)
SUPPORT VECTOR MACHINES (16)
WORDNET (16)
XML (16)
CLASSIFICATION ALGORITHMS (15)
DOCUMENT HANDLING (15)
KNOWLEDGE ENGINEERING (15)
RECOMMENDER SYSTEMS (15)
BUSINESS (14)
ENGINES (14)
JOINING PROCESSES (14)
KNOWLEDGE DISCOVERY (14)
SEMANTIC RELATEDNESS (14)
SEMANTIC SIMILARITY (14)
TEXT MINING (14)
VOCABULARY (14)
CITIES AND TOWNS (13)
GAMES (13)
KNOWLEDGE MANAGEMENT (13)
MACHINE LEARNING (13)
DATA MODELS (12)
MULTIMEDIA COMMUNICATION (12)
NAMED ENTITY RECOGNITION (12)
RELIABILITY (12)
SOCIAL MEDIA (12)
USER INTERFACES (12)
WEB 2.0 (12)
WEB SERVICES (12)
WORLD WIDE WEB (12)
BUILDINGS (11)
COMPUTER AIDED INSTRUCTION (11)
CONTENT MANAGEMENT (11)
E-LEARNING (11)
LABELING (11)
LIBRARIES (11)
MOBILE COMPUTING (11)
MOTION PICTURES (11)
PATTERN MATCHING (11)
QUERY EXPANSION (11)
TEXT CATEGORIZATION (11)
WIKI (11)
DATA VISUALIZATION (10)
DBPEDIA (10)
GRAPH THEORY (10)
HIDDEN MARKOV MODELS (10)
HTML (10)
MATHEMATICAL MODEL (10)
more

INFONA - science communication portal

Search results

Web caching evaluation from Wikipedia request statistics

Wikipedia-based extraction of key information from resumes

Automatic question generation for intelligent tutoring systems

User tracking using tweet segmentation and word

Source-LDA: Enhancing Probabilistic Topic Models Using Prior Knowledge Sources

Multi-Level Topical Text Categorization with Wikipedia

Towards applying OCR and Semantic Web to achieve optimal learning experience

Cross-modality matching based on Fisher Vector with neural word embeddings and deep image features

Movie Summarization Based on Alignment of Plot and Shots

Three dimensional geo-tweet visualization system for spatio-temporal events

Bag-of-Concepts Document Representation for Bayesian Text Classification

Android based educational Chatbot for visually impaired people

Question answering system for factoid based question

To link or not to link: Ranking hyperlinks in Wikipedia using collective attention

Parallel Text Identification Using Lexical and Corpus Features for the English-Maori Language Pair

A multilingual ontology based framework for wikipedia entry augmentation

Lightweight system for NE-tagged news headlines corpus creation

A multichannel convolutional neural network for cross-language dialog state tracking

Topic Discovery for Short Texts Using Word Embeddings

Seasonal Fluctuations in Collective Mood Revealed by Wikipedia Searches and Twitter Posts

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options