Search results

article

Joint Hierarchical Category Structure Learning and Large-Scale Image Classification

Yanyun Qu, Li Lin, Fumin Shen, Chang Lu, et al.

IEEE Transactions on Image Processing > 2017 > 26 > 9 > 4331 - 4346

We investigate the scalable image classification problem with a large number of categories. Hierarchical visual data structures are helpful for improving the efficiency and performance of large-scale multi-class classification. We propose a novel image classification method based on learning hierarchical inter-class structures. Specifically, we first design a fast algorithm to compute the similarity...

chapter

Optimizing K-means text document clustering using latent semantic indexing and pillar algorithm

Sigit Adinugroho, Yuita Arum Sari, M. Ali Fauzi, Putra Pandu Adikara

2017 5th International Symposium on Computational and Business Intelligence (ISCBI) > 81 - 85

Document clustering is an important tool for managing the vast amount of digital text documents. This paper introduces a new approach to clustering text documents. First, text is preprocessed and indexed using an inverted index. Then the index is trimmed using TF-DF thresholding. After that, a Term Document Matrix is built based on TF-IDF. The next step uses Latent Semantic Indexing to extract important feature...

chapter

Multi-document abstractive summarization based on predicate argument structure

S Alshaina, Ansamma John, Aneesh G Nath

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) > 1 - 6

The proposed work is based on abstractive summarization, which is a branch of text summarization. It develops a summary of multiple documents using the semantic relationships between the input documents rather than text taken verbatim from them. This is necessary because generating abstracts manually is difficult, and it is also a challenging task. In our system, the summary is generated...

chapter

Aspect Extraction and Aspect Terms Expansion in Chinese Reviews Using Cluster Semi-Supervised Expansion Model

Jianyong Tuo, Shuo Yan, Bing Li, Hailiang Wang, et al.

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 212 - 217

Aspect extraction is one of the most important tasks in text mining. Semi-supervised methods have been proposed to solve this problem. However, these methods require the seed terms to be given in advance. Current methods categorize the aspects without expanding to more aspect terms. Moreover, most of the methods are based on English corpora, so there is great room for research on aspect extraction...

chapter

A multi-view fusion approach for entity alignment

Chunxia Zhang, Xiuzhang Yang, Shuliang Wang, Zhendong Niu, et al.

2017 IEEE 16th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) > 388 - 393

Entity alignment is an important issue in the areas of ontology alignment and computational intelligence. Ontology alignment is a key technology to solve the semantic heterogeneity problem of ontology and the Semantic Web, and to realize knowledge reusing and integration. The task of entity alignment is to identify entities represented in textual documents or web pages which refer to the same entities...

chapter

Automatic synonym extraction using Word2Vec and spectral clustering

Li Zhang, Jun Li, Chao Wang

2017 36th Chinese Control Conference (CCC) > 5629 - 5632

Synonym extraction is a fundamental research task that is helpful for text mining and information retrieval. In this paper, we propose a method to extract synonyms from text; the method employs spectral clustering and word2vec. First, the word2vec model is trained on a large-scale English Wikipedia corpus. Then, we extract keywords from a text and use the trained model to generate similarities among these...

chapter

Crowdsourced time-sync video tagging using semantic association graph

Wenmian Yang, Na Ruan, Wenyuan Gao, Kun Wang, et al.

2017 IEEE International Conference on Multimedia and Expo (ICME) > 547 - 552

Time-sync comments reveal a new way of extracting online video tags. However, such time-sync comments contain a lot of noise due to users' diverse comments, which poses great challenges for accurate and fast video tag extraction. In this paper, we propose an unsupervised video tag extraction algorithm named Semantic Weight-Inverse Document Frequency (SW-IDF). SW-IDF first generates corresponding...

chapter

Improving Web Service Clustering through a Novel Ontology Generation Method by Domain Specificity

Rupasingha A. H. M. Rupasingha, Incheon Paik, Banage T. G. S. Kumara

2017 IEEE International Conference on Web Services (ICWS) > 744 - 751

In recent years, due to the growth of information on the internet, the number of available Web services has increased. Clustering Web services based on their functional features into different domains has started to play a major role in several service management tasks such as efficient Web service discovery and recommendations. In this paper, we propose a novel ontology-based approach for Web service clustering...

chapter

WE-LDA: A Word Embeddings Augmented LDA Model for Web Services Clustering

Min Shi, Jianxun Liu, Dong Zhou, Mingdong Tang, et al.

2017 IEEE International Conference on Web Services (ICWS) > 9 - 16

Due to the rapid growth in both the number and diversity of Web services on the web, it becomes increasingly difficult for us to find the desired and appropriate Web services nowadays. Clustering Web services according to their functionalities becomes an efficient way to facilitate the Web services discovery as well as the services management. Existing methods for Web services clustering mostly focus...

chapter

Extracting Topics Based on Word2Vec and Improved Jaccard Similarity Coefficient

Chunzi Wu, Bai Wang

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC) > 389 - 397

To extract key topics from news articles, this paper investigates an efficient way to construct text vectors and to improve the efficiency and accuracy of document clustering based on the Word2Vec model. It proposes a novel algorithm that combines the Jaccard similarity coefficient and inverse dimension frequency to calculate the importance degree between each dimension...

chapter

A comparative study for Arabic Multi-Document Summarization Systems (AMD-SS)

Mossab N. Ibrahim, Khulood Abu Maria, Khalid Mohammad Jaber

2017 8th International Conference on Information Technology (ICIT) > 1013 - 1022

This paper presents a comparative study of Arabic Multi-Document Summarization Systems (AMD-SS). The methods are compared and analyzed, aiming to determine which method generates a genuine summary and achieves the best results in comparison with human summarization techniques. The comparative study shows that there is a gap in the area of Arabic Automatic Text Summarization systems. Therefore,...

chapter

A deep learning enabled subspace spectral ensemble clustering approach for web anomaly detection

Guiqin Yuan, Bo Li, Yiyang Yao, Simin Zhang

2017 International Joint Conference on Neural Networks (IJCNN) > 3896 - 3903

With the development of the Internet, detecting web-based anomalies is vital for Internet security. Clustering based on manual feature extraction has been verified as a significant way to detect new anomalies. However, the representations of these features cannot express the semantic information of the URLs. In addition, few studies try to cluster the anomalies into specific types like SQL-injection...

chapter

Word sense disambiguation in Bengali: An unsupervised approach

Alok Ranjan Pal, Diganta Saha

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT) > 1 - 5

In the proposed approach, Word Sense Disambiguation (WSD) in the Bengali language is performed using an unsupervised methodology. The work consists of two sequential sub-tasks. The first is grouping Bengali sentences into a certain number of clusters, where each cluster contains sentences of similar meaning, and the second is labeling the clusters with their inner meanings with the help...

chapter

OnSeS: A Novel Online Short Text Summarization Based on BM25 and Neural Network

Jianwei Niu, Qingjuan Zhao, Lei Wang, Huan Chen, et al.

2016 IEEE Global Communications Conference (GLOBECOM) > 1 - 6

The last decade has witnessed a dramatic growth of social networks, such as Twitter, Sina Microblog, etc. Messages/short texts on these platforms are generally of limited length, causing difficulties for machines to understand. Moreover, it is rarely possible for users to read and understand all the content due to the large quantity. So it is imperative to cluster and extract the viewpoints of these...

chapter

Text Clustering Algorithm Based on Semantic Graph Structure

Qiuchan Bai, Chunxia Jin

2016 9th International Symposium on Computational Intelligence and Design (ISCID) > 2 > 312 - 316

As semantic information is often missing in text representation, this paper proposes a semantic graph structure to represent text and optimizes the graph structure using a semantic similarity matrix. The similarity of semantic graph structures is then calculated using the maximum common sub-graph from graph theory. Finally, the K-means algorithm is applied to expand Chinese text clustering to improve text clustering...

chapter

Story Forms Detection in Text through Concept-Based Co-Clustering

Sultan Alzahrani, Betul Ceran, Saud Alashri, Scott W. Ruston, et al.

2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom) > 258 - 265

A story is defined as actors taking actions that culminate in resolutions. In this paper, we extract subject - verb - object relationships from paragraphs and generalize them into semantic conceptual representations. Overlapping generalized concepts and relationships correspond to archetypes/targets and actions that characterize story forms. We present an analytic framework which implements co-clustering...

chapter

Sentence level opinion mining of hotel comments

Hongting Li, Qinke Peng, Xinyu Guan

2016 IEEE International Conference on Information and Automation (ICIA) > 2065 - 2070

With the rapid growth of Internet consumption, the varied forms of product comments and their redundant information make it difficult for customers to grasp the hot opinions in historical comments. In view of this, this paper studies the hot opinions in product comments and takes hotel comment data as the main research object. We filter the comment data from the length of the comments...

chapter

A Fused Multi-feature Based Co-training Approach for Document Clustering

Yuanqing Wang, Wenjun Wang, Weidi Dai, Pengfei Jiao, et al.

2016 3rd International Conference on Information Science and Control Engineering (ICISCE) > 38 - 43

Document clustering is a popular topic in data mining and information retrieval. Most models and methods for this problem are based on computing the similarity between pairs of documents modeled in a space of all terms, or in a new feature space obtained by applying a topic modeling technique to a given corpus. In this paper, we regard these two ideas as clustering on term features and on semantic features,...

chapter

Compressive-signal annotation driven by a supervised topic-clustering BoF model

Jianyan Zheng, Lihong Ma, Xiaoer Wang

2016 International Conference on Audio, Language and Image Processing (ICALIP) > 353 - 356

This paper presents a new Bag-of-Features (BoF) model to enhance the efficiency of automatic image annotation. Since the traditional BoF ignores the semantics of its vocabulary, it cannot be seen as a descriptive representation of images in many image applications. To handle this critical limitation, we first propose the RGB compressive texton. By using compressive sensing theory, the image can...

chapter

Clustering Product Features of Online Reviews Based on Nonnegative Matrix Tri-factorizations

Wang Jiajia, Liu Yezheng, Jiang Yuanchun, Sun Chunhua, et al.

2016 IEEE First International Conference on Data Science in Cyberspace (DSC) > 199 - 208

Clustering product features is an essential task for mining opinions from unstructured online reviews because different customers usually express the same feature with different words or phrases. Several supervised and unsupervised methods have been applied to accomplish this task. In this paper, we propose an orthogonal nonnegative matrix tri-factorization model to solve the problem. We first construct...

INFONA - science communication portal
