Search results

Items from 1 to 20 out of 36 results

chapter

Effective Information Spreading in Social Networks

Nunziato Cassavia, Elio Masciari, Domenico Sacca

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 599 - 606

2017 IEEE International Conference on Information Reuse and Integration (IRI)

Due to the emerging Big Data paradigm, traditional data management techniques result inadequate in many real life scenarios. In particular, the availability of huge amounts of data pertaining to social interactions among users calls for advanced analysis strategies. Furthermore, heterogeneity and high speed of this data require suitable data storage and management tools to be designed from scratch...

chapter

Text Document Clustering: The Application of Cluster Analysis to Textual Document

Venkata Srikanth Reddy, Patrick Kinnicutt, Roger Lee

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 1174 - 1179

2016 International Conference on Computational Science and Computational Intelligence (CSCI)

Gathering the most relevant data for one's need, from the huge collection of data in the internet is a work of great difficult. To make it easier, we propose an application called text clustering, which is an automatic grouping of text documents into clusters, so that documents within a cluster defines the similarity between them, but they are not similar to documents in other clusters. Most of existing...

chapter

PFU: Profiling Forum users in online social networks, a knowledge driven data mining approach

G U Vasanthakumar, P Deepa Shenoy, K R Venugopal

2015 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE) > 57 - 60

2015 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE)

Online Social Networks (OSNs) provide platform to raise opinions on various issues, create and spread news rapidly in Online Social Network Forums (OSNFs). This work proposes a novel method for Profiling Forum Users (PFU) by exploring their behavioral characteristics based on their involvement in various topics of discussion and number of posts in respective topics posted by them in OSNFs dynamically...

chapter

Automated discovery of worldwide content servers infrastructure - the SNIFFER project

Andrzej Bak, Piotr Gajowniczek, Marcin Pilarski, Marcin Borkowski

2014 Federated Conference on Computer Science and Information Systems > 921 - 924

2014 Federated Conference on Computer Science and Information Systems (FedCSIS)

Service architecture of the Internet becomes more and more complex as it expands as a medium for large-scale distribution of diverse content. Dynamic growth of various content distribution systems, deployed by influential Internet companies, content distributors, aggregators and owners, has substantial impact on distribution of the network traffic and the scalability of various Internet services....

chapter

A Design and Implementation of Intrusion Detection System by Using Data Mining

Brijesh Sharma, Huma Gupta

2014 Fourth International Conference on Communication Systems and Network Technologies > 700 - 704

2014 International Conference on Communication Systems and Network Technologies (CSNT)

The role of the intrusion detection system is to enforce the pattern matching policies decided for the network. Basically Proposed IDS executes on the KDD'99 Data set, this data set is used in international level for evaluating/calculating the performance of various intrusion detection systems (IDS). First step is association phase in which frequent item set are produced by apriori algorithm. The...

chapter

Improved FCM Algorithm for Clustering on Web Usage Mining

K Suresh, R MadanaMohana, A RamaMohan Reddy, A Subramanyam

2011 International Conference on Computer and Management (CAMAN) > 1 - 4

2011 International Conference on Computer and Management (CAMAN 2011)

In this paper we present clustering method is very sensitive to the initial center values, requirements on the data set too high, and cannot handle noisy data the proposal method is using information entropy to initialize the cluster centers and introduce weighting parameters to adjust the location of cluster centers and noise problems. The navigation datasets which are sequential in nature. Clustering...

article

iHelp: An Intelligent Online Helpdesk System

Dingding Wang, Tao Li, Shenghuo Zhu, Yihong Gong

IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) > 2011 > 41 > 1 > 173 - 182

Due to the importance of high-quality customer service, many companies use intelligent helpdesk systems (e.g., case-based systems) to improve customer service quality. However, these systems face two challenges: 1) Case retrieval measures: most case-based systems use traditional keyword-matching-based ranking schemes for case retrieval and have difficulty to capture the semantic meanings of cases...

chapter

BOAT adaptive credit card fraud detection system

K K Sherly, R Nedunchezhian

2010 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 7

2010 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC 2010)

Fraud is increasing with the extensive use of internet and the increase of online transactions. More advanced solutions are desired to protect financial service companies and credit card holders from constantly evolving online fraud attacks. The main objective of this paper is to construct an efficient fraud detection system which is adaptive to the behavior changes by combining classification and...

chapter

Algorithm of the Text Copy Detection Based on Topic Bag

Wang Sen, Wang Yu

2010 International Conference on Web Information Systems and Mining > 1 > 285 - 288

2010 International Conference on Web Information Systems and Mining (WISM 2010)

In order to resolve the current problem about seriously academic plagiarism in the web environment, this article proposes an algorithm of the text copy detection on the topic bag and the algorithm uses the idea of semantic clustering and multi-instance learning. Firstly, a paper is divided into three layers construction tree: a leaf node denotes a sentence; a branch node represents a topic bag, and...

chapter

EST Clustering in Large Dataset with MapReduce

Chunyu Wang, Maozu Guo, Yang Liu

2010 First International Conference on Pervasive Computing, Signal Processing and Applications > 968 - 971

2010 First International Conference on Pervasive Computing, Signal Processing and Applications (PCSPA 2010)

Analysis about EST data usually starts with EST clustering, the process of grouping fragments according their original consensus long sequence. The similarity between ESTs always means that part of the sequences match with each other in some way. Accurate clustering is quadratic in time in average EST length and numbers, and the number of ESTs in public EST database is increasing exponentially. With...

chapter

A comparison of two suffix tree-based document clustering algorithms

M Rafi, M Maujood, Murtaza Munawar Fazal, Syed Muhammad Ali

2010 International Conference on Information and Emerging Technologies > 1 - 5

2010 International Conference on Information and Emerging Technologies (ICIET)

Document clustering as an unsupervised approach extensively used to navigate, filter, summarize and manage large collection of document repositories like the World Wide Web (WWW). Recently, focuses in this domain shifted from traditional vector based document similarity for clustering to suffix tree based document similarity, as it offers more semantic representation of the text present in the document...

chapter

Instance Discovery and Schema Matching with Applications to Biological Deep Web Data Integration

Tantan Liu, Fan Wang, Gagan Agrawal

2010 IEEE International Conference on BioInformatics and BioEngineering > 304 - 305

2010 International Conference on BioInformatics and BioEngineering (BIBE)

We presents data mining-based techniques for enabling data integration across deep web data sources. We target query processing across inter-dependent data sources. Thus, besides input-input and output-output matching of attributes, we also need to consider input-output matching. We develop data mining techniques for discovering the instances for querying deep web data sources from the information...

chapter

Research on Segmentation of E-shoppers Based on Clustering

Wang Chong, Liu Jian, Wang Yanqing

2010 International Conference on Intelligent Computation Technology and Automation > 3 > 100 - 103

2010 International Conference on Intelligent Computation Technology and Automation (ICICTA 2010)

With the rapid development of online shopping, the ability to segment e-shoppers basing on their preferences and characteristics has become a key source of competitive advantage for firms. This paper presented the realistic algorithms for clustering e-shoppers in e-commerce applications. Multi-dimensional range search is presented to solve the range-searching problem. This is a multi-level structure...

chapter

Study of Deep Web Sources Classification Technology

Huilan Zhao

2010 Second International Conference on Multimedia and Information Technology > 1 > 324 - 326

2010 Second International Conference on Multimedia and Information Technology (MMIT 2010)

Searching on the Internet today can be compared to dragging a net across the surface of the ocean. While a great deal may be caught in the net, there is still a wealth of information that is deep, and therefore, missed. Deep Web sources store their content in searchable databases that only produce result dynamically in response to a direct request. In this paper, we proposed an automatic classification...

chapter

Fuzzy Clustering of Text Documents Using Naïve Bayesian Concept

Rishiraj Saha Roy, Durga Toshniwal

2010 International Conference on Recent Trends in Information, Telecommunication and Computing > 55 - 59

2010 International Conference on Recent Trends in Information, Telecommunication and Computing (ITC 2010)

Clustering organizes text in an unsupervised fashion. In this paper, we propose an algorithm for the fuzzy clustering of text documents using the naive Bayesian concept. Fuzzy clustering implies that the text documents are assigned to multiple clusters, ranked in descending order of probability. The Vector Space Model is used to represent our dataset as a term-weight matrix. In any natural language,...

chapter

An efficient ontology approach for organizing and mapping deep Web resources

Su Su Hlaing, Khin Hay Mar Saw Hla

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 1 > 235 - 240

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

To enable effective access to databases on the Web, it is critical to integrate the large scale deep Web sources. Therefore, schema matching is a basic problem in many database application domains, such as data integration, E-business, data warehousing, and semantic query processing. In current implementations, schema matching has some significant limitations until now. And also, there are some problems...

chapter

Web wrapper generation using tree alignment and transfer learning

Yingju Xia, Shu Zhang, Hao Yu

The 2nd International Conference on Software Engineering and Data Mining > 410 - 415

2nd International Conference on Software Engineering and Data Mining (SEDM 2010)

This paper studies the web wrapper generation for web pages of forum, blog and news web sites. While more and more web pages are dynamically generated using a common template populated with data from databases. This paper proposes a novel method that uses tree alignment and transfer learning method to generate the wrapper from this kind of web pages. We present a new tree alignment algorithm to find...

chapter

Research and Design of Internet Public Opinion Analysis System

Quanlong Guan, Saizhi Ye, Guoxiang Yao, Huanming Zhang, more

2009 IITA International Conference on Services Science, Management and Engineering > 173 - 177

2009 IITA International Conference on Services Science, Management and Engineering (SSME)

Internet is becoming a spreading platform for the public opinion. It is important to grasp the Internet public opinion in time and understand the trends of their opinion correctly. Text classification plays a fundamental role in a number of information management and retrieval tasks. But Web-page classification is much more difficult than pure-text classification due to a large variety of noisy information...

chapter

Design of Intrusion Detection System Based on Data Mining Algorithm

Changxin Song, Ke Ma

2009 International Conference on Signal Processing Systems > 370 - 373

2009 International Conference on Signal Processing Systems (ICSPS)

Internet technology has developed rapidly and both software system and hardware equipment have improved greatly in recent years. However, Internet brings people not only convenience but also great potential threats. Facts show that potential safety hazards exist from the emergence of internet. As a kind of effective information security safeguard measure, intrusion detection makes up for the defects...

chapter

Study on Personalized Service Technology Based on Deep Web Database

Rong Luo, Min-Xia Zhang, Yu-Xi Gong

2009 International Conference on Industrial Mechatronics and Automation > 268 - 270

2009 International Conference on Industrial Mechatronics and Automation (ICIMA 2009)

With the development of personalized service technology, analyze the status of deep Web database and personalized service, put forward one new method of personalized service according to user behavior currently in deep Web database, discuss the key technology. The experiment shows the system can conquer the limitation of personalized service too depend on the user behavior.

Data set:
ieee
Keywords:
DATABASES
CLUSTERING ALGORITHMS
INTERNET

Publication date

Set your own date range

Publication type

book (35)
article (1)

Keywords

DATA MINING (21)
PATTERN CLUSTERING (16)
ALGORITHM DESIGN AND ANALYSIS (14)
CLASSIFICATION ALGORITHMS (8)
FEATURE EXTRACTION (7)
INFORMATION RETRIEVAL (7)
ACCURACY (6)
WEB PAGES (6)
PARTITIONING ALGORITHMS (5)
SECURITY OF DATA (5)
SEMANTICS (5)
WEB SITES (5)
CLUSTERING (4)
CLUSTERING METHODS (4)
COMPUTERS (4)
DATABASE MANAGEMENT SYSTEMS (4)
ENTROPY (4)
QUERY PROCESSING (4)
SEARCH PROBLEMS (4)
SHAPE (4)
TEXT ANALYSIS (4)
TRAINING (4)
ANOMALY DETECTION (3)
ARTIFICIAL INTELLIGENCE (3)
CLASSIFICATION (3)
CLUSTER (3)
COMPUTATIONAL MODELING (3)
DATA MODELS (3)
DEEP WEB (3)
IMAGE RETRIEVAL (3)
IMAGE SEGMENTATION (3)
INDEXES (3)
INTRUSION DETECTION (3)
IP NETWORKS (3)
MERGING (3)
SECURITY (3)
SERVERS (3)
SIGNAL PROCESSING (3)
USA COUNCILS (3)
WORLD WIDE WEB (3)
BIOINFORMATICS (2)
BUSINESS (2)
CLUSTERING TECHNIQUES (2)
COMPUTER NETWORKS (2)
COMPUTER SCIENCE (2)
CONFERENCES (2)
CONTENT-BASED RETRIEVAL (2)
DOCUMENT HANDLING (2)
EDUCATIONAL INSTITUTIONS (2)
EQUATIONS (2)
EXPECTATION-MAXIMISATION ALGORITHM (2)
FRAUD (2)
FUZZY SET THEORY (2)
GAIN (2)
GOVERNMENT (2)
HTML (2)
IMAGE COLOR ANALYSIS (2)
IMAGE DATABASES (2)
IMAGE RESOLUTION (2)
INDEXING (2)
KNOWLEDGE ENGINEERING (2)
LEAD (2)
MATHEMATICAL MODEL (2)
MEDIA (2)
MONITORING (2)
MULTIMEDIA COMMUNICATION (2)
MUTUAL INFORMATION (2)
ONTOLOGIES (2)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (2)
PATTERN CLASSIFICATION (2)
PATTERN MATCHING (2)
PATTERN RECOGNITION (2)
PEDIATRICS (2)
PRESSES (2)
RESISTANCE (2)
REVIEWS (2)
ROBUSTNESS (2)
SCHEMA MATCHING (2)
SEARCH ENGINE (2)
SEARCH ENGINES (2)
SENSORS (2)
SIGNAL PROCESSING ALGORITHMS (2)
SOCIAL NETWORK SERVICES (2)
STREAMING MEDIA (2)
TESTING (2)
TEXT MINING (2)
VECTOR SPACE MODEL (2)
ACADEMIC PLAGIARISM (1)
ADAPTATION MODEL (1)
ADAPTIVE SYSTEMS (1)
ALIGNMENT (KEY WORDS) (1)
ANALYTICAL MODELS (1)
ANCHOR TEXT ANALYSIS (1)
ANOMALY CLASSIFICATION (1)
ANOMALY-BASED CLUSTERING (1)
APPROXIMATE ENTROPY (1)
APPROXIMATION METHODS (1)
more

INFONA - science communication portal

Search results

Effective Information Spreading in Social Networks

Text Document Clustering: The Application of Cluster Analysis to Textual Document

PFU: Profiling Forum users in online social networks, a knowledge driven data mining approach

Automated discovery of worldwide content servers infrastructure - the SNIFFER project

A Design and Implementation of Intrusion Detection System by Using Data Mining

Improved FCM Algorithm for Clustering on Web Usage Mining

iHelp: An Intelligent Online Helpdesk System

BOAT adaptive credit card fraud detection system

Algorithm of the Text Copy Detection Based on Topic Bag

EST Clustering in Large Dataset with MapReduce

A comparison of two suffix tree-based document clustering algorithms

Instance Discovery and Schema Matching with Applications to Biological Deep Web Data Integration

Research on Segmentation of E-shoppers Based on Clustering

Study of Deep Web Sources Classification Technology

Fuzzy Clustering of Text Documents Using Naïve Bayesian Concept

An efficient ontology approach for organizing and mapping deep Web resources

Web wrapper generation using tree alignment and transfer learning

Research and Design of Internet Public Opinion Analysis System

Design of Intrusion Detection System Based on Data Mining Algorithm

Study on Personalized Service Technology Based on Deep Web Database

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options