Advanced search

Advanced search in people

From:

To:

Items from 1 to 20 out of 57 results

chapter

Classifying Wikipedia entities into fine-grained classes

M Tkatchenko, A Ulanov, A Simanovsky

2011 IEEE 27th International Conference on Data Engineering Workshops > 212 - 217

2011 IEEE International Conference on Data Engineering Workshops (ICDEW 2011)

Recognition of named entities (people, companies, locations, etc) is an essential task of text analytics. We address the subproblem of this task, namely, named entity classification. We propose a novel approach that constructs an effective fine-grained named entity classifier. Its key highlights are semi-automatic training set construction from Wikipedia articles and additional feature selection....

chapter

Generating New Features Using Genetic Programming to Detect Link Spam

Li Shengen, Niu Xiaofei, Li Peiqi, Wang Lin

2011 Fourth International Conference on Intelligent Computation Technology and Automation > 1 > 135 - 138

2011 International Conference on Intelligent Computation Technology and Automation (ICICTA)

Link spam techniques can enable some pages to achieve higher-than-deserved rankings in the results of a search engine. They negatively affect the quality of search results. Classification methods can detect link spam. For classification problem, features play an important role. This paper proposes to derive new features using genetic programming from existing link-based features and use the new features...

chapter

Randomized tag recommendation in social networks and classification of spam posts

P P Ravindran, A Mishra, P Kesavan, S Mohanavalli

2010 IEEE International Workshop on: Business Applications of Social Network Analysis (BASNA) > 1 - 6

2010 IEEE International Workshop on: Business Applications of Social Network Analysis (BASNA)

Tag recommendation is an integral part of any bookmarking application. With the growing popularity in Web 2.0 usage, recommending tags is of utmost importance in enabling a user to perform bookmarking easily. An issue that most recommendation systems do not consider is that users have a tendency to choose from tags that are suggested to them, which might bias the true popular rankings of tags. In...

chapter

Flow classification using clustering and association rule mining

U K Chaudhary, I Papapanagiotou, M Devetsikiotis

2010 15th IEEE International Workshop on Computer Aided Modeling, Analysis and Design of Communication Links and Networks (CAMAD) > 76 - 80

2010 IEEE 15th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD 2010). 2010 15th IEEE International Workshop on Computer Aided Modeling, Analysis and Design of Communication Links and Networks

Traffic classification has become a crucial domain of research due to the rise in applications that are either encrypted or tend to change port consecutively. The challenge of flow classification is to determine the applications involved without any information on the payload. In this paper, our goal is to achieve a robust and reliable flow classification using data mining techniques. We propose a...

chapter

Analysis of Machine learning Techniques Used in Behavior-Based Malware Detection

I Firdausi, Charles Lim, A Erwin, A S Nugroho

2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies > 201 - 203

2010 Second International Conference on Advances in Computing, Control and Telecommunication Technologies (ACT 2010)

The increase of malware that are exploiting the Internet daily has become a serious threat. The manual heuristic inspection of malware analysis is no longer considered effective and efficient compared against the high spreading rate of malware. Hence, automated behavior-based malware detection using machine learning techniques is considered a profound solution. The behavior of each malware on an emulated...

chapter

Hybridization of Base Classifiers of Random Subsample Ensembles for Enhanced Performance in High Dimensional Feature Spaces

S Pathical, G Serpen

2010 Ninth International Conference on Machine Learning and Applications > 776 - 781

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

This paper presents a simulation-based empirical study of the performance profile of random sub sample ensembles with a hybrid mix of base learner composition in high dimensional feature spaces. The performance of hybrid random sub sample ensemble that uses a combination of C4.5, k-nearest neighbor (kNN) and naïve Bayes base learners is assessed through statistical testing in comparison to those...

chapter

Detecting Worms Using Data Mining Techniques: Learning in the Presence of Class Noise

I Ismail, M N Marsono, S M Nor

2010 Sixth International Conference on Signal-Image Technology and Internet Based Systems > 187 - 194

Sixth International Conference on Signal-Image Technology & Internet-Based Systems (SITIS 2010)

Worms are self-contained programs that spread over the Internet. Worms cause problems such as lost of information, information theft and denial-of-service attacks. The first part of the paper evaluates the detection of worms based on content classification by using all machine learning techniques available in WEKA data mining tools. Four most accurate and quite fast classifiers are identified for...

chapter

Early traffic identification using Bayesian networks

Rentao Gu, Hongxiang Wang, Yuefeng Ji

2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content > 564 - 568

2010 2nd IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2010)

Port-based or payload-based analysis is becoming difficult for accurate traffic identification when many applications use dynamic port numbers and encryption to avoid detection. In this paper we present an approach for online traffic classification relying on the observation of the first n packets of a flow. The packet size and inter-arrival times of the individual packets, rather than the statistic...

chapter

Using per-Source measurements to improve performance of Internet traffic classification

S Bregni, D Lucerna, C Rottondi, G Verticale

2010 IEEE Latin-American Conference on Communications > 1 - 5

2010 IEEE Latin-American Conference on Communications (LATINCOM)

Obfuscated and encrypted protocols hinder traffic classification by classical techniques such as port analysis or deep packet inspection. Therefore, there is growing interest for classification algorithms based on statistical analysis of the length of the first packets of flows. Most classifiers proposed in literature are based on machine learning techniques and consider each flow independently of...

chapter

Improving Performance of Network Traffic Classification Systems by Cleaning Training Data

Francesco Gargiulo, Carlo Sansone

2010 20th International Conference on Pattern Recognition > 2768 - 2771

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper we propose to apply an algorithm for finding out and cleaning mislabeled training sample in an adversarial learning context, in which a malicious user tries to camouflage training patterns in order to limit the classification system performance. In particular, we describe how this algorithm can be effectively applied to the problem of identifying HTTP traffic flowing through port TCP...

chapter

On Predictability of System Anomalies in Real World

Yongmin Tan, Xiaohui Gu

2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems > 133 - 140

18th IEEE/ACM International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS 2010)

As computer systems become increasingly complex, system anomalies have become major concerns in system management. In this paper, we present a comprehensive measurement study to quantify the predictability of different system anomalies. Online anomaly prediction allows the system to foresee impending anomalies so as to take proper actions to mitigate anomaly impact. Our anomaly prediction approach...

chapter

Text-Based Web Page Classification with Use of Visual Information

Vladimír Bartík

2010 International Conference on Advances in Social Networks Analysis and Mining > 416 - 420

2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2010)

As the number of pages on the web is permanently increasing, there is a need to classify pages into categories to facilitate indexing or searching them. In the method proposed here, we use both textual and visual information to find a suitable representation of web page content. In this paper, several term weights, based on TF or TF-IDF weighting are proposed. Modification is based on visual areas,...

chapter

An event ontology construction approach to web crime mining

Li Cunhua, Hu Yun, Zhong Zhaoman

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2441 - 2445

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Along with the rapid popularity of the Internet, crime information on the web is becoming increasingly rampant, and the majority of them are in the form of text. Because a lot of crime information in documents is described through events, event-based semantic technology can be used to study the patterns and trends of web-oriented crimes. In our research project on cyber crime mining, we construct...

chapter

Evaluating Application-Layer Classification Using a Machine Learning Technique over Different High Speed Networks

Sven Ubik, Petr Žejdl

2010 Fifth International Conference on Systems and Networks Communications > 387 - 391

Fifth International Conference on Systems and Networks Communications (ICSNC 2010)

Application-layer classification is needed in many monitoring applications. Classification based on machine learning offers an alternative method to methods based on port or payload based techniques. It is based on statistical features computed from network flows. Several works investigated the efficiency of machine learning techniques and found algorithms suitable for network classification. A classifier...

chapter

Assessing the Quality of Opinion Retrieval Systems

Giambattista Amati, Giuseppe Amodeo, Valerio Capozio, Giorgio Gambosi, more

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 3 > 235 - 238

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

Due to the complexity of topical opinion retrieval systems, standard measures, such as MAP or precision, do not fully succeed in assessing their performances. In this paper we introduce an evaluation framework based on artificially defined opinion classifiers. Using a Monte Carlo sampling, we perturb a relevance ranking by the outcomes of these classifiers and analyse how the opinion retrieval performance...

chapter

An Unsupervised Snippet-Based Sentiment Classification Method for Chinese Unknown Phrases without Using Reference Word Pairs

Ting-Chun Peng, Chia-Chun Shih

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 3 > 243 - 248

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

This work presents an unsupervised snippet-based sentiment classification method for Chinese unknown sentiment phrases, which is also applicable to other languages theoretically. Unlike existing Semantic Orientation (SO) methods, our proposed method does not require any Reference Word Pairs (RWPs) for predicting the sentiments of phrases. The results of preliminary experiments show that our proposed...

chapter

A New Weighted Ensemble Model for Detecting DoS Attack Streams

Jinghua Yan, Xiaochun Yun, Peng Zhang, Jianlong Tan, more

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 3 > 227 - 230

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

Recently, DoS (Denial of Service) detection has become more and more important in web security. In this paper, we argue that DoS attack can be taken as continuous data streams, and thus can be detected by using stream data mining methods. More specifically, we propose a new Weighted Ensemble learning model to detect the DoS attacks. The Weighted Ensemble model first trains base classifiers using different...

chapter

Internet traffic classification using a Hidden Markov Model

José Everardo Bessa Maia, Raimir Holanda Filho

2010 10th International Conference on Hybrid Intelligent Systems > 37 - 42

2010 10th International Conference on Hybrid Intelligent Systems (HIS 2010)

This paper examines the performance of a new Hidden Markov Model (HMM) structure used as the core of an Internet traffic classsifier and compares the results against other models present in the literature. Traffic modeling and classification find importance in many areas such as bandwidth management, traffic analysis, prediction and engineering, network planning, Quality of Service provisioning and...

chapter

Product review sentiment classification using parts of speech

P Tanawongsuwan

2010 3rd International Conference on Computer Science and Information Technology > 8 > 424 - 427

2010 3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010)

A prospective buyer interested in a particular item may find out information about the item from various sources, including product reviews. With interactive information sharing facilitated by Web 2.0, a lot of product reviews are available on the web. For a popular item with a large number of reviews, a prospective buyer could use some help in selecting only reviews of interest, such as, only positive...

chapter

Web classification using extraction and machine learning techniques

L M Yusuf, M S Othman, J Salim

2010 International Symposium on Information Technology > 2 > 765 - 770

2010 International Symposium on Information Technology (ITSim 2010)

Internet services that has become easier to access has contributed to the drastic increase in the number of web pages. This phenomenon has created new difficulties to internet users about retrieving the latest, relevant and excellent web information. This is due to the enormous contents of web information that have caused problems in the restructuring of web information. Thus, in order to ensure the...

Keywords:
ACCURACY
PATTERN CLASSIFICATION
INTERNET

Publication date

Set your own date range

Content availability

Available (54)
None (3)

Keywords

CLASSIFICATION ALGORITHMS (32)
DATA MINING (28)
SUPPORT VECTOR MACHINES (21)
TRAINING (21)
MACHINE LEARNING (19)
LEARNING (ARTIFICIAL INTELLIGENCE) (16)
TEXT ANALYSIS (13)
ALGORITHM DESIGN AND ANALYSIS (11)
WEB SITES (11)
FEATURE EXTRACTION (10)
TELECOMMUNICATION TRAFFIC (10)
TESTING (10)
DOCUMENT HANDLING (9)
TEXT CATEGORIZATION (9)
WEB PAGES (9)
CLASSIFICATION (8)
ARTIFICIAL NEURAL NETWORKS (7)
SUPPORT VECTOR MACHINE (7)
TRAFFIC CLASSIFICATION (7)
BAYES METHODS (6)
INFORMATION RETRIEVAL (6)
PROTOCOLS (6)
STATISTICAL ANALYSIS (6)
TRAINING DATA (6)
CLASSIFICATION TREE ANALYSIS (5)
COMPUTERS (5)
DECISION TREES (5)
ESTIMATION (5)
HTML (5)
MACHINE LEARNING ALGORITHMS (5)
MEDIA (5)
PATTERN CLUSTERING (5)
SEARCH ENGINES (5)
ARTIFICIAL INTELLIGENCE (4)
CLUSTERING ALGORITHMS (4)
EQUATIONS (4)
FEATURE SELECTION (4)
GAIN (4)
INDEXES (4)
IP NETWORKS (4)
NOISE (4)
PAYLOADS (4)
PEER-TO-PEER COMPUTING (4)
SVM (4)
TEXT CLASSIFICATION (4)
UNSUPERVISED LEARNING (4)
BAYESIAN METHODS (3)
BUILDINGS (3)
COMPUTER SCIENCE (3)
CONFERENCES (3)
DATA MODELS (3)
DATABASES (3)
DOCUMENT CLASSIFICATION (3)
GENETIC ALGORITHMS (3)
GOVERNMENT (3)
HISTORY (3)
INTERNET TRAFFIC CLASSIFICATION (3)
K-NEAREST NEIGHBOR (3)
KNOWLEDGE ENGINEERING (3)
LEAD (3)
MATHEMATICAL MODEL (3)
MEASUREMENT (3)
MULTIMEDIA COMMUNICATION (3)
MULTIMEDIA SYSTEMS (3)
MUSIC (3)
MUTUAL INFORMATION (3)
OPINION MINING (3)
PROBABILITY DENSITY FUNCTION (3)
RESISTANCE (3)
ROBUSTNESS (3)
SECURITY OF DATA (3)
SIGNAL PROCESSING (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
TELECOMMUNICATION COMPUTING (3)
TELECOMMUNICATION SECURITY (3)
TRANSFORMS (3)
USA COUNCILS (3)
WEB DOCUMENT (3)
WEB DOCUMENT CLASSIFICATION (3)
WEB MINING (3)
ANALYTICAL MODELS (2)
ART (2)
ARTIFICIAL NEURAL NETWORK (2)
ATTRIBUTE SELECTION (2)
AUTOMATIC CLASSIFICATION (2)
BOOK REVIEWS (2)
CLASSIFICATION METHOD (2)
CLASSIFICATION RULES (2)
CLUSTERING (2)
COMPUTATIONAL COMPLEXITY (2)
COMPUTER CRIME (2)
COMPUTER NETWORK MANAGEMENT (2)
CONTEXT (2)
CRYPTOGRAPHY (2)
CULTURAL DIFFERENCES (2)
DATA STREAM MINING (2)
DECISION TREE (2)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Classifying Wikipedia entities into fine-grained classes

Generating New Features Using Genetic Programming to Detect Link Spam

Randomized tag recommendation in social networks and classification of spam posts

Flow classification using clustering and association rule mining

Analysis of Machine learning Techniques Used in Behavior-Based Malware Detection

Hybridization of Base Classifiers of Random Subsample Ensembles for Enhanced Performance in High Dimensional Feature Spaces

Detecting Worms Using Data Mining Techniques: Learning in the Presence of Class Noise

Early traffic identification using Bayesian networks

Using per-Source measurements to improve performance of Internet traffic classification

Improving Performance of Network Traffic Classification Systems by Cleaning Training Data

On Predictability of System Anomalies in Real World

Text-Based Web Page Classification with Use of Visual Information

An event ontology construction approach to web crime mining

Evaluating Application-Layer Classification Using a Machine Learning Technique over Different High Speed Networks

Assessing the Quality of Opinion Retrieval Systems

An Unsupervised Snippet-Based Sentiment Classification Method for Chinese Unknown Phrases without Using Reference Word Pairs

A New Weighted Ensemble Model for Detecting DoS Attack Streams

Internet traffic classification using a Hidden Markov Model

Product review sentiment classification using parts of speech

Web classification using extraction and machine learning techniques

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options