Simon Fong

chapter

Kestrel-Based Search Algorithm for Association Rule Mining and Classification of Frequently Changed Items

Israel Edem Agbehadji, Richard Millham, Simon Fong

2016 8th International Conference on Computational Intelligence and Communication Networks (CICN) > 356 - 360

2016 8th International Conference on Computational Intelligence and Communication Networks (CICN)

Nature inspired approaches have been used in the design of computer solutions for real life problems. These computer solutions take the form of algorithms which characterize specific behaviour of animals or birds in their natural habitat. The two bio-inspired computational concepts in modern times includes evolutionary and swarm intelligence. A novel introduction to the bio-inspired computational...

chapter

Lightweight Feature Selection Methods Based on Standardized Measure of Dispersion for Mining Big Data

Simon Fong, Robert P. Biuk-Aghai, Yain-Whar Si

2016 IEEE International Conference on Computer and Information Technology (CIT) > 553 - 559

2016 IEEE International Conference on Computer and Information Technology (CIT)

Big data analytics is emerging as an important research field nowadays with many technical challenges that confront both commercial IT deployment and big data research communities. One of the inherent problems of big data is the curse of dimensionality. Modern data are described with many attributes and stored with high dimensions. In data analytics, feature selection has been popularly used to lighten...

chapter

Elephant search algorithm on data clustering

Zhonghuan Tian, Simon Fong, Raymond Wong, Richard Millham

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 787 - 793

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

Data clustering is one of the most popular branches in machine learning and data analysis. Partitioning-based type of clustering algorithms, such as K-means, is prone to the problem of producing a set of clusters that is far from perfect due to its probabilistic nature. The clustering process starts with some random partitions at the beginning, and it tries to improve the partitions progressively...

chapter

Optimizing SMOTE by Metaheuristics with Neural Network and Decision Tree

Jinyan Li, Simon Fong, Yan Zhuang

2015 3rd International Symposium on Computational and Business Intelligence (ISCBI) > 26 - 32

2015 3rd International Symposium on Computational and Business Intelligence (ISCBI)

SMOTE (Synthetic minority over-sampling technique) is a commonly used over-sampling technique to subside the imbalanced dataset problem. Traditionally SMOTE has two key important parameters, one is to control the amount of over-sampling, and the other specifies the area of the nearest neighbors. These two parameters are arbitrarily chosen by user. So there are no universally best default values. In...

chapter

A Scalable Data Stream Mining Methodology: Stream-Based Holistic Analytics and Reasoning in Parallel

Simon Fong, Yan Zhuang, Raymond Wong, Sabah Mohammed

2014 2nd International Symposium on Computational and Business Intelligence > 110 - 115

2014 2nd International Symposium on Computational and Business Intelligence (ISCBI)

Big Data though it is a hype up-springing many technical challenges that confront both academic research communities and commercial IT deployment, the root sources of Big Data are founded on data streams. It is generally known that data which are sourced from data streams accumulate continuously making traditional batch-based model induction algorithms infeasible for real-time data mining or high-speed...

chapter

Hierarchical Classification in Text Mining for Sentiment Analysis

Jinyan Li, Simon Fong, Yan Zhuang, Richard Khoury

2014 International Conference on Soft Computing and Machine Intelligence > 46 - 51

2014 International Conference on Soft Computing & Machine Intelligence (ISCMI)

Sentiment analysis in text mining is known to be a challenging task. Sentiment is subtly reflected by the tone, affective state or emotion of a writer's expression in words. Conventional text mining techniques which are based on keyword frequency counting usually run short of accurately detecting such subjective information implied in the text. In this paper we evaluated several popular classification...

chapter

Swarm Search for Feature Selection in Classification

Simon Fong, Xin-She Yang, Suash Deb

2013 IEEE 16th International Conference on Computational Science and Engineering > 902 - 909

2013 IEEE 16th International Conference on Computational Science and Engineering (CSE)

Finding an appropriate set of features from data of high dimensionality for building an accurate classification model is a well-known NP-hard computational problem. Unfortunately in data mining, some big data are not only big in volume but they are described by a large number of features. Many feature subset selection algorithms have been proposed in the past, they are nevertheless far from perfect...

chapter

Improving the Accuracy of Incremental Decision Tree Learning Algorithm via Loss Function

Hang Yang, Simon Fong

2013 IEEE 16th International Conference on Computational Science and Engineering > 910 - 916

2013 IEEE 16th International Conference on Computational Science and Engineering (CSE)

Hoeffding's bound (HB) has been widely used for node splitting in incremental decision tree algorithms. Many decision-tree algorithms adopt a sliding-window technique to detect concept drift when mining changing data streams. This paper presents a novel node-splitting approach that replaces the traditional HB with a new measure. The new measure is derived from a loss function applied in a cache-based...

chapter

Not every friend on a social network can be trusted: Classifying imposters using decision trees

Simon Fong, Yan Zhuang, Jiaying He

The First International Conference on Future Generation Communication Technologies > 58 - 63

2012 International Conference on Future Generation Communication Technology (FGCT)

There is an alarming news recently revealed on media that 8.7 percent of users on Facebook are fake; this amounts to more than 83 million accounts worldwide. Consequently this huge number of fake users whose profiles were unverified translates to the potential dangers ranging from espionage, identity thievery, information misuse and loophole to privacy compromise to the users and their families. Nowadays...

chapter

Opinion mining over twitterspace: Classifying tweets programmatically using the R approach

Jinan Fiaidhi, Osama Mohammed, Sabah Mohammed, Simon Fong, more

Seventh International Conference on Digital Information Management (ICDIM 2012) > 313 - 319

2012 Seventh International Conference on Digital Information Management (ICDIM)

Today the channels for expressing opinions seem to increase daily. When these opinions are relevant to a company, they are important sources of business insight, whether they represent critical intelligence about a customer's defection risk, the impact of an influential reviewer on other people's purchase decisions, or early feedback on product releases, company news or competitors. Capturing and...

chapter

Mining twitterspace for information: Classifying sentiments programmatically using Java

Jinan Fiaidhi, Osama Mohammed, Sabah Mohammed, Simon Fong, more

Seventh International Conference on Digital Information Management (ICDIM 2012) > 303 - 308

2012 Seventh International Conference on Digital Information Management (ICDIM)

People increasingly use Twitter to share advice, opinions, news, moods, concerns, facts, rumors, and everything else imaginable. Much of that data is public and available for mining. However, classifying automatically the sentiment of the Twitter messages into either positive or negative with respect to a query term represents a new research challenge. Variety of approaches that use natural language...

chapter

Bayesian based subgroup discovery

Talha Anwar, Sohail Asghar, Simon Fong

2011 Sixth International Conference on Digital Information Management > 154 - 161

2011 Sixth International Conference on Digital Information Management (ICDIM)

Data Mining is concerned with extraction of interesting patterns or knowledge from huge amounts of Data. Generally data mining tasks are either predictive or descriptive. Classification falls under predictive induction while clustering and association rule mining fall under descriptive induction. Subgroup discovery is a task at the intersection of supervised learning and descriptive induction. In...

INFONA - science communication portal

Search results for: Simon Fong

Kestrel-Based Search Algorithm for Association Rule Mining and Classification of Frequently Changed Items

Lightweight Feature Selection Methods Based on Standardized Measure of Dispersion for Mining Big Data

Elephant search algorithm on data clustering

Optimizing SMOTE by Metaheuristics with Neural Network and Decision Tree

A Scalable Data Stream Mining Methodology: Stream-Based Holistic Analytics and Reasoning in Parallel

Hierarchical Classification in Text Mining for Sentiment Analysis

Swarm Search for Feature Selection in Classification

Improving the Accuracy of Incremental Decision Tree Learning Algorithm via Loss Function

Not every friend on a social network can be trusted: Classifying imposters using decision trees

Opinion mining over twitterspace: Classifying tweets programmatically using the R approach

Mining twitterspace for information: Classifying sentiments programmatically using Java

Bayesian based subgroup discovery

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Simon Fong

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options