2016 IEEE International Conference on Big Data (Big Data)

chapter

Sequential randomized matrix factorization for Gaussian processes

Shaunak D. Bopardikar, George S. Eskander Ekladious

2016 IEEE International Conference on Big Data (Big Data) > 3957 - 3959

The Gaussian process framework models a function as a stochastic process such that the training data results into a finite number of jointly Gaussian random variables, whose properties can then be used to infer the statistics (the mean and variance) of the function at test values for the input. The computation can be implemented in a batch setting, i.e., one-shot over the entire training data, or...

chapter

Labeling actors in multi-view social networks by integrating information from within and across multiple views

Ngot Bui, Thanh Le, Vasant Honavar

2016 IEEE International Conference on Big Data (Big Data) > 616 - 625

2016 IEEE International Conference on Big Data (Big Data)

Real world social networks typically consist of actors (individuals) that are linked to other actors or different types of objects via links of multiple types. Different types of relationships induce different views of the underlying social network. We consider the problem of labeling actors in such multi-view networks based on the connections among them. Given a social network in which only a subset...

chapter

The best of both worlds: Using automatic detection and limited human supervision to create a homogenous magnetic catalog spanning four solar cycles

A. Munoz-Jaramillo, Z. A. Werginz, J. P. Vargas-Acosta, M. D. DeLuca, more

2016 IEEE International Conference on Big Data (Big Data) > 3194 - 3203

2016 IEEE International Conference on Big Data (Big Data)

Bipolar magnetic regions (BMRs) are the corner-stone of solar variability. They are tracers of the large-scale magnetic processes that give rise to the solar cycle, shapers of the solar corona, building blocks of the large-scale solar magnetic field, and significant contributors to the free-energetic budget that gives rise to flares and coronal mass ejections. Surprisingly, no homogeneous catalog...

chapter

Memory access pattern based insider threat detection in big data systems

Santosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori

2016 IEEE International Conference on Big Data (Big Data) > 3625 - 3628

2016 IEEE International Conference on Big Data (Big Data)

Big data platforms like Hadoop and Spark are being widely adopted both by academia and industry. In this paper, we propose a runtime intrusion detection technique that understands and works according to the memory properties of such distributed compute platforms. The proposed method is based on runtime analysis of memory access patterns of tasks running on the slave nodes of a distributed compute...

chapter

Structure preserving dimension reduction with 2D images as predictors

Bo Zhang, Liwei Wang

2016 IEEE International Conference on Big Data (Big Data) > 3619 - 3624

2016 IEEE International Conference on Big Data (Big Data)

Nearly all existing dimension reduction methods on 2D matrix-valued image predictors are unsupervised or supervised without preserving matrix structure, which can result in loss of the structure-specific relation between the response and predictors. In this paper, we propose a kernel-based solution for supervised dimension reduction which preserves the matrix structure of the reduced predictors. This...

chapter

Deep topology classification: A new approach for massive graph classification

Stephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, more

2016 IEEE International Conference on Big Data (Big Data) > 3290 - 3297

2016 IEEE International Conference on Big Data (Big Data)

The classification of graphs is a key challenge within many scientific fields using graphs to represent data and is an active area of research. Graph classification can be critical in identifying and labelling unknown graphs within a dataset and has seen application across many scientific fields. Graph classification poses two distinct problems: the classification of elements within a graph and the...

chapter

Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter

Wanying Ding, Yue Zhang, Chaomei Chen, Xiaohua Hu

2016 IEEE International Conference on Big Data (Big Data) > 869 - 874

2016 IEEE International Conference on Big Data (Big Data)

Understanding ongoing topics and their evolutions in social media is of great importance. Although topic analysis is not a novel research question, social media environment has presented new challenges. First, with insufficient co-occurrence information, short text have undermined many word co-occurrence oriented topic models' applicability. Second, real time message streams make traditional discretized...

chapter

Efficient multiple scale kernel classifiers

Rocco Langone, Johan A. K. Suykens

2016 IEEE International Conference on Big Data (Big Data) > 128 - 133

2016 IEEE International Conference on Big Data (Big Data)

While kernel methods using a single Gaussian kernel have proven to be very successful for nonlinear classification, in case of learning problems with a more complex underlying structure it is often desirable to use a linear combination of kernels with different widths. To address this issue, this paper presents a classification algorithm based on a jointly convex constrained optimization formulation...

chapter

Community detection with partially observable links and node attributes

Xiaokai Wei, Bokai Cao, Weixiang Shao, Chun-Ta Lu, more

2016 IEEE International Conference on Big Data (Big Data) > 773 - 782

2016 IEEE International Conference on Big Data (Big Data)

Community detection has been an important task for social and information networks. Existing approaches usually assume the completeness of linkage and content information. However, the links and node attributes can usually be partially observable in many real-world networks. For example, users can specify their privacy settings to prevent non-friends from viewing their posts or connections. Such incompleteness...

chapter

Container-based virtualization for byte-addressable NVM data storage

Ellis R. Giles

2016 IEEE International Conference on Big Data (Big Data) > 2754 - 2763

2016 IEEE International Conference on Big Data (Big Data)

Container based virtualization is rapidly growing in popularity for cloud deployments and applications as a virtualization alternative due to the ease of deployment coupled with high-performance. Emerging byte-addressable, nonvolatile memories, commonly called Storage Class Memory or SCM, technologies are promising both byte-addressability and persistence near DRAM speeds operating on the main memory...

chapter

Efficient portfolio allocation with sparse volatility estimation for high-frequency financial data

Jian Zou, Chuqin Huang

2016 IEEE International Conference on Big Data (Big Data) > 2332 - 2341

2016 IEEE International Conference on Big Data (Big Data)

Traditionally, investors try to estimate short term portfolio volatility based on daily return. When tick-by-tick data are available, investors use different volatility estimators based on high-frequency data to evaluate the portfolio risk in the hope of outperforming those based on low-frequency data. In this paper, we optimize block realized kernel estimator in Hautsch et al. (2015) and propose...

chapter

Mini-apps for high performance data analysis

Sreenivas R. Sukumar, Michael A. Matheson, Ramakrishnan Kannan, Seung-Hwan Lim

2016 IEEE International Conference on Big Data (Big Data) > 1483 - 1492

2016 IEEE International Conference on Big Data (Big Data)

Scaling-up scientific data analysis and machine learning algorithms for data-driven discovery is a grand challenge that we face today. Despite the growing need for analysis from science domains that are generating ‘Big Data’ from instruments and simulations, building high-performance analytical workflows of data-intensive algorithms have been daunting because: (i) the ‘Big Data’ hardware and software...

chapter

Kernels for scalable data analysis in science: Towards an architecture-portable future

Sreenivas R. Sukumar, Ramakrishnan Kannan, Seung-Hwan Lim, Michael A. Matheson

2016 IEEE International Conference on Big Data (Big Data) > 1026 - 1031

2016 IEEE International Conference on Big Data (Big Data)

In this paper, we pose and address some of the unique challenges in the analysis of scientific Big Data on supercomputing platforms. Our approach identifies, implements and scales numerical kernels that are critical to the instantiation of theory-inspired analytic workflows on modern computing architectures. We present the benefits of scalable kernels towards constructing algorithms such as principal...

chapter

Sampling-based distributed Kernel mean matching using spark

Ahsanul Haque, Zhuoyi Wang, Swarup Chandra, Yupeng Gao, more

2016 IEEE International Conference on Big Data (Big Data) > 462 - 471

2016 IEEE International Conference on Big Data (Big Data)

Limited access to supervised information may forge scenarios in real-world data mining applications, where training and test data are interconnected by a covariate shift, i.e., having equal class conditional distribution with unequal covariate distribution. Traditional data mining techniques assume that both training and test data represent an identical distribution, therefore suffer in presence of...

chapter

A distributed approach to estimating sea port operational regions from lots of AIS data

Leonardo M. Millefiori, Dimitrios Zissis, Luca Cazzanti, Gianfranco Arcieri

2016 IEEE International Conference on Big Data (Big Data) > 1627 - 1632

2016 IEEE International Conference on Big Data (Big Data)

Seaports play a vital role in the global economy, as they operate as the connection corridors to all other modes of transport and as engines of growth for the wider region. But ports today are faced with numerous unique challenges and for them to remain competitive, significant investments are required. In support of greater transparency in policy making, decisions regarding investment need to be...

chapter

Multiple submodels parallel support vector machine on spark

Chang Liu, Bin Wu, Yi Yang, Zhihong Guo

2016 IEEE International Conference on Big Data (Big Data) > 945 - 950

2016 IEEE International Conference on Big Data (Big Data)

The Support Vector Machine (SVM) is a classical classification algorithm that has a wide range of application. With kernel function, SVM can dispose the datasets that are not linearly separable in their original feature space, making it more flexible in practical use compared with linear model. However, its complexity in training is an obstacle to large-scale dataset handling. This paper proposes...

chapter

Minimum density hyperplanes in the feature space

Katie R. Yates, Nicos G. Pavlidis

2016 IEEE International Conference on Big Data (Big Data) > 3613 - 3618

2016 IEEE International Conference on Big Data (Big Data)

We introduce a kernel formulation of the recently proposed minimum density hyperplane approach to clustering. This enables the identification of clusters that are not linearly separable in the input space by mapping them into a feature space. This mapping also extends the applicability of the minimum density hyperplane to datasets whose features are not necessarily continuous. The location of minimum...

INFONA - science communication portal

2016 IEEE International Conference on Big Data (Big Data)

Sequential randomized matrix factorization for Gaussian processes

Labeling actors in multi-view social networks by integrating information from within and across multiple views

The best of both worlds: Using automatic detection and limited human supervision to create a homogenous magnetic catalog spanning four solar cycles

Memory access pattern based insider threat detection in big data systems

Structure preserving dimension reduction with 2D images as predictors

Deep topology classification: A new approach for massive graph classification

Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter

Efficient multiple scale kernel classifiers

Community detection with partially observable links and node attributes

Container-based virtualization for byte-addressable NVM data storage

Efficient portfolio allocation with sparse volatility estimation for high-frequency financial data

Mini-apps for high performance data analysis

Kernels for scalable data analysis in science: Towards an architecture-portable future

Sampling-based distributed Kernel mean matching using spark

A distributed approach to estimating sea port operational regions from lots of AIS data

Multiple submodels parallel support vector machine on spark

Minimum density hyperplanes in the feature space

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 IEEE International Conference on Big Data (Big Data) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 IEEE International Conference on Big Data (Big Data)