In numerous IoT applications, a large number of sensors and data receivers send information to servers. The servers gather information that reaches a huge volume in a short time. In such cases, IoT applications face the challenge of managing, displaying, and extracting useful client information in real time from the whole data stored on the servers. Especially in critical situations, a client's database query can take...
Proteogenomics is an emerging field of systems biology research at the intersection of proteomics and genomics. Two high-throughput technologies, Mass Spectrometry (MS) for proteomics and Next Generation Sequencing (NGS) machines for genomics, are required to conduct proteogenomics studies. Independently, both MS and NGS technologies are afflicted with a data deluge, which creates problems of storage,...
In our Big Data era, data is being generated, collected and analyzed at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Recent studies have shown that poor quality data is prevalent in large databases and on the Web. Since poor quality data can have serious consequences on the results of data analyses, the importance of veracity, the fourth ‘V’ of...
One of the major challenges of the "Big Data" epoch is unstructured data mining. The problem arises from the storage of high-dimensional data that has no standard schema. While knowledge discovery in databases (KDD) algorithms were designed for data extraction, these algorithms are best suited to structured data stores. Moreover, today, at the data storage level, NoSQL databases have been deployed...
Reduction of the number of association rules in data mining is a very important issue in the field of socially-aware computing, in which big data needs to be manipulated. Existing schemes based on frequency of occurrence are not effective for relatively large datasets. In this paper we propose a tabular algorithm that assigns a weight to each rule for the removal of unimportant rules...
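The abstract is truncated before the algorithm's details, but the core idea of assigning a weight to each rule and discarding low-weight ones can be sketched. The weighting function below (support × confidence) is an illustrative assumption, not the paper's actual tabular algorithm.

```python
# Hypothetical sketch of weight-based association-rule pruning.
# The weight function (support * confidence) is an assumed placeholder,
# not the tabular algorithm described in the paper.
from dataclasses import dataclass


@dataclass
class Rule:
    antecedent: frozenset
    consequent: frozenset
    support: float
    confidence: float


def prune_rules(rules, threshold):
    """Keep only rules whose weight meets the threshold."""
    def weight(r):
        return r.support * r.confidence  # placeholder weighting scheme
    return [r for r in rules if weight(r) >= threshold]


rules = [
    Rule(frozenset({"milk"}), frozenset({"bread"}), 0.4, 0.8),
    Rule(frozenset({"eggs"}), frozenset({"beer"}), 0.05, 0.3),
]
kept = prune_rules(rules, threshold=0.1)
# Only the first rule survives: 0.4 * 0.8 = 0.32 >= 0.1, while 0.05 * 0.3 = 0.015 < 0.1
```

In practice the threshold and weighting scheme would be tuned to the dataset; the point is that a single pass over the rule set suffices once each rule carries a precomputed weight.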
There has been increasing interest in big data and big data security with the development of network technology and cloud computing. However, big data is not an entirely new technology but an extension of data mining. In this paper, we describe the background of big data, data mining, and big data features, and propose an attribute selection methodology for protecting the value of big data. Extracting...
Today a huge amount of geospatial data is being created, collected, and used more than ever before. The ever-increasing observations and measurements from geo-sensor networks, satellite imagery, point clouds from laser scanning, and geospatial data from Location-Based Services (LBS) and location-based social networks have become a serious challenge for data management and analysis systems. Traditionally,...
This paper introduces a service selection model that takes service location into account. The location of a service represents its position in the network, which determines the transmission cost of calling this service in the composite service. The more concentrated the invoked services are, the less transmission time the composite service costs. Meanwhile, the increasingly popular big data...
As time goes on, a running information system produces massive data. The previous hardware can no longer manage this much larger volume of data. In order to upgrade the system, several main requirements must be considered: sufficient storage space, high reliability, high performance, and relatively low cost. As an open-source J2EE framework, SSH2 is widely used by developers to...
Recent developments in network, mining, and data storage technology have heightened the need for big data and big data security. In this paper, we focus on the characteristic of big data that values the analysis more than the data itself. We express the relationships between attributes using nodes and edges. Through this, we propose a big data security hardening methodology by selecting...
Businesses and governments exploit big data without regard for issues of legality, data quality, disparate data meanings, and process quality. This often results in poor decisions, with individuals bearing the greatest risk. The threats harbored by big data extend far beyond the individual, however, and call for new legal structures, business processes, and concepts such as a Private Data Commons...
Big data is an emerging phenomenon characterized by the three Vs: volume, velocity, and variety. The volume of data has increased from terabytes to petabytes and is encroaching on exabytes. Some pundits are suggesting that zettabytes (10²¹ bytes) are reachable within the next several years. Velocity is concerned with not only how fast we accumulate data, but also how fast some of the data that we already...
MapReduce has shown remarkable vitality and has penetrated both academia and industry in recent years. MapReduce can be used not only as an ETL tool; it can do much more. The technique has been applied to SQL summation, OLAP, data mining, machine learning, information retrieval, multimedia data processing, science data processing, etc. Basically, MapReduce is a general-purpose parallel computing framework...
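The programming model underlying this framework can be sketched with the classic word-count example below. This is a minimal in-process illustration of the map/shuffle/reduce phases; real MapReduce frameworks distribute these steps across a cluster.

```python
# Minimal in-process sketch of the MapReduce programming model (word count).
# Real frameworks run map and reduce tasks in parallel across many machines.
from collections import defaultdict


def map_phase(doc):
    # Emit an intermediate (key, value) pair per word.
    return [(word, 1) for word in doc.split()]


def shuffle(pairs):
    # Group all intermediate values by key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    # Aggregate each key's values into a final result.
    return {key: sum(values) for key, values in groups.items()}


docs = ["big data big", "data mining"]
pairs = [p for doc in docs for p in map_phase(doc)]
counts = reduce_phase(shuffle(pairs))
# counts == {"big": 2, "data": 2, "mining": 1}
```

Because the map and reduce functions are pure and operate on independent key groups, the same user code scales from this toy example to cluster-sized inputs.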
This paper examines the needs of emerging applications of High Performance Computing by the Humanities, Arts, and Social Sciences (HASS) disciplines and presents a vision for how the current academic HPC environment could be adapted to better serve this new class of “big data” research.
A large number of stakeholders are often involved in Smart Grid projects. Each partner has its own way of storing, representing, and accessing its data. Integrated data storage and a joint online analytical mining infrastructure are needed to limit the amount of duplicated work and to raise the overall security of the system. The proposed infrastructure is composed of standard application software...
This paper complements our previous results in the context of effectively and efficiently designing Parallel Relational Data Warehouses (PRDW) over heterogeneous database clusters, represented by the proposal of a methodology called Fragmentation & Allocation (F&A). The main merit of F&A is that of combining the fragmentation and the allocation phases simultaneously,...
In this paper, we introduce ReTSO, a reliable and efficient design for transactional support in large-scale storage systems. ReTSO uses a centralized scheme and implements snapshot isolation, a property that guarantees that the read operations of a transaction see a consistent snapshot of the stored data. The centralized scheme of ReTSO enables a lock-free commit algorithm that prevents unreleased locks...
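The snapshot-isolation guarantee described above can be illustrated with a minimal centralized commit check using first-committer-wins on write-write conflicts. This sketch shows the general SI property only; it is not ReTSO's actual lock-free algorithm, whose details are truncated here.

```python
# Minimal sketch of a centralized snapshot-isolation commit check
# (first-committer-wins on write-write conflicts). Illustrates the SI
# property in general, not ReTSO's actual implementation.
class TimestampOracle:
    def __init__(self):
        self.clock = 0
        self.last_commit = {}  # key -> timestamp of its latest commit

    def begin(self):
        # A transaction's start timestamp defines its snapshot.
        self.clock += 1
        return self.clock

    def try_commit(self, start_ts, write_set):
        # Abort if any written key was committed after our snapshot began.
        for key in write_set:
            if self.last_commit.get(key, 0) > start_ts:
                return None  # write-write conflict -> abort
        self.clock += 1
        commit_ts = self.clock
        for key in write_set:
            self.last_commit[key] = commit_ts
        return commit_ts


oracle = TimestampOracle()
t1 = oracle.begin()
t2 = oracle.begin()
first = oracle.try_commit(t1, {"x"})   # t1 commits its write to "x"
second = oracle.try_commit(t2, {"x"})  # t2 conflicts on "x" and aborts (None)
```

Because conflict detection happens only at commit time against committed timestamps, no transaction ever holds a lock during its execution, which is the motivation for a lock-free commit path.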
Fast access to clinical data is necessary when performing real-time predictions of medical events. A clinical data repository (CDR) therefore requires an efficient format for storing data so it can meet the access demands of prediction algorithms for clinical decision support. We have developed a new hybrid entity–attribute–value (EAV) storage format for CDRs that is compared with the common simple...
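The entity–attribute–value layout mentioned above can be sketched as follows. The paper's hybrid format is truncated here, so this shows only the basic EAV idea: one (entity, attribute, value) row per fact, which lets patients carry arbitrary, sparse sets of clinical attributes without schema changes; the attribute names are illustrative.

```python
# Illustrative sketch of the basic entity-attribute-value (EAV) layout
# used by clinical data repositories. Attribute names are hypothetical.
eav_rows = [
    ("patient_001", "heart_rate", 72),
    ("patient_001", "sys_bp", 118),
    ("patient_002", "heart_rate", 95),
]


def pivot(rows):
    """Pivot EAV rows into one record (dict) per entity for faster access."""
    records = {}
    for entity, attribute, value in rows:
        records.setdefault(entity, {})[attribute] = value
    return records


records = pivot(eav_rows)
# records["patient_001"] == {"heart_rate": 72, "sys_bp": 118}
```

The trade-off motivating hybrid formats is visible even in this sketch: pure EAV is flexible but requires a pivot like the one above before row-oriented prediction algorithms can consume the data efficiently.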