William Arcand

chapter

LLMapReduce: Multi-level map-reduce for high performance data analysis

Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, more

2016 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 8

2016 IEEE High Performance Extreme Computing Conference (HPEC)

The map-reduce parallel programming model has become extremely popular in the big data community. Many big data workloads can benefit from the enhanced performance offered by supercomputers. LLMapReduce provides the familiar map-reduce parallel programming model to big data users running on a supercomputer. LLMapReduce dramatically simplifies map-reduce programming by providing simple parallel programming...

chapter

Benchmarking SciDB data import on HPC systems

Siddharth Samsi, Laura Brattain, William Arcand, David Bestor, more

2016 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 5

2016 IEEE High Performance Extreme Computing Conference (HPEC)

SciDB is a scalable, computational database management system that uses an array model for data storage. The array data model of SciDB makes it ideally suited for storing and managing large amounts of imaging data. SciDB is designed to support advanced analytics in database, thus reducing the need for extracting data for analysis. It is designed to be massively parallel and can run on commodity hardware...

chapter

Enabling on-demand database computing with MIT SuperCloud database management system

Andrew Prout, Jeremy Kepner, Peter Michaleas, William Arcand, more

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2015 IEEE High Performance Extreme Computing Conference (HPEC)

The MIT SuperCloud database management system allows for rapid creation and flexible execution of a variety of the latest scientific databases, including Apache Accumulo and SciDB. It is designed to permit these databases to run on a High Performance Computing Cluster (HPCC) platform as seamlessly as any other HPCC job. It ensures the seamless migration of the databases to the resources assigned by...

chapter

D4M: Bringing associative arrays to database engines

Vijay Gadepally, Jeremy Kepner, William Arcand, David Bestor, more

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2015 IEEE High Performance Extreme Computing Conference (HPEC)

The ability to collect and analyze large amounts of data is a growing problem within the scientific community. The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity and variety. Numerous tools exist that allow users to store, query and index these massive quantities of data. Each storage or database engine comes with the promise...

chapter

D4M 2.0 schema: A general purpose high performance schema for the Accumulo database

Jeremy Kepner, Christian Anderson, William Arcand, David Bestor, more

2013 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2013 IEEE High Performance Extreme Computing Conference (HPEC)

Non-traditional, relaxed consistency, triple store databases are the backbone of many web companies (e.g., Google Big Table, Amazon Dynamo, and Facebook Cassandra). The Apache Accumulo database is a high performance open source relaxed consistency database that is widely used for government applications. Obtaining the full benefits of Accumulo requires using novel schemas. The Dynamic Distributed...

chapter

Dynamic distributed dimensional data model (D4M) database and computation system

Jeremy Kepner, William Arcand, William Bergeron, Nadya Bliss, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5349 - 5352

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

A crucial element of large web companies is their ability to collect and analyze massive amounts of data. Tuple store databases are a key enabling technology employed by many of these companies (e.g., Google Big Table and Amazon Dynamo). Tuple stores are highly scalable and run on commodity clusters, but lack interfaces to support efficient development of mathematically based analytics. D4M (Dynamic...

INFONA - science communication portal

Search results for: William Arcand

LLMapReduce: Multi-level map-reduce for high performance data analysis

Benchmarking SciDB data import on HPC systems

Enabling on-demand database computing with MIT SuperCloud database management system

D4M: Bringing associative arrays to database engines

D4M 2.0 schema: A general purpose high performance schema for the Accumulo database

Dynamic distributed dimensional data model (D4M) database and computation system

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: William Arcand

LLMapReduce: Multi-level map-reduce for high performance data analysis

Benchmarking SciDB data import on HPC systems

Enabling on-demand database computing with MIT SuperCloud database management system

D4M: Bringing associative arrays to database engines

D4M 2.0 schema: A general purpose high performance schema for the Accumulo database

Dynamic distributed dimensional data model (D4M) database and computation system

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options