2017 IEEE High Performance Extreme Computing Conference (HPEC)

Items from 61 to 80 out of 81 results

chapter

Mixed data layout kernels for vectorized complex arithmetic

Doru T. Popovici, Franz Franchetti, Tze Meng Low

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Implementing complex arithmetic routines with Single Instruction Multiple Data (SIMD) instructions requires the use of instructions that are usually not found in their real arithmetic counter-parts. These instructions, such as shuffles and addsub, are often bottlenecks for many complex arithmetic kernels as modern architectures usually can perform more real arithmetic operations than execute instructions...

chapter

A quantitative and qualitative analysis of tensor decompositions on spatiotemporal data

Tom Henretty, Muthu Baskaran, James Ezick, David Bruns-Smith, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

With the recent explosion of systems capable of generating and storing large quantities of GPS data, there is an opportunity to develop novel techniques for analyzing and gaining meaningful insights into this spatiotemporal data. In this paper we examine the application of tensor decompositions, a high-dimensional data analysis technique, to georeferenced data sets. Guidance is provided on fitting...

chapter

A linear algebra approach to fast DNA mixture analysis using GPUs

Siddharth Samsi, Brian Helfer, Jeremy Kepner, Albert Reuther, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Analysis of DNA samples is an important tool in forensics, and the speed of analysis can impact investigations. Comparison of DNA sequences is based on the analysis of short tandem repeats (STRs), which are short DNA sequences of 2–5 base pairs. Current forensics approaches use 20 STR loci for analysis. The use of single nucleotide polymorphisms (SNPs) has utility for analysis of complex DNA mixtures...

chapter

Memory-efficient parallel tensor decompositions

Muthu Baskaran, Tom Henretty, Benoit Pradelle, M. Harper Langston, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Tensor decompositions are a powerful technique for enabling comprehensive and complete analysis of real-world data. Data analysis through tensor decompositions involves intensive computations over large-scale irregular sparse data. Optimizing the execution of such data intensive computations is key to reducing the time-to-solution (or response time) in real-world data analysis applications. As high-performance...

chapter

[Copyright notice]

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Presents the copyright information for the conference. May include reprint permission information.

chapter

Investigating TI KeyStone II and quad-core ARM Cortex-A53 architectures for on-board space processing

Benjamin Schwaller, Barath Ramesh, Alan D. George

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Future space missions require reliable architectures with higher performance and lower power consumption. Exploring new architectures worthy of undergoing the expensive and time-consuming process of radiation hardening is critical for this endeavor. Two such architectures are the Texas Instruments KeyStone II octal-core processor and the ARM® Cortex®-A53 (ARMv8) quad-core CPU. DSPs have been proven...

chapter

MIT SuperCloud portal workspace: Enabling HPC web application deployment

Andrew Prout, William Arcand, David Bestor, Bill Bergeron, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2017 IEEE High Performance Extreme Computing Conference (HPEC)

The MIT SuperCloud Portal Workspace enables the secure exposure of web services running on high performance computing (HPC) systems. The portal allows users to run any web application as an HPC job and access it from their workstation while providing authentication, encryption, and access control at the system level to prevent unintended access. This capability permits users to seamlessly utilize...

chapter

Algorithm and hardware co-optimized solution for large SpMV problems

Fazle Sadi, Larry Fileggi, Franz Franchetti

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Sparse Matrix-Vector multiplication (SpMV) is a fundamental kernel for many scientific and engineering applications. However, SpMV performance and efficiency are poor on commercial of-the-shelf (COTS) architectures, specially when the data size exceeds on-chip memory or last level cache (LLC). In this work we present an algorithm co-optimized hardware accelerator for large SpMV problems. We start...

chapter

A distributed algorithm for the efficient computation of the unified model of social influence on massive datasets

Alex Popa, Marc Frincu, Charalampos Chelmis

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Online social networks offer a rich data source for analyzing diffusion processes including rumor and viral spreading in communities. While many models exist, a unified model which enables analytical computation of complex, nonlinear phenomena while considering multiple factors was only recently proposed. We design an optimized implementation of the unified model of influence for vertex centric graph...

chapter

Accelerating big data applications using lightweight virtualization framework on enterprise cloud

Janki Bhimani, Zhengyu Yang, Miriam Leeser, Ningfang Mi

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Hypervisor-based virtualization technology has been successfully used to deploy high-performance and scalable infrastructure for Hadoop, and now Spark applications. Container-based virtualization techniques are becoming an important option, which is increasingly used due to their lightweight operation and better scaling when compared to Virtual Machines (VM). With containerization techniques such...

chapter

Triangle counting via vectorized set intersection

Shahir Mowlaei

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 5

2017 IEEE High Performance Extreme Computing Conference (HPEC)

In this paper we propose a vectorized sorted set intersection approach for the task of counting the exact number of triangles of a graph on CPU cores. The computation is factorized into reordering and counting kernels where the reordering kernel builds upon the Reverse Cuthill-McKee heuristic.

chapter

WCET analysis of the shared data cache in integrated CPU-GPU architectures

Yijie Huangfu, Wei Zhang

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

By taking the advantages of both CPU and GPU as well as the shared DRAM and cache, the integrated CPU-GPU architecture has the potential to boost the performance for a variety of applications, including real-time applications as well. However, before being applied to the hard real-time and safety-critical applications, the time-predictability of the integrated CPU-GPU architecture needs to be studied...

chapter

Leakage energy reduction for hard real-time caches

Yijie Huangfu, Wei Zhang

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Cache leakage reduction techniques usually compromise time predictability, which are not desirable for real-time systems. In this work, we extend the cache decay and drowsy cache techniques within the hardware-based Performance Enhancement Guaranteed Cache (PEG-C) architecture. The PEG-C can dynamically monitor the performance penalties caused by using leakage energy reduction techniques to ensure...

chapter

FastID: Extremely fast forensic DNA comparisons

Darrell O. Ricke

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 4

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Rapid analysis of DNA forensic samples can have a critical impact on time sensitive investigations. Analysis of forensic DNA samples by massively parallel sequencing is creating the next gold standard for DNA forensic analysis. This technology enables the expansion of forensic profiles from the current 20 short tandem repeat (STR) loci to tens of thousands of single nucleotide polymorphism (SNP) loci...

chapter

Exploring optimizations on shared-memory platforms for parallel triangle counting algorithms

Ancy Sarah Tom, Narayanan Sundaram, Nesreen K. Ahmed, Shaden Smith, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

The widespread use of graphs to model large scale real-world data brings with it the need for fast graph analytics. In this paper, we explore the problem of triangle counting, a fundamental graph-analytic operation, on shared-memory platforms. Existing triangle counting implementations do not effectively utilize the key characteristics of large sparse graphs for tuning their algorithms for performance...

chapter

Towards numerical benchmark for half-precision floating point arithmetic

Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, Jack Dongarra

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 5

2017 IEEE High Performance Extreme Computing Conference (HPEC)

With NVIDA Tegra Jetson X1 and Pascal P100 GPUs, NVIDIA introduced hardware-based computation on FP16 numbers also called half-precision arithmetic. In this talk, we will introduce the steps required to build a viable benchmark for this new arithmetic format. This will include the connections to established IEEE floating point standards and existing HPC benchmarks. The discussion will focus on performance...

chapter

An ensemble framework for detecting community changes in dynamic networks

Timothy La Fond, Geoffrey Sanders, Christine Klymko, Van Emden Henson

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Dynamic networks, especially those representing social networks, undergo constant evolution of their community structure over time. Nodes can migrate between different communities, communities can split into multiple new communities, communities can merge together, etc. In order to represent dynamic networks with evolving communities it is essential to use a dynamic model rather than a static one...

chapter

TriX: Triangle counting at extreme scale

Yang Hu, Pradeep Kumar, Guy Swope, H. Howie Huang

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Triangle counting is widely used in many applications including spam detection, link recommendation, and social network analysis. The DARPA Graph Challenge seeks a scalable solution for triangle counting on big graphs. In this paper we present TriX, a scalable triangle counting framework, which is comprised of a 2-D graph partition strategy and a binary search based intersection algorithm designed...

chapter

Static graph challenge: Subgraph isomorphism

Siddharth Samsi, Vijay Gadepally, Michael Hurley, Michael Jones, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2017 IEEE High Performance Extreme Computing Conference (HPEC)

The rise of graph analytic systems has created a need for ways to measure and compare the capabilities of these systems. Graph analytics present unique scalability difficulties. The machine learning, high performance computing, and visual analytics communities have wrestled with these difficulties for decades and developed methodologies for creating challenges to move these communities forward. The...

chapter

Efficient and accurate Word2Vec implementations in GPU and shared-memory multicore architectures

Trevor M. Simonton, Gita Alaghband

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Word2Vec is a popular set of machine learning algorithms that use a neural network to generate dense vector representations of words. These vectors have proven to be useful in a variety of machine learning tasks. In this work, we propose new methods to increase the speed of the Word2Vec skip gram with hierarchical softmax architecture on multi-core shared memory CPU systems, and on modern NVIDIA GPUs...

Publication date

Set your own date range

Content availability

Available (80)
None (1)

Keywords

ALGORITHM DESIGN AND ANALYSIS (18)
GRAPHICS PROCESSING UNITS (15)
KERNEL (15)
SPARSE MATRICES (13)
HARDWARE (11)
LIBRARIES (11)
BANDWIDTH (10)
BENCHMARK TESTING (9)
COMPUTER ARCHITECTURE (9)
COMPUTATIONAL MODELING (8)
PARTITIONING ALGORITHMS (8)
ARRAYS (7)
STANDARDS (7)
CLUSTERING ALGORITHMS (6)
HEURISTIC ALGORITHMS (6)
INSTRUCTION SETS (6)
MONITORING (6)
OPTIMIZATION (6)
RANDOM ACCESS MEMORY (6)
APPROXIMATION ALGORITHMS (5)
DATA MODELS (5)
MATRIX DECOMPOSITION (5)
PARALLEL PROCESSING (5)
REGISTERS (5)
SERVERS (5)
THREE-DIMENSIONAL DISPLAYS (5)
CONTAINERS (4)
DATA STRUCTURES (4)
DATABASES (4)
FIELD PROGRAMMABLE GATE ARRAYS (4)
INDEXES (4)
LAYOUT (4)
MEASUREMENT (4)
MEMORY MANAGEMENT (4)
MULTICORE PROCESSING (4)
PERFORMANCE EVALUATION (4)
RESOURCE MANAGEMENT (4)
RUNTIME (4)
TOOLS (4)
BIG DATA (3)
DISTRIBUTED DATABASES (3)
ENGINES (3)
GPU (3)
LINEAR ALGEBRA (3)
MATLAB (3)
PROGRAM PROCESSORS (3)
REAL-TIME SYSTEMS (3)
SCALABILITY (3)
SOCIOLOGY (3)
SYMMETRIC MATRICES (3)
SYSTEM-ON-CHIP (3)
TENSILE STRESS (3)
THROUGHPUT (3)
ACCELERATION (2)
ANALYTICAL MODELS (2)
APACHE SPARK (2)
C++ LANGUAGES (2)
CLOUD COMPUTING (2)
COMPUTERS (2)
CONVERGENCE (2)
DATA ANALYSIS (2)
DATABASE SYSTEMS (2)
DNA (2)
DOCKER (2)
ENCODING (2)
FORCE (2)
FORENSICS (2)
FPGA (2)
HIGH PERFORMANCE COMPUTING (2)
HISTORY (2)
IMAGE COLOR ANALYSIS (2)
IMAGE EDGE DETECTION (2)
INFERENCE ALGORITHMS (2)
ITERATIVE ALGORITHMS (2)
MATHEMATICAL MODEL (2)
MATRICES (2)
METADATA (2)
MIDDLEWARE (2)
NEURAL NETWORKS (2)
PARALLEL ALGORITHMS (2)
PARALLEL COMPUTING (2)
PIPELINES (2)
PORTS (COMPUTERS) (2)
SOFTWARE (2)
SPARKS (2)
STATISTICS (2)
STOCHASTIC PROCESSES (2)
SYNCHRONIZATION (2)
TIME COMPLEXITY (2)
TIMING (2)
TRAINING (2)
UNCERTAINTY (2)
3D CHIPS (1)
3D PHYSICAL LAYOUT (1)
ACCELERATORS (1)
ADDERS (1)
ADJACENCY OF CONNECTIONS (1)
AMAZON SIMPLE STORAGE SERVICE (1)
ANOMALY DETECTION (1)
APPLICATION SPECIFIC PROCESSORS (ASP) (1)
more

INFONA - science communication portal

2017 IEEE High Performance Extreme Computing Conference (HPEC)

Mixed data layout kernels for vectorized complex arithmetic

A quantitative and qualitative analysis of tensor decompositions on spatiotemporal data

A linear algebra approach to fast DNA mixture analysis using GPUs

Memory-efficient parallel tensor decompositions

[Copyright notice]

Investigating TI KeyStone II and quad-core ARM Cortex-A53 architectures for on-board space processing

MIT SuperCloud portal workspace: Enabling HPC web application deployment

Algorithm and hardware co-optimized solution for large SpMV problems

A distributed algorithm for the efficient computation of the unified model of social influence on massive datasets

Accelerating big data applications using lightweight virtualization framework on enterprise cloud

Triangle counting via vectorized set intersection

WCET analysis of the shared data cache in integrated CPU-GPU architectures

Leakage energy reduction for hard real-time caches

FastID: Extremely fast forensic DNA comparisons

Exploring optimizations on shared-memory platforms for parallel triangle counting algorithms

Towards numerical benchmark for half-precision floating point arithmetic

An ensemble framework for detecting community changes in dynamic networks

TriX: Triangle counting at extreme scale

Static graph challenge: Subgraph isomorphism

Efficient and accurate Word2Vec implementations in GPU and shared-memory multicore architectures

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2017 IEEE High Performance Extreme Computing Conference (HPEC) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE High Performance Extreme Computing Conference (HPEC)