Search results for: Venkatraman Govindaraju

Items from 1 to 7 out of 7 results

chapter

Big Data Processing: Scalability with Extreme Single-Node Performance

Venkatraman Govindaraju, Sam Idicula, Sandeep Agrawal, Venkatanathan Vardarajan, more

2017 IEEE International Congress on Big Data (BigData Congress) > 129 - 136

2017 IEEE International Congress on Big Data (BigData Congress)

Contemporary frameworks for data analytics, such as Hadoop, Spark, and Flink seek to allow applications to scale performance flexibly by adding hardware nodes. However, we find that when the computation on each individual node is optimized, peripheral activities such as creating data partitions, messaging and synchronizing between nodes diminish the speedup obtainable from adding more hardware. We...

article

A Graph-Based Program Representation for Analyzing Hardware Specialization Approaches

Tony Nowatzki, Venkatraman Govindaraju, Karthikeyan Sankaralingam

IEEE Computer Architecture Letters > 2015 > 14 > 2 > 94 - 98

Hardware specialization has emerged as a promising paradigm for future microprocessors. Unfortunately, it is natural to develop and evaluate such architectures within end-to-end vertical silos spanning application, language/compiler, hardware design and evaluation tools, leaving little opportunity for cross-architecture analysis and innovation. This paper develops a novel program representation suitable...

chapter

Breaking SIMD shackles with an exposed flexible microarchitecture and the access execute PDG

Venkatraman Govindaraju, Tony Nowatzki, Karthikeyan Sankaralingam

Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques > 341 - 351

2013 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT)

Modern microprocessors exploit data level parallelism through in-core data-parallel accelerators in the form of short vector ISA extensions such as SSE/AVX and NEON. Although these ISA extensions have existed for decades, compilers do not generate good quality, high-performance vectorized code without significant programmer intervention and manual optimization. The fundamental problem is that the...

chapter

Prototyping the DySER specialization architecture with OpenSPARC

Jesse Benson, Ryan Cofell, Chris Frericks, Venkatraman Govindaraju, more

2012 IEEE Hot Chips 24 Symposium (HCS) > 1 - 3

2012 IEEE Hot Chips 24 Symposium (HCS)

This paper describes the prototype implementation of the DySER specialization architecture integrated into the OpenSPARC processor. The paper's description covers the hardware, compiler, and application tuning. The prototype system provides speedups up to 14× over OpenSPARC (geometric mean 5×). The architecture is more flexible than SIMD and GPU-based acceleration while supporting a more diverse set...

chapter

Design, integration and implementation of the DySER hardware accelerator into OpenSPARC

Jesse Benson, Ryan Cofell, Chris Frericks, Chen-Han Ho, more

IEEE International Symposium on High-Performance Comp Architecture > 1 - 12

2012 IEEE 18th International Symposium on High Performance Computer Architecture (HPCA)

Accelerators and specialization in various forms are emerging as a way to increase processor performance. Examples include Navigo, Conservation-Cores, BERET, and DySER. While each of these employ different primitives and principles to achieve specialization, they share some common concerns with regards to implementation. Two of these concerns are: how to integrate them with a commercial processor...

article

DySER: Unifying Functionality and Parallelism Specialization for Energy-Efficient Computing

Venkatraman Govindaraju, Chen-Han Ho, Tony Nowatzki, Jatin Chhugani, more

IEEE Micro > 2012 > 32 > 5 > 38 - 51

The DySER (Dynamically Specializing Execution Resources) architecture supports both functionality specialization and parallelism specialization. By dynamically specializing frequently executing regions and applying parallelism mechanisms, DySER provides efficient functionality and parallelism specialization. It outperforms an out-of-order CPU, Streaming SIMD Extensions (SSE) acceleration, and GPU...

chapter

Sampling + DMR: Practical and low-overhead permanent fault detection

Shuou Nomura, Matthew D. Sinclair, Chen-Han Ho, Venkatraman Govindaraju, more

2011 38th Annual International Symposium on Computer Architecture (ISCA) > 201 - 212

2011 ACM/IEEE 38th International Symposium on Computer Architecture (ISCA)

With technology scaling, manufacture-time and in-field permanent faults are becoming a fundamental problem. Multi-core architectures with spares can tolerate them by detecting and isolating faulty cores, but the required fault detection coverage becomes effectively 100% as the number of permanent faults increases. Dual-modular redundancy(DMR) can provide 100% coverage without assuming device-level...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Venkatraman Govindaraju

Big Data Processing: Scalability with Extreme Single-Node Performance

A Graph-Based Program Representation for Analyzing Hardware Specialization Approaches

Breaking SIMD shackles with an exposed flexible microarchitecture and the access execute PDG

Prototyping the DySER specialization architecture with OpenSPARC

Design, integration and implementation of the DySER hardware accelerator into OpenSPARC

DySER: Unifying Functionality and Parallelism Specialization for Energy-Efficient Computing

Sampling + DMR: Practical and low-overhead permanent fault detection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options