Search results for: N. Puzovic

Items from 1 to 4 out of 4 results

chapter

Exploiting DMA to enable non-blocking execution in Decoupled Threaded Architecture

R. Giorgi, Z. Popovic, N. Puzovic

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp. 1-8

DTA (decoupled threaded architecture) is designed to exploit fine/medium grained Thread Level Parallelism (TLP) by using a distributed hardware scheduling unit and relying on existing simple cores (in-order pipelines, no branch predictors, no ROBs). In DTA, the local variables and synchronization data are communicated via a fast frame memory. If the compiler cannot remove global data accesses, the...

chapter

Introducing Hardware TLP Support in the Cell Processor

R. Giorgi, Z. Popovic, N. Puzovic

2009 International Conference on Complex, Intelligent and Software Intensive Systems (CISIS 2009), pp. 657-662

The focus of our study is support for fine/medium-grained thread-level parallelism (TLP) by using a hardware scheduling unit and relying on existing simple cores. Simple cores are grouped into clusters in order to provide a scalable solution. As a proof of concept, we use an implementation based on the Cell Broadband Engine (CBE). Cell is a multiprocessor on a chip developed by Sony, Toshiba and...

chapter

Analyzing Scalability of Deblocking Filter of H.264 via TLP Exploitation in a New Many-Core Architecture

R. Giorgi, Z. Popovic, N. Puzovic, A. Azevedo, et al.

2008 11th EUROMICRO Conference on Digital System Design Architectures, Methods and Tools (DSD), pp. 189-194

In this paper we present the results of parallelizing the Deblocking Filter (DF) of the H.264 video codec on the decoupled threaded architecture (DTA). We parallelized the code, trying to exploit all available thread-level parallelism and to make it suitable for DTA. Experimental results show that a significant speedup can be achieved and that DTA can efficiently exploit available parallelism...

chapter

DTA-C: A Decoupled multi-Threaded Architecture for CMP Systems

R. Giorgi, Z. Popovic, N. Puzovic

2007 19th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'07), pp. 263-270

One way to exploit Thread Level Parallelism (TLP) is to use architectures that implement novel multithreaded execution models, like Scheduled Data-Flow (SDF). This model promises an elegant decoupled and non-blocking execution of threads. Here we extend that model for use in future scalable CMP systems, where wire delay forces the design to be partitioned. In this paper we describe...

Keywords

MULTI-THREADING (4)
COMPUTER ARCHITECTURE (3)
DECOUPLED THREADED ARCHITECTURE (3)
BENCHMARK TESTING (2)
HARDWARE (2)
MICROPROCESSORS (2)
PARALLEL PROCESSING (2)
THREAD LEVEL PARALLELISM (2)
CELL BROADBAND ENGINE (1)
CELL PROCESSOR (1)
DATA PREFETCHING MECHANISM (1)
DEBLOCKING FILTER (1)
DISTRIBUTED HARDWARE SCHEDULING UNIT (1)
DISTRIBUTED SCHEDULERS (1)
DMA (1)
DTA (1)
FAST FRAME MEMORY (1)
FILTERING (1)
FILTERING THEORY (1)
FINE/MEDIUM GRAINED THREAD LEVEL PARALLELISM (1)
H.264 (1)
H.264 DEBLOCKING FILTER PARALLELIZATION (1)
H.264 VIDEO CODEC (1)
HARDWARE TLP SUPPORT (1)
LAKES (1)
MAGNETIC CORES (1)
MANY-CORE (1)
MANY-CORE ARCHITECTURE (1)
MICROPROCESSOR CHIPS (1)
MULTICORE PROCESSORS (1)
MULTIPROCESSING SYSTEMS (1)
MULTIPROCESSOR (1)
MULTITHREADED ARCHITECTURE DECOUPLING (1)
MULTITHREADED EXECUTION MODELS (1)
NONBLOCKING THREAD EXECUTION (1)
PARALLEL ARCHITECTURES (1)
PIPELINES (1)
PIXEL (1)
PREFETCHING (1)
PROGRAM PROCESSORS (1)
REGISTERS (1)
SCALABILITY (1)
SCHEDULED DATA-FLOW (1)
SCHEDULING (1)
SOFTWARE ARCHITECTURE (1)
STORAGE MANAGEMENT (1)
SYNCHRONISATION (1)
SYNCHRONIZATION (1)
SYNCHRONIZATION DATA (1)
THREAD LEVEL PARALLELISM EXPLOITATION (1)
TLP (1)
VIDEO CODECS (1)
VIDEO CODING (1)
YARN (1)

INFONA - science communication portal
