Search results for: Masaaki Kondo

Items from 1 to 19 out of 19 results

chapter

Production Hardware Overprovisioning: Real-World Performance Optimization Using an Extensible Power-Aware Resource Management Framework

Ryuichi Sakamoto, Thang Cao, Masaaki Kondo, Koji Inoue, more

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 957 - 966

Limited power budgets will be one of the biggest challenges for deploying future exascale supercomputers. One of the promising ways to deal with this challenge is hardware overprovisioning, that is, installing more hardware resources than can be fully powered under a given power limit coupled with software mechanisms to steer the limited power to where it is needed most. Prior research has demonstrated...

chapter

Cooling-Aware Job Scheduling and Node Allocation for Overprovisioned HPC Systems

Thang Cao, Wei Huang, Yuan He, Masaaki Kondo

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 728 - 737

Limited power budget is becoming one of the most crucial challenges in developing supercomputer systems. Hardware overprovisioning which installs a larger number of nodes beyond the limitations of the power constraint is an attractive way to design next generation supercomputers. In air cooled HPC centers, about half of the total power is consumed by cooling facilities. Reducing cooling power and...

chapter

Opportunistic circuit-switching for energy efficient on-chip networks

Yuan He, Masaaki Kondo

2016 IFIP/IEEE International Conference on Very Large Scale Integration (VLSI-SoC) > 1 - 6

Modern on-chip networks (NoCs) rely on virtual channel (VC) flow control to allow effective utilization of link bandwidth at the cost of more power and longer per-hop latency. Despite many existing optimization techniques for NoCs under VC flow control, we take a further step on questioning its necessity. Our finding is, when the network is not busy, circuit-switching (CS) may already satisfy the...

chapter

Demand-Aware Power Management for Power-Constrained HPC Systems

Thang Cao, Yuan He, Masaaki Kondo

2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) > 21 - 31

As limited power budget is becoming one of the most crucial challenges in developing supercomputer systems, hardware overprovisioning which installs larger number of nodes beyond the limitations of the power constraint determined by Thermal Design Power is an attractive way to design extreme-scale supercomputers. In this design, power consumption of each node should be controlled by power-knobs equipped...

chapter

Runtime multi-optimizations for energy efficient on-chip interconnections

Yuan He, Masaaki Kondo, Takashi Nakada, Hiroshi Sasaki, more

2015 33rd IEEE International Conference on Computer Design (ICCD) > 455 - 458

On-chip interconnection (or NoC) is a major performance and power contributor to modern and future multicore processors. So far, many optimization techniques have been developed to improve its bandwidth, latency and power consumption. But it is not clear how energy efficiency is affected since an optimization technique normally comes with overheads. This paper thus attempts to address when and how...

chapter

A flexible hardware barrier mechanism for many-core processors

Takeshi Soga, Hiroshi Sasaki, Tomoya Hirao, Masaaki Kondo, more

The 20th Asia and South Pacific Design Automation Conference > 61 - 68

2015 20th Asia and South Pacific Design Automation Conference (ASP-DAC)

This paper proposes a new hardware barrier mechanism which offers the flexibility to select which cores should join the synchronization, allowing for executing multiple multi-threaded applications by dividing a many-core processor into several groups. Experimental results based on an RTL simulation show that our hardware barrier achieves a 66-fold reduction in latency over typical software based implementations,...

chapter

Analyzing and mitigating the impact of manufacturing variability in power-constrained supercomputing

Yuichi Inadomi, Tapasya Patki, Koji Inoue, Mutsumi Aoyagi, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 12

A key challenge in next-generation supercomputing is to effectively schedule limited power resources. Modern processors suffer from increasingly large power variations due to the chip manufacturing process. These variations lead to power inhomogeneity in current systems and manifest into performance inhomogeneity in power constrained environments, drastically limiting supercomputing performance. We...

chapter

Unbalanced buffer tree synthesis to suppress ground bounce for fine-grain power gating

Kimiyoshi Usami, Makoto Miyauchi, Masaru Kudo, Kazumitsu Takagi, more

2014 International Symposium on System-on-Chip (SoC) > 1 - 7

This paper describes a new approach to reduce the ground bounce (GB) while keeping the wakeup time short for fine-grain power gating. We propose a novel algorithm to synthesize an optimal unbalanced buffer tree (UBT) that turns on parallel power switches with slight time differences. We have applied our algorithm to function units of a 32-bit microprocessor. Experimental results have revealed that...

chapter

Design and evaluation of fine-grained power-gating for embedded microprocessors

Masaaki Kondo, Hiroaki Kobyashi, Ryuichi Sakamoto, Motoki Wada, more

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 6

Power-performance efficiency is still remaining a primary concern for microprocessor designers. One of the sources of power inefficiency for recent LSI chips is increasing leakage power consumption. Power-gating is a well known technique to reduce leakage power consumption by switching off the power supply to idle logic blocks. Recently, fine-grained power-gating is emerged as a technique to minimize...

chapter

Design and control methodology for fine grain power gating based on energy characterization and code profiling of microprocessors

Kimiyoshi Usami, Masaru Kudo, Kensaku Matsunaga, Tsubasa Kosaka, more

2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC) > 843 - 848

This paper presents a design and control scheme of a microprocessor whose internal function units are power gated at instruction-by-instruction basis. Enabling/disabling the power gating is adaptively controlled under the support of on-chip leakage monitors and the operating system to minimize energy overhead due to sleep-in and wakeup. Measured results of the fabricated chip in the 65nm CMOS technology...

chapter

Demonstration of a heterogeneous multi-core processor with 3-D inductive coupling links

Yusuke Koizumi, Noriyuki Miura, Yasuhiro Take, Hiroki Matsutani, more

2013 23rd International Conference on Field programmable Logic and Applications > 1

2013 23rd International Conference on Field Programmable Logic and Applications (FPL)

Cube-1 is a heterogeneous multi-core processor which can achieve the required performance with the least energy consumption as possible. It can control the performance and energy with two levels: (1) the number of accelerators can be easily changed by increasing or decreasing the number of stacked chips after fabrication, as they are connected with inductive coupling links. (2) The supply voltage...

chapter

A scalable 3D heterogeneous multi-core processor with inductive-coupling thruchip interface

Noriyuki Miura, Yusuke Koizumi, Eiichi Sasaki, Yasuhiro Take, more

2013 IEEE Hot Chips 25 Symposium (HCS) > 1

Recent battery driven IT devices including smart phone and tablets require versatile functions and high performance with low energy consumption. On the other hand, the initial cost of LSI for design and mask development has increased rapidly, and development of an SoC (System-on-a Chip) for each product has become difficult. Although flexible reconfigurable architectures can be a solution, the performance...

chapter

A scalable 3D heterogeneous multi-core processor with inductive-coupling thruchip interface

Noriyuki Miura, Yusuke Koizumi, Eiichi Sasaki, Yasuhiro Take, more

2013 IEEE COOL Chips XVI > 1 - 3

2013 COOL Chips XVI

A scalable heterogeneous multi-core processor is developed. 3D heterogeneous chip stacking of a general-purpose CPU and reconfigurable multi-core accelerators improves computational energy efficiency by proper task assignment and massive parallel computing. The stacked chips interconnect through a scalable 3D Network on Chip (NoC). By simply changing the number of stacked accelerator chips, processor...

article

A Scalable 3D Heterogeneous Multicore with an Inductive ThruChip Interface

Noriyuki Miura, Yusuke Koizumi, Yasuhiro Take, Hiroki Matsutani, more

IEEE Micro > 2013 > 33 > 6 > 6 - 15

The authors developed a scalable heterogeneous multicore processor. 3D heterogeneous chip stacking of a general-purpose CPU and reconfigurable multicore accelerators enables various trade-offs between performance and energy consumption. The stacked chips interconnect through a scalable 3D network on a chip (NoC). By simply changing the number of stacked accelerator chips, processor parallelism can...

chapter

Dynamic power control with a heterogeneous multi-core system using a 3-D wireless inductive coupling interconnect

Yusuke Koizumi, Hideharu Amano, Hiroki Matsutani, Noriyuki Miura, more

2012 International Conference on Field-Programmable Technology > 293 - 296

2012 International Conference on Field-Programmable Technology (FPT)

Cube-2 is a prototype of building block scalable reconfigurable accelerator using an inductive coupling interconnect. It is consisting of a ultra low leakage embedded processor Geyser and coarse-grained reconfigurable accelerators CMA (Cool Mega Array). A Geyser chip and multiple CMA chips are stacked, and a powerful network is formed by using the inductive coupling interconnect. The performance can...

chapter

CMA-Cube: A scalable reconfigurable accelerator with 3-D wireless inductive coupling interconnect

Yusuke Koizumi, Eiichi Sasaki, Hideharu Amano, Hiroki Matsutani, more

22nd International Conference on Field Programmable Logic and Applications (FPL) > 543 - 546

2012 22nd International Conference on Field Programmable Logic and Applications (FPL)

CMA-Cube is the second prototype of building block scalable reconfigurable accelerator using inductive coupling interconnect. It uses the wireless inductive coupling interconnect as a packet switching network which connects accelerators. As an accelerator core, CMA (Cool Mega Array), which consists of a large coarse-grained PE array with combinatorial circuits and tiny micro-controller, is applied...

chapter

SLD-1(Silent Large Datapath): A ultra low power reconfigurable accelerator

Nobuaki Ozaki, Kimiyoshi Usami, Hideharu Amano, Mitaro Namiki, more

2011 IEEE Cool Chips XIV > 1 - 3

SLD(Silent Large Datapath)-1 is a prototype accelerator for media processing consisting of a large Processing Element (PE) array which includes 24bit 8 × 8 PEs with combinatorial circuits and a small micro-controller for data memory access. It was fabricated in 2.1mm × 4.2mm 65 nm CMOS, and achieves 1.356GOPS/11mW sustained performance by reducing overhead of clock tree and the benefit of voltage...

article

Cool Mega-Arrays: Ultralow-Power Reconfigurable Accelerator Chips

Nobuaki Ozaki, Yoshihiro Yasuda, Mai Izawa, Yoshiki Saito, more

IEEE Micro > 2011 > 31 > 6 > 6 - 18

Cool Mega-Array (CMA) is an energy-efficient reconfigurable accelerator for battery-driven mobile devices. It has a large processing-element array without memory elements for mapping an application's data-flow graph, a simple programmable microcontroller for data management, and data memory. Unlike coarse-grained dynamically reconfigurable processors, CMA reduces power consumption by switching hardware...

chapter

Adaptive power gating for function units in a microprocessor

Kimiyoshi Usami, Tatsunori Hashida, Satoshi Koyama, Tatsuya Yamamoto, more

2010 11th International Symposium on Quality Electronic Design (ISQED) > 29 - 37

Eleventh International Symposium on Quality of Electronic Design (ISQED 2010)

This paper describes adaptive fine-grain control to power gate function units based on temperature dependent break-even time (BET). An analytical model to express the temperature dependent BET is introduced and the accuracy of the model was examined. Results demonstrated that the model well represents the exponential decrease in BET with the temperature. Meanwhile, it was found that the accuracy gets...

INFONA - science communication portal
