Search results

chapter

Calibration of ultrasonic phased arrays for industrial applications

M. Ingram, A. Gachagan, A. J. Mulholland, A. Nordon, more

2017 IEEE SENSORS > 1 - 3

This paper investigates the consistency in phased array element performance by extracting information from the Full Matrix Capture (FMC) of a reflection from a planar interface. The purpose of this work is to generate a robust methodology for tracking phased array performance over time, thereby ensuring the reliability of measured data. To achieve this, a calibration method has been developed that...

chapter

Development, performance and application of novel GaN-based micro-LED arrays with individually addressable n-electrodes

Enyuan Xie, Mark Stonehouse, Ricardo Ferreira, Jonathan J. D. McKendry, more

2017 IEEE Photonics Conference (IPC) > 71 - 72

We demonstrate the development, performance and application of a GaN-based micro-light emitting diode array sharing a common p-electrode with individually addressed n-electrodes. These individually addressed n-electrodes minimize the series-resistance difference from conductive paths, and offer compatibility with n-type metal-oxide-semiconductor transistor-based drivers for faster modulation.

chapter

Photodiode-integrated UWB phased array antennas

Dylan D. Ross, Matthew R. Konkol, Shouyuan Shi, Dennis W. Prather

2017 IEEE Photonics Conference (IPC) > 109 - 110

High-power, high-linearity CC-MUTC photodiodes directly integrated into connected and tightly coupled array antennas enable ultra-wideband (UWB) phased array operation with improved size, weight, and power (SWaP). Presented is high-fidelity beam steering and bandwidth performance of several of these one-dimensional photodiode-integrated antenna arrays.

chapter

A high-throughput reconfigurable processing array for neural networks

Ephrem Wu, Xiaoqian Zhang, David Berman, Inkeun Cho

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

FPGA-based neural networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper presents a processing array architected to consistently achieve timing closure at 100% of the peak DSP clock rate with standard FPGA tools. In the HDL design environment, our processing array operates at the peak DSP clock rates on Xilinx UltraScale...

chapter

Automated generation of banked memory architectures in the high-level synthesis of multi-threaded software

Yu Ting Chen, Jason H. Anderson

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

Some modern high-level synthesis (HLS) tools [1] permit the synthesis of multi-threaded software into parallel hardware, where concurrent software threads are realized as concurrently operating hardware units. A common performance bottleneck in any parallel implementation (whether it be hardware or software) is memory bandwidth — parallel threads demand concurrent access to memory resulting in contention...

chapter

TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers

Francois Tessier, Venkatram Vishwanath, Emmanuel Jeannot

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 70 - 80

Reading and writing data efficiently from storage systems is necessary for most scientific simulations to achieve good performance at scale. Many software solutions have been developed to decrease the I/O bottleneck. One well-known strategy, in the context of collective I/O operations, is the two-phase I/O scheme. This strategy consists of selecting a subset of processes to aggregate contiguous pieces...

chapter

8×8 Phased series fed patch antenna array at 28 GHz for 5G mobile base station antennas

Muhammad Kamran Ishfaq, Tharek Abd Rahman, Yoshihide Yamada, Kunio Sakakibara

2017 IEEE-APS Topical Conference on Antennas and Propagation in Wireless Communications (APWC) > 160 - 162

5G, the next generation of wireless communications, is focusing on modern antenna technologies like massive MIMO, phased arrays and the mm-wave band to obtain data rates up to 10 Gbps. In this paper, we propose a new 64-element, 8×8 phased series-fed patch antenna array for 28 GHz mm-wave band 5G mobile base station antennas. The phased array steers its beam along the horizontal axis to provide...

chapter

POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators

Swagath Venkataramani, Jungwook Choi, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan, more

2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT) > 146 - 147

The growing prominence and computational challenges imposed by Deep Neural Networks (DNNs) have fueled the design of specialized accelerator architectures and associated dataflows to improve their implementation efficiency. Each of these solutions serves as a datapoint on the throughput vs. energy trade-offs for a given DNN and a set of architectural constraints. In this paper, we set out to explore...

chapter

A novel ReRAM-based processing-in-memory architecture for graph computing

Lei Han, Zhaoyan Shen, Zili Shao, H. Howie Huang, more

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA) > 1 - 6

Graph algorithms such as breadth-first search (BFS) have been gaining ever-increasing importance in the era of Big Data. However, the memory bandwidth remains the key performance bottleneck for graph processing. To address this problem, we utilize processing-in-memory (PIM), combined with non-volatile metal-oxide resistive random access memory (ReRAM), to improve the performance of both computation...

chapter

Early science results from ASKAP

Karen Lee-Waddell

2017 XXXIInd General Assembly and Scientific Symposium of the International Union of Radio Science (URSI GASS) > 1 - 4

ASKAP has recently started its Early Science program with 12 MkII PAF-equipped antennas and 36 beams simultaneously covering a 30 square degree field of view. The first observations have focused on mapping extragalactic neutral hydrogen in galaxy groups and clusters selected by the ‘WALLABY’ Survey Science Team. Significant efforts from engineers, software designers, and scientists are overcoming...

chapter

PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout

Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, more

2017 46th International Conference on Parallel Processing (ICPP) > 402 - 411

Parity declustering is widely deployed in erasure coded storage systems so as to provide fast recovery and high data availability. However, to perform scaling on such RAIDs, it is necessary to preserve the parity declustered data layout so as to guarantee the RAID performance after scaling. Unfortunately, existing scaling algorithms fail to achieve this goal, so they cannot be applied for scaling...

chapter

Locality and availability of array codes constructed from subspaces

Natalia Silberstein, Tuvi Etzion, Moshe Schwartz

2017 IEEE International Symposium on Information Theory (ISIT) > 829 - 833

Ever-increasing amounts of data are created and processed in internet-scale companies such as Google, Facebook, and Amazon. The efficient storage of such copious amounts of data has thus become a fundamental and acute problem in modern computing. No single machine can possibly satisfy such immense storage demands. Therefore, distributed storage systems (DSS), which rely on tens of thousands of storage...

chapter

Triple-fault-tolerant binary MDS array codes with asymptotically optimal repair

Hanxu Hou, Patrick P. C. Lee, Yunghsiang S. Han, Yuchong Hu

2017 IEEE International Symposium on Information Theory (ISIT) > 839 - 843

Binary maximum distance separable (MDS) array codes are a special class of erasure codes for distributed storage that not only provide fault tolerance with minimum storage redundancy, but also achieve low computational complexity. They are constructed by encoding k information columns into r parity columns, in which each element in a column is a bit, such that any k out of the k + r columns suffice...

chapter

An explicit, coupled-layer construction of a high-rate MSR code with low sub-packetization level, small field size and d < (n − 1)

Birenjith Sasidharan, Myna Vajha, P. Vijay Kumar

2017 IEEE International Symposium on Information Theory (ISIT) > 2048 - 2052

This paper presents an explicit construction for an ((n = 2qt, k = 2q(t − 1), d = n − (q + 1)), (α = q(2q)^(t−1), β = α/q)) regenerating code over a field F_q operating at the Minimum Storage Regeneration (MSR) point. The MSR code can be constructed to have rate k/n as close to 1 as desired, sub-packetization level α ≤ r^(n/r) for r = (n − k), field size Q no larger than n and where all code symbols can be...

chapter

Time-delay compensation in array lens antennas

Payam Nayeri, Randy Haupt

2017 11th European Conference on Antennas and Propagation (EUCAP) > 2841 - 2842

A phased array lens has limited bandwidth due to the phase shifters that collimate and scan the beam. A wideband signal requires time delay units in place of phase shifters. This paper investigates the feasibility of implementing time-delay units in array lens antennas. Time-delay compensation mechanisms for array lens antennas are outlined and investigations are carried out to determine the required...

chapter

Design space exploration of FPGA accelerators for convolutional neural networks

Atul Rahman, Sangyun Oh, Jongeun Lee, Kiyoung Choi

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 1147 - 1152

The increasing use of machine learning algorithms, such as Convolutional Neural Networks (CNNs), makes the hardware accelerator approach very compelling. However, the question of how best to design an accelerator for a given CNN has not yet been answered, even on a very fundamental level. This paper addresses that challenge by providing a novel framework that can universally and accurately evaluate...

chapter

23.9 An 8-channel 4.5Gb 180GB/s 18ns-row-latency RAM for the last level cache

Tah-Kang Joseph Ting, Gyh-Bin Wang, Ming-Hung Wang, Chun-Peng Wu, more

2017 IEEE International Solid-State Circuits Conference (ISSCC) > 404 - 405

In recent years, the demand for memory performance has grown rapidly due to the increasing number of cores on a single CPU, along with the integration of graphics processing units and other accelerators. Caching has been a very effective way to relieve bandwidth demand and to reduce average memory latency. As shown by the cache feature table in Fig. 23.9.1, there is a big latency gap between SRAM...

chapter

Bandwidth optimization through on-chip memory restructuring for HLS

Jason Cong, Peng Wei, Cody Hao Yu, Peipei Zhou

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

High-level synthesis (HLS) is getting increasing attention from both academia and industry for high-quality and high-productivity designs. However, when inferring primitive-type arrays in HLS designs into on-chip memory buffers, commercial HLS tools fail to effectively organize FPGAs' on-chip BRAM building blocks to realize high-bandwidth data communication; this often leads to suboptimal quality...

chapter

A Workload Sensitive Dynamic Scaling Matrix Multiplier Structure

Yuran Qiao, Junzhong Shen, Tao Xiao, Qianming Yang

2016 8th International Conference on Computational Intelligence and Communication Networks (CICN) > 548 - 552

Matrix multiplication is one of the most widely used computational kernels in scientific computing and machine learning. Using a dedicated circuit for matrix multiplication can reduce the computational time and energy consumption. Traditional matrix multipliers always adopt a linear array architecture, which works inefficiently when the size of the matrix sub-block is much smaller than the array length. Using...

chapter

Tessellation-based multi-block memory mapping scheme for high-level synthesis with FPGA

Juan Escobedo, Mingjie Lin

2016 International Conference on Field-Programmable Technology (FPT) > 125 - 132

For many intensive computing tasks, simultaneous data access into multi-dimensional data arrays is highly restricted by its data mapping strategy and memory port constraint. As such, to increase memory accessing bandwidth, innovative memory partitioning and mapping algorithms have been proposed to simultaneously access multiple memory blocks through physically distributing data elements in the same...
