chapter

Extending Skel to Support the Development and Optimization of Next Generation I/O Systems

Jeremy Logan, Jong Youl Choi, Matthew Wolf, George Ostrouchov, et al.

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 563 - 571

As the memory and storage hierarchy get deeper and more complex, it is important to have new benchmarks and evaluation tools that allow us to explore the emerging middleware solutions to use this hierarchy. Skel is a tool aimed at automating and refining this process of studying HPC I/O performance. It works by generating application I/O kernel/benchmarks as determined by a domain-specific model....

chapter

Quantifying the Potential Benefits of On-chip Near-Data Computing in Manycore Processors

Jagadish B. Kotra, Diana Guttman, Nachiappan Chidamabaram N., Mahmut T. Kandemir, et al.

2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS) > 198 - 209

Increasing data set sizes motivate a shift of focus from computation-centric systems to data-centric systems, where data movement is treated as a first-class optimization metric. An example of this emerging paradigm is in-situ computing in large-scale computing systems. Observing that data movement costs are increasing at an exponential rate even at a node level (as a node itself is fast becoming...

chapter

A unified HW/SW system-level simulation framework for next generation wireless system

N. Sutisna, L. Lanante, Y. Nagao, M. Kurosaki, et al.

2017 30th IEEE International System-on-Chip Conference (SOCC) > 322 - 327

Recently, wireless technology has experienced fast growth to meet user demand and push toward the limits of system performance. Simulation and verification frameworks play an important role in accelerating the investigation of technology proofs of concept, field trials, and large-scale commercial prototyping. In this paper, we present system-level simulation of heterogeneous model and unified HW/SW...

chapter

Distributed Particle-Based Rendering Framework for Large Data Visualization on HPC Environments

Jorji Nonaka, Naohisa Sakamoto, Takashi Shimizu, Masahiro Fujita, et al.

2017 International Conference on High Performance Computing & Simulation (HPCS) > 300 - 307

In this paper, we present a distributed data visualization framework for HPC environments based on the PBVR (Particle Based Volume Rendering) method. The PBVR method is a kind of point-based rendering approach where the volumetric data to be visualized is represented as a set of small and opaque particles. This method has the object-space and image-space variants, defined by the place (object or image-...

chapter

ESP: A Machine Learning Approach to Predicting Application Interference

Nikita Mishra, John D. Lafferty, Henry Hoffmann

2017 IEEE International Conference on Autonomic Computing (ICAC) > 125 - 134

Independent applications co-scheduled on the same hardware will interfere with one another, affecting performance in complicated ways. Predicting this interference is key to efficiently scheduling applications on shared hardware, but forming accurate predictions is difficult because there are many shared hardware features that could lead to the interference. In this paper we investigate machine learning...

chapter

Simulating spark cluster for deployment planning, evaluation and optimization

Qian Chen, Kebing Wang, Zhaojuan Bian, Illia Cremer, et al.

2016 6th International Conference on Simulation and Modeling Methodologies, Technologies and Applications (SIMULTECH) > 1 - 11

As the most active project in the Hadoop ecosystem these days (Zaharia, 2014), Spark is a fast and general purpose engine for large-scale data processing. Thanks to its advanced Directed Acyclic Graph (DAG) execution engine and in-memory computing mechanism, Spark runs programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk (Apache, 2016). However, Spark performance is impacted...

chapter

BrainGrid+Workbench: High-performance/high-quality neural simulation

Michael Stiber, Fumitaka Kawasaki, Delmar B. Davis, Hazeline U. Asuncion, et al.

2017 International Joint Conference on Neural Networks (IJCNN) > 2469 - 2476

Availability of affordable hardware that in effect enables desktop supercomputing has enabled more ambitious neural simulations driven by more complex software. However, this opportunity comes with costs, in terms of long learning curves to take advantage of the performance possibilities of idiosyncratic, architecturally heterogeneous hardware and decreasing ability to be confident in the quality of...

chapter

Simulation tools for cloud computing: A survey and comparative study

Fairouz Fakhfakh, Hatem Hadj Kacem, Ahmed Hadj Kacem

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) > 221 - 226

Today, cloud computing has become a promising paradigm that aims at delivering computing resources and services on demand. The adoption of these services has been rapidly increasing. One of the main issues in this context is how to evaluate the ability of cloud systems to provide the desired services while respecting the QoS constraints. Experimentation in a real environment is a hard problem. In...

chapter

Fast IPC estimation for performance projections using proxy suites and decision trees

Kanishka Lahiri, Subhash Kunnoth

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 77 - 86

Accurate IPC estimates are critical for generating performance projections of key workloads on future designs. However, the need to respond to projections requests in a timely manner in the face of rapidly evolving applications and software stacks and tight schedule constraints, often preclude design teams from executing detailed workload analysis, sampling and simulation flows for such purposes....

chapter

FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates

Yijin Guan, Hao Liang, Ningyi Xu, Wenqiang Wang, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 152 - 159

DNNs (Deep Neural Networks) have demonstrated great success in numerous applications such as image classification, speech recognition, video analysis, etc. However, DNNs are much more computation-intensive and memory-intensive than previous shallow models. Thus, it is challenging to deploy DNNs in both large-scale data centers and real-time embedded systems. Considering performance, flexibility, and...

chapter

The Potential of Dynamic Binary Modification and CPU-FPGA SoCs for Simulation

John Mawer, Oscar Palomar, Cosmin Gorgovan, Andy Nisbet, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 144 - 151

In this paper we describe a flexible infrastructure that can directly interface unmodified application executables with FPGA hardware acceleration IP in order to 1), facilitate faster computer architecture simulation, and 2), to prototype microarchitecture or accelerator IP. Dynamic binary modification tool plugins are directly interfaced to the application under evaluation via flexible software interfaces...

chapter

Architectures for cloud-based HPC in data centers

Dao Manh Phan Hung, Sunil Manyam Seshadri Naidu, Michael Opoku Agyeman

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA) > 138 - 143

The growing demands in IT services for improving efficiency and quality at low cost to handle complex compute requirements have led to the integration of high-performance computing (HPC) systems and cloud infrastructure in data centers. Earlier, HPC systems were limited to academic and research institutions and engineering laboratories. However, the emergence of cloud infrastructures and their successful...

chapter

Research on interactive application of online education based on cloud computing and large data

Yuan Jiugen, Xing Ruonan, Kuang Rongrong

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA) > 593 - 596

Online education interaction is an important part in online education research. The emergence and development of cloud computing and large data technology provide new opportunities for online education interaction research, and have great influence on its service mode and data processing. Based on the characteristics of cloud computing and large data, this paper discusses the problems faced by online...

chapter

Invited: Accelerator design for deep learning training

Ankur Agrawal, Chia-Yu Chen, Jungwook Choi, Kailash Gopalakrishnan, et al.

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 2

Deep Neural Networks (DNNs) have emerged as a powerful and versatile set of techniques showing successes on challenging artificial intelligence (AI) problems. Applications in domains such as image/video processing, autonomous cars, natural language processing, speech synthesis and recognition, genomics and many others have embraced deep learning as the foundation. DNNs achieve superior accuracy for...

chapter

Property mining using dynamic dependency graphs

Jan Malburg, Tino Flenker, Gorschwin Fey

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC) > 244 - 250

We present a technique to automatically generate System Verilog-Assertions from designs using dynamic dependency graphs. We extract relations between signals of the design using only a few simulation runs, which drastically reduces the required number of use cases compared to other approaches. Additionally, unlike previous approaches, we do not use expression templates to establish those relations...

chapter

A Modeling Approach to Hardware Analysis of the Heterogeneous DEAC Cluster

Riana J. Freedman, Damian Valles

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 1408 - 1409

The employment of five distinct benchmarks on the Distributed Environment for Academic Computing (DEAC) Cluster at Wake Forest University provides meaningful metrics of cluster processor and memory performance. Given the heterogeneous nature of the DEAC Cluster, the benchmarks taken consider the specific processor architectures comprising the cluster. The data obtained will be assessed via two modeling...

chapter

Performance prediction techniques for scalable large data processing in distributed MPI systems

Janki Bhimani, Ningfang Mi, Miriam Leeser

2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC) > 1 - 2

Predicting performance of an application running on parallel computing platforms is increasingly becoming important due to the long development time of an application and the high resource management cost of parallel computing platforms. However, predicting overall performance is complex and must take into account both parallel calculation time and communication time. Difficulty in accurate performance...

chapter

Design for Hardware In-the-Loop Real-Time Simulation Test of Combined Seeker

Yang Zhang, Chuan Shi, Huanyao Dai

2016 9th International Symposium on Computational Intelligence and Design (ISCID) > 1 > 74 - 77

Hardware in-the-loop simulation testing combines the advantages of live testing and digital simulation testing, and can build a lifelike test environment. This kind of test supports repeated testing over multiple samples. The key techniques of hardware in-the-loop simulation testing are real-time algorithms and communication technology. In this paper, based on reflective memory network, the design for hardware...

chapter

Scalable Interconnection Network Models for Rapid Performance Prediction of HPC Applications

Kishwar Ahmed, Jason Liu, Stephan Eidenbenz, Joe Zerr

2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) > 1069 - 1078

Performance Prediction Toolkit (PPT) is a simulator mainly developed at Los Alamos National Laboratory to facilitate rapid and accurate performance prediction of large-scale scientific applications on existing and future HPC architectures. In this paper, we present three interconnect models for performance prediction of large-scale HPC applications. They are based on interconnect topologies widely...

chapter

Exploring multi-view learning for activity inferences on smartphones

Gunarto Sindoro Njoo, Chien-Hsiang Lai, Kuo-Wei Hsu

2016 Conference on Technologies and Applications of Artificial Intelligence (TAAI) > 212 - 219

Inferring activities on smartphones is a challenging task. Prior works have elaborated on using sensory data from built-in hardware sensors in smartphones or taking advantage of location information to understand human activities. In this paper, we explore two types of data on smartphones to conduct activity inference: 1) Spatial-Temporal: reflecting daily routines from the combination of spatial...
