Search results

Items from 1 to 20 out of 47 results

chapter

On-board real-time singularity detection for large-scale 7-DOF space manipulator

Ji-yang Yu, Xiao-dong Zhang, Dan Huang, Xin Li, more

2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM) > 113 - 117

2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM)

According to the characteristic of large space manipulator, an on-board real-time singularity detection design is proposed. On the basis of forward and inverse kinematics calculation, the forward and inverse power method is applied to obtain the singularity by iterative computation. Firstly, the 7-DOF manipulator kinematics model is described and analyzed, and the main computational process is presented;...

article

In-Memory Intelligence

Tim Finkbeiner, Glen Hush, Troy Larsen, Perry Lea, more

IEEE Micro > 2017 > 37 > 4 > 30 - 38

Recent activity in near-data processing has built or proposed systems that can exploit technologies such as 3D stacks, in-situ computing, or dataflow devices. However, little effort has been applied to exploit the natural parallelism and throughput of DRAM. This article details research from Micron Technology in the area of processing in memory as a form of memory-centric computing. In-Memory Intelligence...

chapter

Optimal compilation for exposed datapath architectures with buffered processing units by SAT solvers

Anoop Bhagyanath, Klaus Schneider

2016 ACM/IEEE International Conference on Formal Methods and Models for System Design (MEMOCODE) > 143 - 152

2016 ACM/IEEE International Conference on Formal Methods and Models for System Design (MEMOCODE)

Conventional processor architectures are restricted in exploiting instruction level parallelism (ILP) due to the limited number of available registers in their instruction sets. Therefore, recent processor architectures expose their datapaths so that the compiler not only schedules instructions to functional units, but also takes care of directly moving values between functional units avoiding the...

chapter

A Tow-Level Buffered SDRAM Controller

Tian Jin, Wenxin Li, Xiangyu Hu

2016 3rd International Conference on Information Science and Control Engineering (ICISCE) > 126 - 128

2016 3rd International Conference on Information Science and Control Engineering (ICISCE)

With the improvement of processor and SDRAM performance, the performance of SDRAM controller becomes the bottleneck of the system performance. In this paper, a Tow-Level Buffered SDRAM controller is proposed, and its design and verification are described. To some extent, the controller improves the throughput of the processor for the SDRAM memory, and provides a solution for the design of high performance...

chapter

Retargeting and enhancing a compact multitasking kernel for the Altera Nios II processor

Naraig Manjikian

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 5

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

This paper describes the retargeting and further enhancement of a compact multitasking kernel for the 32-bit Altera Nios II processor. The kernel, called QUERK for Queen's University Educational Real-time Kernel, was originally written in assembly language and then the C language for the Motorola (and then Freescale) 68HC11 processor. Consisting of less than 200 lines of assembly-language instructions,...

chapter

Analyzing graphics processor unit (GPU) instruction set architectures

Kothiya Mayank, Hongwen Dai, Jizeng Wei, Huiyang Zhou

2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 155 - 156

2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

Because of their high throughput and power efficiency, massively parallel architectures like graphics processing units (GPUs) become a popular platform for generous purpose computing. However, there are few studies and analyses on GPU instruction set architectures (ISAs) although it is wellknown that the ISA is a fundamental design issue of all modern processors including GPUs.

chapter

An efficient high speed RISC processor for convolution

Suyog V Pande, Prashant D Bhirange

2015 IEEE 9th International Conference on Intelligent Systems and Control (ISCO) > 1 - 7

2015 IEEE 9th International Conference on Intelligent Systems and Control (ISCO)

Many algorithms have been design in order to accomplish an improved the performance of the filters by using the convolution design. The architecture of the proposed RISC CPU is a uniform 32-bit instruction format, single cycle non-pipelined processor. It has load/store architecture, where the operations will only be performed on registers, and not on memory locations. It follows the classical von-Neumann...

chapter

Area-efficient dynamically reconfigurable protocol-processing-hardware for access network communications SoC

Saki Hatta, Nobuyuki Tanaka, Satoshi Shigematsu

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14) > 1 - 6

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig)

Our proposed architecture of dynamically reconfigurable hardware for protocol processing (DRHPP) provides flexibility with high area efficiency. It can be used for a communications system-on-a-chip (SoC) in access networks. The DRHPP enables the modification and addition of various functions for protocol processing. Our architecture consists of three types of cells. The optimized number of these types...

chapter

Hardware Managers with File System Support for Faster Dynamic Partial Reconfiguration

Fernando A. Escobar, Jimmy Tarrillo, Xin Chang, Carlos Valderrama

2014 IEEE International Symposium on Parallel and Distributed Processing with Applications > 205 - 210

2014 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)

FPGA-based platforms allow implementing reconfigurable systems that can change functionality of portions of hardware at runtime. For this purpose, non-volatile, off-chip storage is required to hold the partial-configuration bitstreams that will be used for reconfiguration. Accessing such devices requires a high CPU usage or a dedicated hardware such as a Direct Memory Access (DMA) module, especially...

chapter

GRAPE-MPs: Implementation of an SIMD for Quadruple/Hexuple/Octuple-Precision Arithmetic Operation on a Structured ASIC and an FPGA

Naohito Nakasato, Hiroshi Daisaka, Toshiyuki Fukushige, Atsushi Kawai, more

2012 IEEE 6th International Symposium on Embedded Multicore SoCs > 75 - 83

2012 IEEE 6th International Symposium on Embedded Multicore Socs (MCSoC)

We describe the design and performance of the GRAPE-MPs, a series of SIMD accelerator boards for quadruple/hexuple/octuple-precision arithmetic operations. Basic design of GRAPE-MPs is that it consists of a number of processing elements (PE) and memory components which handle data with quadruple/hexuple/octuple-precision. A GRAPE-MPs processor is implemented on a structured ASIC chip and an FPGA chip...

chapter

Broadcast with mask on a massively parallel processing on a chip

Hana Krichene, Mouna Baklouti, Mohamed Abid, Philippe Marquet, more

2012 International Conference on High Performance Computing & Simulation (HPCS) > 275 - 280

2012 International Conference on High Performance Computing & Simulation (HPCS)

The delay of instructions broadcast has a significant impact on the performance of Single Instruction Multiple Data (SIMD) architecture. This is especially true for massively parallel processing Systems-on-Chip (mppSoC), where the processing stage and that of setting up the communication mechanism need several clock periods. Subnetting is the strategy used to partition a single physical network into...

chapter

P-double operators in the pipeline system of DF-KPI architecture

Norbert Adam, Branislav Mados, Anton Balaz

2012 IEEE 16th International Conference on Intelligent Engineering Systems (INES) > 357 - 362

2012 IEEE 16th International Conference on Intelligent Engineering Systems (INES)

The data flow technique is a multiprocessor technique which enables parallelism to be found without being explicitly declared. One of the most important steps based on the dynamic data flow model is direct operand matching. The concept of direct operand matching represents the elimination of the costly process (in terms of computing time) related to associative searching of the operands. This paper...

article

B-Fetch: Branch Prediction Directed Prefetching for In-Order Processors

Reena Panda, Paul V. Gratz, Daniel A. Jimenez

IEEE Computer Architecture Letters > 2012 > 11 > 2 > 41 - 44

Computer architecture is beset by two opposing trends. Technology scaling and deep pipelining have led to high memory access latencies; meanwhile, power and energy considerations have revived interest in traditional in-order processors. In-order processors, unlike their superscalar counterparts, do not allow execution to continue around data cache misses. In-order processors, therefore, suffer a greater...

chapter

Controller design for one dimensional SIMD array

Facun Zhang, Qiankun Wang, Wei Liu

2011 International Conference on Electronics, Communications and Control (ICECC) > 1361 - 1364

2011 International Conference on Electronics, Communications and Control (ICECC)

One dimensional SIMD array which is PIM-based data parallel computer architectural has been proposed for multimedia processing application. This paper describes the implementation of a controller for one dimensional SIMD array. The main components of the controller and the instruction format are presented. PE array control is introduced in detail. Finally, the results of simulation are given to show...

chapter

A Thread Speed Control Scheme for Real-Time Microprocessors

Kohei Matsumoto, Hiroyuki Umeo, Nobuyuki Yamasaki

2011 IEEE 17th International Conference on Embedded and Real-Time Computing Systems and Applications > 2 > 16 - 21

2011 IEEE 17th International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA)

Real-time execution of applications is one of key requirements for Cyber-Physical Systems (CPS) that integrate computational and physical elements for our social infrastructure, such as robotics, transportation, and consumer appliances. In such real-time systems, a task must be executed so as not to violate given time constraints. Moreover, it is desirable that the execution time of the task is predictable...

chapter

An Obfuscation-Based Approach against Injection Attacks

Fabrizio Baiardi, Daniele Sgandurra

2011 Sixth International Conference on Availability, Reliability and Security > 51 - 58

2011 Sixth International Conference on Availability, Reliability and Security (ARES)

We present an obfuscation strategy to protect a program against injection attacks. The strategy represents the program as a set of code fragments in-between two consecutive system calls (the system blocks) and a graph that represents the execution order of the fragment (the system block graph). The system blocks and the system block graph are partitioned between two virtual machines (VMs). The Blocks-VM...

chapter

Design and Implementation of BIOS for Godson-3A Interconnections

Yuhui Gao, Mingfa Zhu, Jiantong Huo, Limin Xiao, more

2011 International Conference on Computer and Management (CAMAN) > 1 - 5

2011 International Conference on Computer and Management (CAMAN 2011)

The hardware design of Godson-3A processor adopts the scalable distributed multi-core structure which is based on a 2D mesh. It can make use of multi-chip interconnection to construct a unified topology structure for board level or system level. This kind of interconnected system can't achieve entirely by hardware design, and it also needs the reasonable design of the BIOS and upper software. As the...

chapter

Implementing a safe embedded computing system in SRAM-based FPGAs using IP cores: A case study based on the Altera NIOS-II soft processor

Julio Perez Acle, M S Reorda, M Violante

2011 IEEE Second Latin American Symposium on Circuits and Systems (LASCAS) > 1 - 5

2011 IEEE Second Latin American Symposium on Circuits and Systems (LASCAS)

Reconfigurable Field Programmable Gate Arrays (FPGAs) are growing the attention of developers of mission- and safety-critical applications (e.g., aerospace ones), as they allow unprecedented levels of performance, which are making these devices particularly attractive as ASICs replacement, and as they offer the unique feature of in-the-field reconfiguration. However, the sensitivity of reconfigurable...

chapter

RJOP - A customized Java processor for reactive embedded systems

Muhammad Nadeem, Morteza Biglari-Abhari, Zoran Salcic

2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1038 - 1043

2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC)

This paper presents a novel, high performance and low cost execution architecture for the system level GALS programming language SystemJ, which extends Java with synchronous reactive features present in Esterel and asynchronous constructs of CSP (Communicating Sequential Processes). The new architecture is based on JOP (Java Optimized Processor), which is a hardware implementation of the Java Virtual...

chapter

Programmable architecture for NFA-based string matching

Junghak Kim, Song-In Choi, Sook-Jin Lee, Jee-Hwan Ahn, more

2010 International Conference on Information and Communication Technology Convergence (ICTC) > 484 - 489

2010 International Conference on Information and Communication Technology Convergence (ICTC)

In this paper, we propose a programmable string matching architecture to process multiple characters at a single cycle. To simplify the architecture of the previous works, we employ a method of realigning the input data stream by offsets. We show that some registers can be eliminated by using the method. Additionally, we present two different approaches to implement a programmable hardware for string...

Keywords:
COMPUTER ARCHITECTURE
REGISTERS

Publication date

Set your own date range

Publication type

book (42)
article (5)

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (14)
HARDWARE (13)
CLOCKS (11)
SOFTWARE (10)
RANDOM ACCESS MEMORY (8)
COMPUTERS (7)
CONFERENCES (6)
FPGA (6)
PROTOCOLS (6)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (5)
COMPUTATIONAL MODELING (5)
EMBEDDED SYSTEMS (5)
INSTRUCTION SETS (5)
MICROPROCESSOR CHIPS (5)
MONITORING (5)
RADIATION DETECTORS (5)
REAL TIME SYSTEMS (5)
REDUCED INSTRUCTION SET COMPUTING (5)
TESTING (5)
TIMING (5)
ALGORITHM DESIGN AND ANALYSIS (4)
DATA MINING (4)
DATA MODELS (4)
DECODING (4)
FLIP-FLOPS (4)
MULTIPLEXING (4)
MULTIPROCESSING SYSTEMS (4)
PARALLEL ARCHITECTURES (4)
SIGNAL PROCESSING (4)
SYSTEM PERFORMANCE (4)
SYSTEM-ON-A-CHIP (4)
AEROSPACE ELECTRONICS (3)
ANALYTICAL MODELS (3)
ASSEMBLY (3)
BUFFER STORAGE (3)
CACHE MEMORY (3)
CONTROL SYSTEMS (3)
DISPLAYS (3)
EQUATIONS (3)
FAULT TOLERANCE (3)
LOAD MODELING (3)
LOGIC GATES (3)
MATHEMATICAL MODEL (3)
MEMORY MANAGEMENT (3)
MICROCOMPUTERS (3)
MULTI-THREADING (3)
OPTIMIZATION (3)
PARALLEL PROCESSING (3)
PERFORMANCE EVALUATION (3)
PIPELINE PROCESSING (3)
PIPELINES (3)
POWER DEMAND (3)
RECONFIGURABLE ARCHITECTURES (3)
REDUNDANCY (3)
SIGNAL PROCESSING ALGORITHMS (3)
SIMULATION (3)
SOFTWARE ALGORITHMS (3)
STREAMING MEDIA (3)
SYSTEM-ON-CHIP (3)
TRANSIENT ANALYSIS (3)
VLIW (3)
ADAPTATION MODEL (2)
ADAPTIVE CONTROL (2)
ADDERS (2)
ARRAYS (2)
ARTIFICIAL NEURAL NETWORKS (2)
BANDWIDTH (2)
BENCHMARK TESTING (2)
BLOCK MATCHING MOTION ESTIMATION (2)
CIRCUIT FAULTS (2)
CIRCUIT SYNTHESIS (2)
COMPUTATIONAL EFFICIENCY (2)
CONVOLUTION (2)
CRYPTOGRAPHY (2)
DEBUGGING (2)
DELAY (2)
DIGITAL ARITHMETIC (2)
DIGITAL SYSTEMS (2)
DRIVER CIRCUITS (2)
EDUCATIONAL INSTITUTIONS (2)
ELECTRONIC MAIL (2)
EMBEDDED SYSTEM (2)
FAULT DETECTION (2)
FAULT INJECTION (2)
FAULT TOLERANT SYSTEMS (2)
FLOWCHARTS (2)
IMAGE COLOR ANALYSIS (2)
IMAGE PROCESSING (2)
INTEGRATED CIRCUIT DESIGN (2)
IP NETWORKS (2)
KERNEL (2)
LOW PASS FILTERS (2)
MEDIA (2)
MEMORY ARCHITECTURE (2)
MOTION ESTIMATION (2)
MULTIMEDIA SYSTEMS (2)
NONVOLATILE MEMORY (2)
more

INFONA - science communication portal

Search results

On-board real-time singularity detection for large-scale 7-DOF space manipulator

In-Memory Intelligence

Optimal compilation for exposed datapath architectures with buffered processing units by SAT solvers

A Tow-Level Buffered SDRAM Controller

Retargeting and enhancing a compact multitasking kernel for the Altera Nios II processor

Analyzing graphics processor unit (GPU) instruction set architectures

An efficient high speed RISC processor for convolution

Area-efficient dynamically reconfigurable protocol-processing-hardware for access network communications SoC

Hardware Managers with File System Support for Faster Dynamic Partial Reconfiguration

GRAPE-MPs: Implementation of an SIMD for Quadruple/Hexuple/Octuple-Precision Arithmetic Operation on a Structured ASIC and an FPGA

Broadcast with mask on a massively parallel processing on a chip

P-double operators in the pipeline system of DF-KPI architecture

B-Fetch: Branch Prediction Directed Prefetching for In-Order Processors

Controller design for one dimensional SIMD array

A Thread Speed Control Scheme for Real-Time Microprocessors

An Obfuscation-Based Approach against Injection Attacks

Design and Implementation of BIOS for Godson-3A Interconnections

Implementing a safe embedded computing system in SRAM-based FPGAs using IP cores: A case study based on the Altera NIOS-II soft processor

RJOP - A customized Java processor for reactive embedded systems

Programmable architecture for NFA-based string matching

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options