Nicolas Brunie

chapter

Modified Fused Multiply and Add for Exact Low Precision Product Accumulation

Nicolas Brunie

2017 IEEE 24th Symposium on Computer Arithmetic (ARITH) > 106 - 113

2017 IEEE 24th Symposium on Computer Arithmetic (ARITH)

The implementation of the Fused Multiply and Add (FMA) operation has been extensively studied in the literature on standard and large precisions. We suggest re- visiting those studies for 16-bit precision. We introduce a variation of the Mixed precision FMA targeted for applications processing low precision inputs (such as machine learning). We also introduce several versions of a fixed point based...

chapter

Computing floating-point logarithms with fixed-point operations

Julien Le Maire, Nicolas Brunie, Florent de Dinechin, Jean-Michel Muller

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH) > 156 - 163

2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH)

Elementary functions from the mathematical library input and output floating-point numbers. However it is possible to implement them purely using integer/fixed-point arithmetic. This option was not attractive between 1985 and 2005, because mainstream processor hardware supported 64-bit floating-point, but only 32-bit integers. This has changed in recent years, in particular with the generalization...

chapter

Code Generators for Mathematical Functions

Nicolas Brunie, Florent de Dinechin, Olga Kupriianova, Christoph Lauter

2015 IEEE 22nd Symposium on Computer Arithmetic > 66 - 73

2015 IEEE 22nd Symposium on Computer Arithmetic (ARITH)

A typical floating-point environment includes support for a small set of about 30 mathematical functions such as exponential, logarithm, trigonometric and hyperbolic functions. These functions are provided by mathematical software libraries (libm), typically in IEEE754 single, double and quad precision. This article suggests to replace this libm paradigm by a more general approach: the on-demand generation...

chapter

Arithmetic core generation using bit heaps

Nicolas Brunie, Florent de Dinechin, Matei Istoan, Guillaume Sergent, more

2013 23rd International Conference on Field programmable Logic and Applications > 1 - 8

2013 23rd International Conference on Field Programmable Logic and Applications (FPL)

A bit heap is a data structure that holds the unevaluated sum of an arbitrary number of bits, each weighted by some power of two. Most advanced arithmetic cores can be viewed as involving one or several bit heaps. We claim here that this point of view leads to better global optimization at the algebraic level, at the circuit level, and in terms of software engineering. To demonstrate it, a generic...

chapter

Simultaneous branch and warp interweaving for sustained GPU performance

Nicolas Brunie, Sylvain Collange, Gregory Diamos

2012 39th Annual International Symposium on Computer Architecture (ISCA) > 49 - 60

2012 ACM/IEEE 39th International Symposium on Computer Architecture (ISCA)

Instruction Multiple-Thread (SIMT) micro-architectures implemented in Graphics Processing Units (GPUs) run fine-grained threads in lockstep by grouping them into units, referred to as warps, to amortize the cost of instruction fetch, decode and control logic over multiple execution units. As individual threads take divergent execution paths, their processing takes place sequentially, defeating part...

chapter

A mixed-precision fused multiply and add

Nicolas Brunie, Florent de Dinechin, Benoit de Dinechin

2011 Conference Record of the Forty Fifth Asilomar Conference on Signals, Systems and Computers (ASILOMAR) > 165 - 169

2011 45th Asilomar Conference on Signals, Systems and Computers

The floating-point fused multiply and add, computing R=AB+C with a single rounding, is now an IEEE-754 standard operator. This article investigates variants in which the addend C and the result R are of a larger format, for instance binary64 (double precision), while the multiplier inputs A and B are of a smaller format, for instance binary32 (single precision). Like the standard FMA operator, the...

INFONA - science communication portal

Search results for: Nicolas Brunie

Modified Fused Multiply and Add for Exact Low Precision Product Accumulation

Computing floating-point logarithms with fixed-point operations

Code Generators for Mathematical Functions

Arithmetic core generation using bit heaps

Simultaneous branch and warp interweaving for sustained GPU performance

A mixed-precision fused multiply and add

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Nicolas Brunie

Modified Fused Multiply and Add for Exact Low Precision Product Accumulation

Computing floating-point logarithms with fixed-point operations

Code Generators for Mathematical Functions

Arithmetic core generation using bit heaps

Simultaneous branch and warp interweaving for sustained GPU performance

A mixed-precision fused multiply and add

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options