Low-Precision Floating-Point Arithmetic Library for Z80

Introduction

The purpose of this library is to provide fast floating-point operations for visualizations and simulations. In particular, 3D games and especially simulators can benefit from it. Extensive use of look-up tables and 8-bit arithmetics provide for adequate performance.

The library uses 9-bit mantissa in the [1..2) range and 7 bit signed exponent. It provides approx. 3 significant decimal digits of precision that allows it to accurately represent integers in the [0..512] range. This enables an angular resolution of almost 0.1 degree (for comparison, the Solar and Lunar discs are approx. 0.5 degrees in diameter) and sub-pixel resolution for the displays of typical 8-bit computers. The representable range is between (10⁻²⁰..10²⁰) giving the user more than 1000 light years or range if the unit is 1 meter or nanometer precision if the unit is 1 light second, in which case the entire observable universe fits into the range.

However, users must exercise extra caution with numerical stability when using low-precision floating-point arithmetics, as precision degrades very rapidly with accummulating errors. In particular, subtraction can be highly inaccurate and division by the result of a subtraction can arbitrarily magnify such errors.

The library also provides conversion into unsigned 64-bit integers (cannot overflow) as well as the means of calculating the remainder of a 64-bit integer after division by a floating-point number. This is useful for accurately representing time and cyclic events, such as the rotation and orbital movement of planets and other bodies.

Specification

Floating-point numbers are represented as 16 bits, typically stored in Z80 register pairs. Bits are interpreted as follows:

bits	semantics
0-7	Mantissa bits -8..-1. Mantissa bit 0 is always 1, not represented
8-14	Exponent bits 0..6. The actual exponent is this number minus 64. Thus, 64 corresponds to 0
15	Sign. 0 for positive, 1 for negative

This representation provides for preserving lexicographic ordering for positive numbers.

No special values such as 0, Inf or NaN are represented. Undeflows result in the smallest representable number, ε=2⁻⁶⁴. Please note that this number does not always behave as an algebraic zero. In particular, ε+ε≠ε even though 0+0=0. While it is generally advised to avoid checking for strict equality in most cases, strict equality with ε should never be used as a zero check. Overflows result in the largerst representable number, 2⁶⁴. This also does not behave as algebraic infinity in many cases. As there is no zero and no infinity, no operations can result in NaN. Every result is a number.

Numerical Tips

Finding a leg b given the hypothenuse c=a+δ and the other leg a of a right triangle should be computed very carefully to avoid catastrophic loss of precision. In particular, do not use ~~b = SQRT(c²-a²)~~, but b = SQRT(δ(2a + δ)), based on the c²-a² = (c - a)(c + a) identity.

Similarly, when calculating sin² α of an angle between two unit vectors a and b, do not take the shortcut of using ~~1-(a⋅b)²~~. Instead, use the inner product of (a⋅b)a - b with itself.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
raytracing		raytracing
test		test
LICENSE		LICENSE
README.md		README.md
addtab.asm		addtab.asm
align.asm		align.asm
cross.asm		cross.asm
decimal.asm		decimal.asm
dectab.asm		dectab.asm
divtab.asm		divtab.asm
dot.asm		dot.asm
faddsub-fast.asm		faddsub-fast.asm
faddsub.asm		faddsub.asm
fdivmul.asm		fdivmul.asm
fint64.asm		fint64.asm
fint8.asm		fint8.asm
fixp.asm		fixp.asm
fsquare.asm		fsquare.asm
hpfp.asm		hpfp.asm
lpfpcalc.asm		lpfpcalc.asm
mul8bit.asm		mul8bit.asm
multab.asm		multab.asm
octa.asm		octa.asm
octab.asm		octab.asm
rotation.asm		rotation.asm
sigsqrt.asm		sigsqrt.asm
squaretab.asm		squaretab.asm
subtab.asm		subtab.asm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Low-Precision Floating-Point Arithmetic Library for Z80

Introduction

Specification

Numerical Tips

About

Releases

Packages

Languages

License

nagydani/lpfp

Folders and files

Latest commit

History

Repository files navigation

Low-Precision Floating-Point Arithmetic Library for Z80

Introduction

Specification

Numerical Tips

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages