
devel / comp.arch / Re: bad float, Fantasy architecture: the 10-bit byte

Subject  (Author)
* Fantasy architecture: the 10-bit byte  (Russell Wallace)
+* Re: Fantasy architecture: the 10-bit byte  (Russell Wallace)
|`* Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| `- Re: Fantasy architecture: the 10-bit byte  (Brett)
+* Re: Fantasy architecture: the 10-bit byte  (Stephen Fuld)
|`* Re: Fantasy architecture: the 10-bit byte  (Russell Wallace)
| `* Re: Fantasy architecture: the 10-bit byte  (Marcus)
|  `* Re: Fantasy architecture: the 10-bit byte  (BGB)
|   `- Re: Fantasy architecture: the 10-bit byte  (Russell Wallace)
+* Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
|`* Re: Fantasy architecture: the 10-bit byte  (Russell Wallace)
| +- Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| +* Re: Fantasy architecture: the 10-bit byte  (BGB)
| |`* Re: Fantasy architecture: the 10-bit byte  (Thomas Koenig)
| | `* Re: Fantasy architecture: the 10-bit byte  (BGB)
| |  `* Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
| |   `* Re: Fantasy architecture: the 10-bit byte  (BGB)
| |    +* Re: Fantasy architecture: the 10-bit byte  (robf...@gmail.com)
| |    |+* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    ||+- Re: Fantasy architecture: the 10-bit byte  (robf...@gmail.com)
| |    ||+- Re: Fantasy architecture: the 10-bit byte  (BGB)
| |    ||`* Re: Fantasy architecture: the 10-bit byte  (Terje Mathisen)
| |    || +- Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| |    || +* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |+* Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| |    || ||`- Re: Fantasy architecture: the 10-bit byte  (BGB)
| |    || |`* Re: Fantasy architecture: the 10-bit byte  (Terje Mathisen)
| |    || | +- Re: Fantasy architecture: the 10-bit byte  (JimBrakefield)
| |    || | `* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |  `* Re: Fantasy architecture: the 10-bit byte  (Terje Mathisen)
| |    || |   +* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |   |`- Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
| |    || |   +* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |   |`* Re: Fantasy architecture: the 10-bit byte  (Terje Mathisen)
| |    || |   | `* Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |   |  +* Re: Fantasy architecture: the 10-bit byte  (Terje Mathisen)
| |    || |   |  |+- Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| |    || |   |  |`- Re: Fantasy architecture: the 10-bit byte  (Michael S)
| |    || |   |  `- Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| |    || |   `- Re: Fantasy architecture: the 10-bit byte  (BGB)
| |    || `* Re: Fantasy architecture: the 10-bit byte  (BGB)
| |    ||  `- Re: Fantasy architecture: the 10-bit byte  (robf...@gmail.com)
| |    |`- Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
| |    `* Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
| |     `- Re: Fantasy architecture: the 10-bit byte  (BGB)
| `- Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
+* Re: Fantasy architecture: the 10-bit byte  (John Levine)
|+* Re: Fantasy architecture: the 10-bit byte  (MitchAlsup)
||+- Re: Fantasy architecture: the 10-bit byte  (Quadibloc)
||+* Re: Fantasy architecture: the 10-bit byte  (Russell Wallace)
|||`* Re: Fantasy architecture: the 10-bit byte  (robf...@gmail.com)
||| `* Re: old circuits, Fantasy architecture: the 10-bit byte  (John Levine)
|||  +- Re: old circuits, Fantasy architecture: the 10-bit byte  (Stephen Fuld)
|||  `- Re: old circuits, Fantasy architecture: the 10-bit byte  (BGB)
||`* Re: Fantasy architecture: the 10-bit byte  (Scott Lurndal)
|| `- Re: 12 bits, Fantasy architecture: the 10-bit byte  (John Levine)
|+- Re: Fantasy architecture: the 10-bit byte  (Anton Ertl)
|`- Re: Fantasy architecture: the 10-bit byte  (mac)
+* Re: Fantasy architecture: the 10-bit byte  (Thomas Koenig)
|+* Re: bad float, Fantasy architecture: the 10-bit byte  (John Levine)
||+- Re: bad float, Fantasy architecture: the 10-bit byte  (Tim Rentsch)
||`* Re: bad float, Fantasy architecture: the 10-bit byte  (Quadibloc)
|| +* Re: bad float, Fantasy architecture: the 10-bit byte  (Quadibloc)
|| |`- Re: bad float, Fantasy architecture: the 10-bit byte  (David Brown)
|| `* Re: bad float, Fantasy architecture: the 10-bit byte  (Anton Ertl)
||  +* Re: bad float, Fantasy architecture: the 10-bit byte  (John Levine)
||  |`* Re: bad float, Fantasy architecture: the 10-bit byte  (MitchAlsup)
||  | +- Re: bad float, Fantasy architecture: the 10-bit byte  (Quadibloc)
||  | `- Re: bad float, Fantasy architecture: the 10-bit byte  (Scott Lurndal)
||  +* Re: bad float, Fantasy architecture: the 10-bit byte  (Quadibloc)
||  |+- Re: bad float, Fantasy architecture: the 10-bit byte  (Stephen Fuld)
||  |`- Re: science and commerce, was bad float, Fantasy architecture: the 10-bit byte  (John Levine)
||  `- Re: bad float, Fantasy architecture: the 10-bit byte  (JimBrakefield)
|`* Re: Fantasy architecture: the 10-bit byte  (Quadibloc)
| `* Re: Fantasy architecture: the 10-bit byte  (Stephen Fuld)
|  `* Re: Fantasy architecture: the 10-bit byte  (Quadibloc)
|   `- Re: Fantasy architecture: the 10-bit byte  (Quadibloc)
+- Re: Fantasy architecture: the 10-bit byte  (EricP)
`- Re: Fantasy architecture: the 10-bit byte  (Paul A. Clayton)

Re: Fantasy architecture: the 10-bit byte

<tnq4um$d0c9$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29644&group=comp.arch#29644

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Mon, 19 Dec 2022 10:53:41 -0600
Organization: A noiseless patient Spider
Lines: 110
Message-ID: <tnq4um$d0c9$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 19 Dec 2022 16:53:42 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="8aab342171ee03fe9a468bf9b13e8277";
logging-data="426377"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+jyj8ciw9TfdERKmHLc992"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:DTXnPF8MewlxnT5j8d7bXx9bDlw=
Content-Language: en-US
In-Reply-To: <tnp38h$160bb$1@newsreader4.netcologne.de>
 by: BGB - Mon, 19 Dec 2022 16:53 UTC

On 12/19/2022 1:18 AM, Thomas Koenig wrote:
> BGB <cr88192@gmail.com> schrieb:
>> On 12/18/2022 5:35 PM, Russell Wallace wrote:
>>> On Sunday, December 18, 2022 at 6:03:50 PM UTC, Anton Ertl wrote:
>>>> Russell Wallace <russell...@gmail.com> writes:
>>>>> I saw Fred Brooks in an interview remark that one of his few regrets about
>>>>> the 8-bit byte was that 64-bit floating point is not quite enough for double
>>>>> precision.
>>>>
>>>> Those who made the IEEE 754 standard thought differently.
>>>
>>> Did they? Look at the resources Intel spent on getting 80-bit floating point into the 8087.
>>>
>>
>> For most use cases, 80 bit is overkill.
>>
>> Meanwhile:
>> 64-bit can do "nearly everything";
>> 32-bit is "usually sufficient";
>> 16-bit is "often sufficient", but more hit/miss.
>>
>>
>> I suspect likely Binary16 would have been far more popular had it been
>> popular 20 years earlier.
>>
>>
>> Not sure why Binary16 wasn't a thing in the 80s, given values were
>> (generally) smaller back then.
>
> For scientific or engineering computations, 32-bit floats are
> barely adequate, or they don't work at all. This is why a lot
> of scientific code is in 64-bit. 36-bit floats were much better.
>

But, the IBM PC was mostly intended for home and business users? ...

Or, I think it was more like: the main PC was intended for business and the
PCjr for home users, but the PCjr was a flop, with people either going for
the full PC or for PC clones.

Granted, I guess it is possible Intel could have had markets for this
other than the IBM PC ?...

> And binary16 is only useful for very limited applications, like neural
> networks. Even calculating everyday geometries will get you into
> trouble.

They are usable IME for:
Neural Nets (*1);
Pixel calculations;
Audio filtering;
...

*1: Though, in my own testing, it is necessary to fiddle with things to
prevent the net from training itself in ways where values go out of range
(in effect, the training algorithm also needs to use Binary16).

For 3D geometry, they work well enough if the 3D model isn't too large
or doesn't require too much detail.

For something like a character model, no one is likely to notice.

For something like scene geometry, one usually needs a little more than
this. For things like the transformation matrices, one really needs full
single precision if possible.

But, with ~ 3 significant figures, they should be fairly widely
applicable to many sorts of problems (along with the common tendency of
people to express typical values as x.xx*10^y or similar).

And, also all of the people that use 3.14 as their standard value for
PI, 2.72 for E, ... And also often don't want answers that are much
longer than 2 or 3 digits.

Granted, the dynamic range for Binary16 isn't particularly large.

But, as noted, for some things it wouldn't really work. For example, it
couldn't give cent-accurate totals when adding up the values on a typical
shopping receipt, ...

But, for many of the types of problems where one might otherwise use,
say, 13.3 fixed point or similar, Binary16 could have been a usable
alternative.

More just a question of why Binary32 was seemingly seen as the minimum
here, in an era where aggressive cost optimization of pretty much
everything seemed like a sensible option.

Or, for x87, why it bothered with a bunch of complex operators (such as
FSIN and FCOS, ...), when presumably the FPU could have been simpler and
cheaper had they had people to do most of these in software ?...

Or, if Binary16 itself would have added cost, why not have had an
instruction that truncated the Binary32 format to 16 bits on store, and
padded it with zeroes on load? It shouldn't have added too much
additional cost to whatever mechanism they were using to Load/Store
values to RAM.
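
As a rough editorial sketch (not part of the original post), such a
truncate-on-store / zero-pad-on-load pair would look like this in C,
assuming IEEE-754 Binary32; keeping only the top 16 bits is essentially
the "BFloat16" layout mentioned later in the thread:

  #include <stdint.h>
  #include <string.h>

  /* Store: keep only the top 16 bits of a Binary32 (sign, 8 exponent
     bits, 7 mantissa bits); the low 16 mantissa bits are dropped. */
  static uint16_t f32_store_trunc16(float f)
  {
      uint32_t bits;
      memcpy(&bits, &f, sizeof bits);   /* reinterpret the float's bits */
      return (uint16_t)(bits >> 16);    /* truncate, no rounding */
  }

  /* Load: pad the low 16 bits with zeroes to rebuild a Binary32. */
  static float f32_load_pad16(uint16_t h)
  {
      uint32_t bits = (uint32_t)h << 16;
      float f;
      memcpy(&f, &bits, sizeof f);
      return f;
  }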

Re: Fantasy architecture: the 10-bit byte

<2022Dec19.191001@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=29652&group=comp.arch#29652

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Mon, 19 Dec 2022 18:10:01 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 52
Message-ID: <2022Dec19.191001@mips.complang.tuwien.ac.at>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com> <tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de> <tnq4um$d0c9$1@dont-email.me>
Injection-Info: reader01.eternal-september.org; posting-host="7a5f832fb5b53f4f42cfbffecec3acfb";
logging-data="448853"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18GLi6akn++mg5zHhYTo8+B"
Cancel-Lock: sha1:eNyOLsITrrQQbMBZx58vblLu7IU=
X-newsreader: xrn 10.11
 by: Anton Ertl - Mon, 19 Dec 2022 18:10 UTC

BGB <cr88192@gmail.com> writes:
[8087]
>Granted, I guess it is possible Intel could have had markets for this
>other than the IBM PC ?...

Certainly.

Intel started the 8087 project in 1977 and the 8087 was launched in
1980, before the IBM PC (1981). And the IBM PC (and its clones) were
not an instant hit; e.g., in the early stages of the 386 project
(started 1982) the 386 was just a minor project.

Interestingly, there were earlier math coprocessors: The Am9511 (1977,
fixed point, but with, e.g., trigonometric functions) and Am9512
(1979, floating-point, compatible with draft 754; binary32 and
binary64), which Intel licensed as 8231 and 8232.

>Or, for x87, why it bothered with a bunch of complex operators (such as
>FSIN and FCOS, ...), when presumably the FPU could have been simpler and
>cheaper had they had people to do most of these in software ?...

CISC was in full swing in 1977, RISC at the time limited to the IBM
801 group.

There are also cases where a more complex instruction allows later
improved hardware implementations that are not possible in software,
although I think this kind of thinking was far more common in those
times than warranted, and it's unclear to me whether the 8087 or its
successors ever made use of that (e.g., additional precision in
intermediate results).

But thinking about it again (after reading up on the earlier
co-processors), I think the main reason was that the 8087 ran
asynchronously to the 8086/8088. So the benefit of FSIN was that the
CPU could start an FSIN operation, then do a lot of other stuff, then
deal with the result of the FSIN; by contrast, with an FSIN software
function, you call it, and the CPU is blocked until it is finished.

>Or, if Binary16 itself would have added cost, why not have had an
>instruction that truncated the Binary32 format to 16 bits on store, and
>padded it with with zeroes on load.

The 8087 converts everything into its internal 80+-bit format on
loading and back on storing (plus some optional rounding of the
mantissa to 53 or 23 bits on computations). Binary16 would certainly
have added a cost, and, at the time, provided no benefit. People were
not interested in FP16 until a few years ago.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Fantasy architecture: the 10-bit byte

<2022Dec19.194710@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=29653&group=comp.arch#29653

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Mon, 19 Dec 2022 18:47:10 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 54
Message-ID: <2022Dec19.194710@mips.complang.tuwien.ac.at>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
Injection-Info: reader01.eternal-september.org; posting-host="7a5f832fb5b53f4f42cfbffecec3acfb";
logging-data="448853"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX190NkEf+zDu4CwKldk6B32h"
Cancel-Lock: sha1:zxGYDzmrrrcWvNMQcHoA3/Qcig8=
X-newsreader: xrn 10.11
 by: Anton Ertl - Mon, 19 Dec 2022 18:47 UTC

Russell Wallace <russell.wallace@gmail.com> writes:
>On Sunday, December 18, 2022 at 6:03:50 PM UTC, Anton Ertl wrote:
>> Russell Wallace <russell...@gmail.com> writes:
>> >I saw Fred Brooks in an interview remark that one of his few regrets about
>> >the 8-bit byte was that 64-bit floating point is not quite enough for double
>> >precision.
>>
>> Those who made the IEEE 754 standard thought differently.
>
>Did they?

According to
<https://en.wikipedia.org/wiki/IEEE-754#Basic_and_interchange_formats>:

|The binary32 and binary64 formats are the single and double formats of
|IEEE 754-1985 respectively.

>> A 10-bit data bus is 25% more expensive than an 8-bit data bus. At
>> the same time, with your 20/40-bit instructions, a 10-bit bus requires
>> 2-4 bus cycles to load an instruction, which reduces performance
>> significantly.
>
>Compared to what?

Compared to having 10-bit instructions (plus immediate operands).
There is a reason why instruction sets with many implicit registers
like the 6502 and 8086 were successful when we still had 8-bit busses
and no I-cache.

>The 2010 can add a pair of 20-bit numbers in two cycles.

A 2010 with 10-bit instructions can take one cycle for an addition (if
you have a 20-bit ALU).

>The 6502 transistor count is much more constrained. (For good reason; they
>were explicitly aimed at a minimum-cost CPU for embedded applications.)

Maybe. But it was hugely successful in general-purpose computers for
the cost reason.

>I don't see a lot of room to do better within that transistor count.

I do. And actually it's a small-enough project that a dedicated
hobbyist could achieve it in reasonable time. But I am not taking it
on; proving that point is not important enough to me.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: bad float, Fantasy architecture: the 10-bit byte

<tnr2cn$j9b$1@gal.iecc.com>

https://www.novabbs.com/devel/article-flat.php?id=29658&group=comp.arch#29658

Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!news.iecc.com!.POSTED.news.iecc.com!not-for-mail
From: joh...@taugh.com (John Levine)
Newsgroups: comp.arch
Subject: Re: bad float, Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 01:16:07 -0000 (UTC)
Organization: Taughannock Networks
Message-ID: <tnr2cn$j9b$1@gal.iecc.com>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <tnov9j$15u8a$1@newsreader4.netcologne.de>
Injection-Date: Tue, 20 Dec 2022 01:16:07 -0000 (UTC)
Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970";
logging-data="19755"; mail-complaints-to="abuse@iecc.com"
In-Reply-To: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <tnov9j$15u8a$1@newsreader4.netcologne.de>
Cleverness: some
X-Newsreader: trn 4.0-test77 (Sep 1, 2010)
Originator: johnl@iecc.com (John Levine)
 by: John Levine - Tue, 20 Dec 2022 01:16 UTC

According to Thomas Koenig <tkoenig@netcologne.de>:
>Russell Wallace <russell.wallace@gmail.com> schrieb:
>
>> I saw Fred Brooks in an interview remark that one of his few
>> regrets about the 8-bit byte was that 64-bit floating point is
>> not quite enough for double precision.
>
>No regrets about the 32-bit floating point real that was introduced?
>IBM certainly knew better, from the 704.
>
>Chosing the exponent range of real and double to coincide was not
>a great decision, either.

Indeed, but none of that was as bad as doing hex normalization, which
precluded a hidden bit, combined with no rounding. That lost three bits
of accuracy on each operation. They retrofitted guard digits in the
field, which helped, but not enough.

It is really strange that they did all sorts of simulations but
somehow missed the key fact that leading float digits are distributed
geometrically, not linearly. It's not like it's hard to figure out.
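
As a quick editorial illustration of that geometric distribution (not part
of the original post): values spread evenly on a log scale produce leading
hex digits with probability log16((d+1)/d), so digit 1 (three leading zero
bits in the mantissa) shows up roughly 25% of the time while digit 15 shows
up roughly 2%. A minimal C sketch:

  #include <math.h>
  #include <stdio.h>
  #include <stdlib.h>

  int main(void)
  {
      long count[16] = {0};
      for (long i = 0; i < 1000000; i++) {
          /* magnitudes spread evenly on a log scale */
          double x = pow(10.0, 6.0 * rand() / (double)RAND_MAX);
          while (x >= 1.0)                  /* hex-normalize into [1/16, 1) */
              x /= 16.0;
          count[(int)(x * 16.0)]++;         /* tally the leading hex digit */
      }
      for (int d = 1; d < 16; d++)
          printf("digit %2d: observed %5.2f%%, log16 predicts %5.2f%%\n",
                 d, 100.0 * count[d] / 1000000.0,
                 100.0 * log((d + 1.0) / d) / log(16.0));
      return 0;
  }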

--
Regards,
John Levine, johnl@taugh.com, Primary Perpetrator of "The Internet for Dummies",
Please consider the environment before reading this e-mail. https://jl.ly

Re: 12 bits, Fantasy architecture: the 10-bit byte

<tnr411$qpg$1@gal.iecc.com>

https://www.novabbs.com/devel/article-flat.php?id=29659&group=comp.arch#29659

Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!news.iecc.com!.POSTED.news.iecc.com!not-for-mail
From: joh...@taugh.com (John Levine)
Newsgroups: comp.arch
Subject: Re: 12 bits, Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 01:44:01 -0000 (UTC)
Organization: Taughannock Networks
Message-ID: <tnr411$qpg$1@gal.iecc.com>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <tnnt73$251h$1@gal.iecc.com> <0782c181-4db6-454f-80b0-341db41b5e11n@googlegroups.com> <K9%nL.38670$t5W7.6267@fx13.iad>
Injection-Date: Tue, 20 Dec 2022 01:44:01 -0000 (UTC)
Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970";
logging-data="27440"; mail-complaints-to="abuse@iecc.com"
In-Reply-To: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <tnnt73$251h$1@gal.iecc.com> <0782c181-4db6-454f-80b0-341db41b5e11n@googlegroups.com> <K9%nL.38670$t5W7.6267@fx13.iad>
Cleverness: some
X-Newsreader: trn 4.0-test77 (Sep 1, 2010)
Originator: johnl@iecc.com (John Levine)
 by: John Levine - Tue, 20 Dec 2022 01:44 UTC

According to Scott Lurndal <slp53@pacbell.net>:
>MitchAlsup <MitchAlsup@aol.com> writes:
>>On Sunday, December 18, 2022 at 2:29:26 PM UTC-6, John Levine wrote:
>>> According to Russell Wallace <russell...@gmail.com>:
>>> >In that scenario, what's the best that can be done? I don't think a 32-bit RISC microprocessor can be made with the
>>> >process technology of 1974, and even if it could, I suspect it would be too expensive to succeed in the market.
>>> >
>>> >My idea is to expand the byte to 10 bits, and build a CPU with a 20-bit word size and linear address space, and a 10-bit
>>> >data bus. I'll call this architecture the 2010.
>>> Repeat after me: There Are No New Bad Ideas ...
>>>
>>> In the 1970s, BBN built the C/30 to replace the 16-bit Honeywell 316
>>> machines used in Arpanet IMPs because Honeywell stopped making 316s.
>>> The C/30 was microprogrammed to emulate a 316 so it could run the
>>> existing IMP code. BBN believed (wrongly as it turned out) that they
>>> were in the computer business so they expanded it to a larger C/70
>>> just as you proposed, with 10 bit bytes and a 20 bit address space.
>>>
>>> It was a complete failure even though it was quite fast and phyiscally
>>> worked fine. I don't know all the reasons it failed but I do recall
>>> talking to one of the people working on it who told me that porting
>>> code from 8-bit byte systems was slow and painful because there were
>>> so many implicit assumptions about the byte size.
>>>
>>> There were plenty of machines with 8-bit bytes and 16 bit addresses
>>> that used a variety of mapping and bank switching hacks to handle
>>> more memory. The PDP-11 got up to 22 bit physical addresses before
>>> it died, and the 80286 had 24 bit addresses.
>>>
>>> When IBM chose 8 bit bytes in 1964 for S/360 they really hit a
>>> sweet spot. I don't think there was ever another new architecture
>>> that didn't have power of 2 addresses other than some special
>>> purpose DSPs.
>><
>>PDP-8 was surely post 360.
>
>PDP-8 (released 1965, designed 63-64) was based on the 12-bit LINC from 1962.

The PDP-8 was a reimplemented PDP-5, which in turn was a PDP-4 cut
down from 18 bits to 12. According to Gordon Bell, the PDP-4 and -5
were influenced by the LINC. But they were different enough that DEC
later made the LINC-8 which was basically a LINC and a PDP-8 lashed
together and sharing memory, and later the much smaller and cheaper
PDP-12 which had a single CPU that could switch between PDP-8 and LINC
modes. The biggest legacy of the LINC was LINCtape which in slightly
modified form became DECtape, the block addressable tape we all used
before there were floppy disks.

>>PDP-6 was surely post 1964.
>
>The 1963 36-bit PDP-6 followed the 18-bit PDP-1 from 1959.

The PDP-6 was nothing like the PDP-1. The PDP-1 had a single
accumulator and an instruction set that was clearly the predecessor of
the -4, the -5 and the rest of DEC's 18 and 12 bit machines. The PDP-6
had 16 registers and a large orthogonal instruction set. Legend says
that the PDP-6 was designed at the MIT Tech Model Railroading Club.

--
Regards,
John Levine, johnl@taugh.com, Primary Perpetrator of "The Internet for Dummies",
Please consider the environment before reading this e-mail. https://jl.ly

Re: Fantasy architecture: the 10-bit byte

<tnrfd1$jt2c$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29660&group=comp.arch#29660

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Mon, 19 Dec 2022 22:58:08 -0600
Organization: A noiseless patient Spider
Lines: 147
Message-ID: <tnrfd1$jt2c$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 20 Dec 2022 04:58:09 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="509700c8e11564b53bfbcf4992ddaf3a";
logging-data="652364"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19MEozkzSY27/OH8geMpIHq"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:0DM0uWw2OeDVlnSmwmpXKXU1fNc=
In-Reply-To: <2022Dec19.191001@mips.complang.tuwien.ac.at>
Content-Language: en-US
 by: BGB - Tue, 20 Dec 2022 04:58 UTC

On 12/19/2022 12:10 PM, Anton Ertl wrote:
> BGB <cr88192@gmail.com> writes:
> [8087]
>> Granted, I guess it is possible Intel could have had markets for this
>> other than the IBM PC ?...
>
> Certainly.
>
> Intel started the 8087 project in 1977 and the 8087 was launched in
> 1980, before the IBM PC (1981). And the IBM PC (and its clones) were
> not an instant hit; e.g., in the early stages of the 386 project
> (started 1982) the 386 was just a minor project.
>

OK, wasn't aware the 8087 predated the PC. Sort of thought it was a
late-stage add-on.

> Interestingly, there were earlier math coprocessors: The Am9511 (1977,
> fixed point, but with, e.g., trigonometric functions) and Am9512
> (1979, floating-point, compatible with draft 754; binary32 and
> binary64), which Intel licensed as 8231 and 8232.
>

Yeah, don't know much about them.

>> Or, for x87, why it bothered with a bunch of complex operators (such as
>> FSIN and FCOS, ...), when presumably the FPU could have been simpler and
>> cheaper had they had people to do most of these in software ?...
>
> CISC was in full swing in 1977, RISC at the time limited to the IBM
> 801 group.
>
> There are also cases where a more complex instruction allows later
> improved hardware implementations that are not possible in software,
> although I think this kind of thinking was far more common in those
> times than waranted, and it's unclear to me whether the 8087 or its
> successors ever made use of that (e.g., additional precision in
> intermediate results).
>

Dunno there.

I think the eventual result was that SSE came along, and then by x86-64
people had mostly abandoned x87 in favor of SSE, and typically doing the
math operations in software rather than using x87 ops (given SSE doesn't
have a lot of this stuff either).

If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
leave nearly everything else to software.

Did eventually re-add FDIV, in my case, which is accurate but not
particularly fast (ended up routing it through a Shift-ADD divider,
noting that it didn't seem like too huge of a jump to make it do FP divide
in addition to integer divide).

I had also re-added an FSQRT instruction, which is neither particularly
fast nor accurate. No real advantage over doing it in software.

I have noted previously that N-R seemingly is unable to converge the
last 4 bits or so of the mantissa. Seemingly when it gets to this point,
it just sorta jumps around and doesn't actually reach the answer (it
seems one would need some sub-ULP bits to converge on an exact answer here).
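
As a minimal editorial sketch of the kind of N-R refinement being described
(not BGB's hardware; uses the well-known bit-trick seed for 1/sqrt(x) and
iterates entirely in Binary32, which is why the last few ULPs tend not to
settle):

  #include <math.h>
  #include <stdint.h>
  #include <stdio.h>
  #include <string.h>

  /* Newton-Raphson refinement of 1/sqrt(x), done entirely in Binary32.
     Each step is rounded back to 24 mantissa bits, so the iteration can
     stall a few ULPs away from the correctly rounded result. */
  static float rsqrt_nr(float x)
  {
      uint32_t i;
      float y;
      memcpy(&i, &x, sizeof i);
      i = 0x5f3759dfu - (i >> 1);              /* rough initial guess */
      memcpy(&y, &i, sizeof y);
      for (int k = 0; k < 4; k++)
          y = y * (1.5f - 0.5f * x * y * y);   /* one N-R step per pass */
      return y;
  }

  int main(void)
  {
      float x = 2.0f;
      printf("x*rsqrt(x) = %.9g, sqrtf(x) = %.9g\n",
             x * rsqrt_nr(x), sqrtf(x));
      return 0;
  }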

> But thinking about it again (after reading up on the earlier
> co-processors), I think the main reason was that the 8087 ran
> asynchronously to the 8086/8088. So the benefit of FSIN was that the
> CPU could start an FSIN operation, then do a lot of other stuff, then
> deal with the result of the FSIN; by contrast, with an FSIN software
> function, you call it, and the CPU is blocked until it is finished.
>

Probably true.

There is no async aspect in my case, as the FPU is effectively glued
onto the same logic as the integer ISA.

>> Or, if Binary16 itself would have added cost, why not have had an
>> instruction that truncated the Binary32 format to 16 bits on store, and
>> padded it with with zeroes on load.
>
> The 8087 converts everything into its internal 80+-bit format on
> loading and back on storing (plus some optional rounding of the
> mantissa to 53 or 23 bits on computations). Binary16 would certainly
> have added a cost, and, at the time, provided no benefit. People were
> not interesting in FP16 until a few years ago.
>

Probably true enough.

I had been using it in my various 3D engines, mostly in the context of
vertex arrays and similar (was supported by OpenGL via GL_HALF_FLOAT and
similar).

In my BJX2 project, its role is somewhat expanded, but it is still a
non-standard extension as far as C goes.

In my case, I use non-standard conversion rules, which can save some
cost but are a little funky. Cheaper cases don't bother with rounding
(since rounding is one of the more expensive parts of a narrowing
conversion).

Say:
F32 -> F16:
  { val[31],
    ((val[30]==val[29]) ||
     ((val[29:27]!=3'b000) &&
      (val[29:27]!=3'b111))) ?
      (val[30] ? 5'h1F : 5'h00) :     // out of range: saturate to Inf / 0
      { val[30], val[26:23] },        // in range: repack the exponent
    val[22:13] }                      // keep the top 10 mantissa bits

F16 -> F32:
  { val[15:14],
    ((val[14] || (val[14:10]==5'h00)) && !(val[14:10]==5'h1F)) ?
      3'b000 : 3'b111,                // re-bias the exponent
    val[13:10],                       // low exponent bits, as stored
    val[9:0], 13'h00 }                // zero-pad the mantissa

....

Another semi-popular option is "BFloat16", or "S.E8.F7" (essentially a
truncated Binary32), but this is less well supported in my case.

For hacking something onto the x87, this could make more sense. Could be
faked in software, but on x86, this would mean wrangling the values
around in memory.

Say (assuming an Intel style syntax):
  mov ax, [bx]            ; fetch the 16-bit (BFloat16) value
  mov [sp+2], ax          ; place it in the high half of a 32-bit slot
  xor ax, ax
  mov [sp+0], ax          ; zero the low half (pad the mantissa)
  fld dword ptr [sp+0]    ; load the padded Binary32 onto the x87 stack

At first, wrote it in GAS style syntax, but then noted Intel style would
be more era appropriate...

> - anton

Re: Fantasy architecture: the 10-bit byte

<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29661&group=comp.arch#29661

X-Received: by 2002:a37:cce:0:b0:6fe:b359:4896 with SMTP id 197-20020a370cce000000b006feb3594896mr15674876qkm.579.1671529532179;
Tue, 20 Dec 2022 01:45:32 -0800 (PST)
X-Received: by 2002:a4a:d982:0:b0:4a5:80d2:1a06 with SMTP id
k2-20020a4ad982000000b004a580d21a06mr1048487oou.21.1671529531891; Tue, 20 Dec
2022 01:45:31 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 01:45:31 -0800 (PST)
In-Reply-To: <tnrfd1$jt2c$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=99.251.79.92; posting-account=QId4bgoAAABV4s50talpu-qMcPp519Eb
NNTP-Posting-Host: 99.251.79.92
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at> <tnrfd1$jt2c$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: robfi...@gmail.com (robf...@gmail.com)
Injection-Date: Tue, 20 Dec 2022 09:45:32 +0000
Content-Type: text/plain; charset="UTF-8"
X-Received-Bytes: 7877
 by: robf...@gmail.com - Tue, 20 Dec 2022 09:45 UTC

On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> On 12/19/2022 12:10 PM, Anton Ertl wrote:
> > BGB <cr8...@gmail.com> writes:
> > [8087]
> >> Granted, I guess it is possible Intel could have had markets for this
> >> other than the IBM PC ?...
> >
> > Certainly.
> >
> > Intel started the 8087 project in 1977 and the 8087 was launched in
> > 1980, before the IBM PC (1981). And the IBM PC (and its clones) were
> > not an instant hit; e.g., in the early stages of the 386 project
> > (started 1982) the 386 was just a minor project.
> >
> OK, wasn't aware the 8087 predated the PC. Sort of thought it was a
> late-stage add-on.
> > Interestingly, there were earlier math coprocessors: The Am9511 (1977,
> > fixed point, but with, e.g., trigonometric functions) and Am9512
> > (1979, floating-point, compatible with draft 754; binary32 and
> > binary64), which Intel licensed as 8231 and 8232.
> >
> Yeah, don't know much about them.
> >> Or, for x87, why it bothered with a bunch of complex operators (such as
> >> FSIN and FCOS, ...), when presumably the FPU could have been simpler and
> >> cheaper had they had people to do most of these in software ?...
> >
> > CISC was in full swing in 1977, RISC at the time limited to the IBM
> > 801 group.
> >
> > There are also cases where a more complex instruction allows later
> > improved hardware implementations that are not possible in software,
> > although I think this kind of thinking was far more common in those
> > times than waranted, and it's unclear to me whether the 8087 or its
> > successors ever made use of that (e.g., additional precision in
> > intermediate results).
> >
> Dunno there.
>
>
> I think the eventual result was that SSE came along, and then by x86-64
> people had mostly abandoned x87 in favor of SSE, and typically doing the
> math operations in software rather than using x87 ops (given SSE doesn't
> have a lot of this stuff either).
>
> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> leave nearly everything else to software.
>
>
> Did eventually re-add FDIV, in my case, which is accurate but not
> particularly fast (ended up routing it though a Shift-ADD divider,
> noting as it didn't seem like too huge of a jump to make it do FP divide
> in addition to integer divide).
>
> I had also re-added an FSQRT instruction, which is neither particularly
> fast nor accurate. No real advantage over doing it in software.
>
> I have noted previously that N-R seemingly is unable to converge the
> last 4 bits or so of the mantissa. Seemingly when it gets to this point,
> it just sorta jumps around and doesn't actually reach the answer (it
> seems one would need some sub-ULP bits to converge on an exact answer here).
> > But thinking about it again (after reading up on the earlier
> > co-processors), I think the main reason was that the 8087 ran
> > asynchronously to the 8086/8088. So the benefit of FSIN was that the
> > CPU could start an FSIN operation, then do a lot of other stuff, then
> > deal with the result of the FSIN; by contrast, with an FSIN software
> > function, you call it, and the CPU is blocked until it is finished.
> >
> Probably true.
>
> There is no async aspect in my case, as the FPU is effectively glued
> onto the same logic as the integer ISA.
> >> Or, if Binary16 itself would have added cost, why not have had an
> >> instruction that truncated the Binary32 format to 16 bits on store, and
> >> padded it with with zeroes on load.
> >
> > The 8087 converts everything into its internal 80+-bit format on
> > loading and back on storing (plus some optional rounding of the
> > mantissa to 53 or 23 bits on computations). Binary16 would certainly
> > have added a cost, and, at the time, provided no benefit. People were
> > not interesting in FP16 until a few years ago.
> >
> Probably true enough.
>
> I had been using it in my various 3D engines, mostly in the context of
> vertex arrays and similar (was supported by OpenGL via GL_HALF_FLOAT and
> similar).
>
> In my BJX2 project, its role is somewhat expanded, but it is still a
> non-standard extension as far as C goes.
>
>
> In my case, I use non-standard conversion rules, which can save some
> cost but are a little funky. Cheaper cases don't bother with rounding
> (since rounding is one of the more expensive parts of a narrowing
> conversion).
>
> Say:
> F32 -> F16:
> { val[31],
> (val[30]==val[29]) ||
> ((val[29:27]!=3'b000) &&
> (val[29:27]!=3'b111)) ?
> (val[30] ? 5'h1F : 5'h00) :
> { val[30], val[26:23] },
> val[22:13] }
>
> F16 -> F32:
> { val[15:14],
> ((val[14] || (val[14:10]==5'h00)) && !(val[14:10]==5'h1F)) ?
> 3:b000 : 3'b111,
> val[9:0], 13'h00 }
>
> ...
>
>
> Another semi-popular option is "BFloat16", or "S.E8.F7" (essentially a
> truncated Binary32), but this is less well supported in my case.
>
> For hacking something onto the x87, this could make more sense. Could be
> faked in software, but on x86, this would mean wrangling the values
> around in memory.
>
> Say (assuming a GAS style syntax):
> mov ax, [bx]
> mov [sp+2], ax
> xor ax, ax
> mov [sp+2], ax
> fld dword ptr [sp+0]
>
> As first, wrote it in GAS style syntax, but then noted Intel style would
> be more era appropriate...
>
>
> > - anton
>If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>leave nearly everything else to software.

Bare minimum is good if transistors are limited. Otherwise, might as well do
other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
in one instruction with one normalize and round, and might be less hardware
than separate FADD/FSUB/FMUL. I like to include a handful more instructions,
like FCMP, FNEG, FABS, FSCALE. FSCALE works great with decimal float in
numeric-to-string conversions, for multiplying by 10 and dividing by 10.
There is a minimum set of operations required by IEEE 754 that is maybe
worth looking into.
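
(A small C illustration of the "one normalize and round" point, added
editorially; it relies on the C99 fma() from <math.h>, so link with -lm.
Computing a*b and then adding c rounds twice, while fma(a,b,c) rounds once
at the end:)

  #include <math.h>
  #include <stdio.h>

  int main(void)
  {
      /* The exact product 1 - 2^-54 needs more than 53 mantissa bits, so
         rounding it separately loses the information that fma() keeps. */
      double a = 1.0 + 0x1.0p-27;
      double b = 1.0 - 0x1.0p-27;
      double c = -1.0;

      double p = a * b;               /* rounded to double here... */
      double two_roundings = p + c;   /* ...and rounded again here */
      double one_rounding = fma(a, b, c);

      printf("a*b + c    = %.17g\n", two_roundings);  /* prints 0 */
      printf("fma(a,b,c) = %.17g\n", one_rounding);   /* -2^-54, ~ -5.6e-17 */
      return 0;
  }
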
I happen to be working on DFP right now for a 68k compatible. I have got 96-bit
triple precision working, now I am thinking of reducing that to 64-bit double
precision. I think it does not make a lot of sense to support lower precision
decimal float. Better to use binary floats for lower precision > 64 bits.

Re: Fantasy architecture: the 10-bit byte

<2022Dec20.111018@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=29663&group=comp.arch#29663

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 10:10:18 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 15
Message-ID: <2022Dec20.111018@mips.complang.tuwien.ac.at>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com> <tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de> <tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at> <tnrfd1$jt2c$1@dont-email.me>
Injection-Info: reader01.eternal-september.org; posting-host="86ac993b0548a288995e829eb99d9390";
logging-data="696753"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+CT2kyW/DzrrNFZcHMewny"
Cancel-Lock: sha1:ngqwr87zzVODz0Hy+uf671G/nq8=
X-newsreader: xrn 10.11
 by: Anton Ertl - Tue, 20 Dec 2022 10:10 UTC

BGB <cr88192@gmail.com> writes:
>I think the eventual result was that SSE came along, and then by x86-64
>people had mostly abandoned x87 in favor of SSE, and typically doing the
>math operations in software rather than using x87 ops (given SSE doesn't
>have a lot of this stuff either).

They could use the 80387 instructions if they provide a benefit; they
are still there. However, I just checked this, and on a Skylake with
glibc-2.31 sin() calls __sin_fma, and I see AVX-128 instructions in
the first part of this code, so it probably does not invoke FSIN.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Fantasy architecture: the 10-bit byte

<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29664&group=comp.arch#29664

X-Received: by 2002:ac8:687:0:b0:3a5:41fd:2216 with SMTP id f7-20020ac80687000000b003a541fd2216mr90360808qth.338.1671533240902;
Tue, 20 Dec 2022 02:47:20 -0800 (PST)
X-Received: by 2002:a05:6808:1a1f:b0:35e:728d:838c with SMTP id
bk31-20020a0568081a1f00b0035e728d838cmr1270787oib.118.1671533240652; Tue, 20
Dec 2022 02:47:20 -0800 (PST)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border-2.nntp.ord.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 02:47:20 -0800 (PST)
In-Reply-To: <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=199.203.251.52; posting-account=ow8VOgoAAAAfiGNvoH__Y4ADRwQF1hZW
NNTP-Posting-Host: 199.203.251.52
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: already5...@yahoo.com (Michael S)
Injection-Date: Tue, 20 Dec 2022 10:47:20 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 36
 by: Michael S - Tue, 20 Dec 2022 10:47 UTC

On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> >If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >leave nearly everything else to software.
> Bare minimum is good if transistors are limited. Otherwise, might as well do
> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> in one instruction with one normalize and round. Might be less hardware than
> separate FADD/FSUB/FMUL.

Compare apples to apples.
FMA with throughput of 1 per n clocks is smaller than FADD + FMUL with throughput of 1 per n clocks *each*.
But it is bigger than an FPU that does either FADD or FMUL per n clocks.
Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
Also, FMA needs a full 53x53=>106-bit multiplier. FMUL, on the other hand, only needs the 54 MS bits of the result
plus a wired-OR of the rest of the bits. I am not an expert in multiplier HW, but it seems to me that
there exists potential for HW savings.

> I like to include a handful more instructions. Like
> FCMP, FNEG, FABS, FSCALE.

I see no reason not to provide a full set of common logicals: AND, OR, XOR, plus NAND or ANDN.
The HW cost is close to 0; the usefulness in the absence of shifts and of integer add/sub is not great, but above 0.

> FSCALE works great with decimal float in numeric
> to string conversions for multiplying by 10 and dividing by 10. There is a
> minimum level of supported functions required for IEEE maybe worth looking
> into.

Well, when BGB said "If it were me", he almost certainly didn't mean decimal FP.

> I happen to be working on DFP right now for a 68k compatible. I have got 96-bit
> triple precision working, now I am thinking of reducing that to 64-bit double
> precision. I think it does not make a lot of sense to support lower precision
> decimal float. Better to use binary floats for lower precision > 64 bits.

There is little reason to include DFP hardware at all, but if you nevertheless
decide to include DFP then it has to be IEEE-754 DFP.

Re: Fantasy architecture: the 10-bit byte

<0f633dd3-acf3-4f8e-b9f0-40cec1b2fbb4n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29669&group=comp.arch#29669

X-Received: by 2002:a05:6214:5d87:b0:4c7:91ae:7458 with SMTP id mf7-20020a0562145d8700b004c791ae7458mr6958964qvb.51.1671556510464;
Tue, 20 Dec 2022 09:15:10 -0800 (PST)
X-Received: by 2002:a05:6870:a7aa:b0:143:af88:3b6c with SMTP id
x42-20020a056870a7aa00b00143af883b6cmr2152440oao.79.1671556510183; Tue, 20
Dec 2022 09:15:10 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 09:15:09 -0800 (PST)
In-Reply-To: <a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=99.251.79.92; posting-account=QId4bgoAAABV4s50talpu-qMcPp519Eb
NNTP-Posting-Host: 99.251.79.92
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <0f633dd3-acf3-4f8e-b9f0-40cec1b2fbb4n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: robfi...@gmail.com (robf...@gmail.com)
Injection-Date: Tue, 20 Dec 2022 17:15:10 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 4634
 by: robf...@gmail.com - Tue, 20 Dec 2022 17:15 UTC

On Tuesday, December 20, 2022 at 5:47:22 AM UTC-5, Michael S wrote:
> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> > On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> > >If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> > >leave nearly everything else to software.
> > Bare minimum is good if transistors are limited. Otherwise, might as well do
> > other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> > in one instruction with one normalize and round. Might be less hardware than
> > separate FADD/FSUB/FMUL.
> Compare apples to apples.
> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> But it is bigger than FPU that does either FADD or FMUL per n clocks.
> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> there exist a potential for HW savings.
> > I like to include a handful more instructions. Like
> > FCMP, FNEG, FABS, FSCALE.
> I see no reason to not provide full set of common logicals AND, OR, XOR, also NAND or ANDN.
> HW cost is close to 0, usefulness in absence of shifts and of integer add/sub is not great, but above 0.

I have these on float registers in Thor2023. Previously I had unified register files, so
all ops available for integers could also be used on floats. But I wanted to reduce
the size of the register spec field, so now I am using separate integer and float
registers. Cannot do much about the 68k core though. I suppose some of the reserved
opcodes could be used.

> > FSCALE works great with decimal float in numeric
> > to string conversions for multiplying by 10 and dividing by 10. There is a
> > minimum level of supported functions required for IEEE maybe worth looking
> > into.
> Well, when BGB said "If it were me", he almost certainly didn't mean decimal FP.
> > I happen to be working on DFP right now for a 68k compatible. I have got 96-bit
> > triple precision working, now I am thinking of reducing that to 64-bit double
> > precision. I think it does not make a lot of sense to support lower precision
> > decimal float. Better to use binary floats for lower precision > 64 bits.
> There are little reasons to include DFP hardware at all, but if nevertheless you
> decided to include DFP then it has to be IEEE-754 DFP.

The decimal floating point uses the IEEE-754 format. I would like to include the
minimum set of operations in hardware. I am not sure about the DPD digits. I am
using Mike Cowlishaw’s encoding / decoding.

See: http://speleotrove.com/decimal/DPDecimal.html
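
(Editorial aside: the point of a DPD "declet" is that three decimal digits,
000-999, fit in 10 bits since 2^10 = 1024. The sketch below just packs them
as a plain binary value to show the bit budget; it is NOT the actual DPD
encoding, which uses Cowlishaw's scheme so digits can be encoded and decoded
with a few gates instead of divisions:)

  #include <assert.h>
  #include <stdint.h>
  #include <stdio.h>

  /* Pack three decimal digits into 10 bits as a plain binary number. */
  static uint16_t pack3(unsigned d2, unsigned d1, unsigned d0)
  {
      assert(d2 < 10 && d1 < 10 && d0 < 10);
      return (uint16_t)(d2 * 100 + d1 * 10 + d0);    /* 0..999 < 1024 */
  }

  static void unpack3(uint16_t declet, unsigned d[3])
  {
      d[0] = declet % 10;
      d[1] = (declet / 10) % 10;
      d[2] = declet / 100;
  }

  int main(void)
  {
      unsigned d[3];
      uint16_t x = pack3(9, 8, 7);
      unpack3(x, d);
      printf("0x%03X -> %u%u%u\n", x, d[2], d[1], d[0]);   /* 0x3DB -> 987 */
      return 0;
  }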

Re: Fantasy architecture: the 10-bit byte

<tnsveq$oppq$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29670&group=comp.arch#29670

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 12:38:16 -0600
Organization: A noiseless patient Spider
Lines: 92
Message-ID: <tnsveq$oppq$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me>
<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 20 Dec 2022 18:38:18 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="509700c8e11564b53bfbcf4992ddaf3a";
logging-data="812858"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19LyfCyE1bFBw+G2w4/t79s"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:LTcY4gubnU82TQ4CbkiyZj5aq/Q=
In-Reply-To: <a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
Content-Language: en-US
 by: BGB - Tue, 20 Dec 2022 18:38 UTC

On 12/20/2022 4:47 AM, Michael S wrote:
> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>>> leave nearly everything else to software.
>> Bare minimum is good if transistors are limited. Otherwise, might as well do
>> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
>> in one instruction with one normalize and round. Might be less hardware than
>> separate FADD/FSUB/FMUL.
>
> Compare apples to apples.
> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> But it is bigger than FPU that does either FADD or FMUL per n clocks.
> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> there exist a potential for HW savings.
>

Yeah.

Separate FADD/FSUB + FMUL units were cheaper in my case, due in large
part to the mantissa issues (FMA needing a much larger mantissa).

Also, an FMA would end up needing nearly twice the latency, so it isn't too
big of an advantage in this sense over separate MUL/ADD.

I did end up with rounding-mode support on the main FPU, though my
low-precision unit (SIMD mostly) is hard-wired for Truncate. Hard-wiring
truncate is the cheapest option.

>> I like to include a handful more instructions. Like
>> FCMP, FNEG, FABS, FSCALE.
>
> I see no reason to not provide full set of common logicals AND, OR, XOR, also NAND or ANDN.
> HW cost is close to 0, usefulness in absence of shifts and of integer add/sub is not great, but above 0.
>

Yeah, I missed mentioning FCMP/FNEG/FABS; these ones are useful to have.

FCMP is basically a hacked integer compare, so not too bad. FNEG/FABS
are just twiddling the sign bit.
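
(A small editorial C sketch of what those amount to at the bit level,
assuming IEEE-754 Binary32 and a two's-complement machine: FABS/FNEG touch
only the sign bit, and an ordered FCMP can remap the sign-magnitude encoding
onto a monotonic integer key and compare as integers; NaNs need separate
handling:)

  #include <stdint.h>
  #include <string.h>

  static uint32_t f32_bits(float f) { uint32_t u; memcpy(&u, &f, sizeof u); return u; }
  static float bits_f32(uint32_t u) { float f; memcpy(&f, &u, sizeof f); return f; }

  static float fabs_f32(float f) { return bits_f32(f32_bits(f) & 0x7FFFFFFFu); }
  static float fneg_f32(float f) { return bits_f32(f32_bits(f) ^ 0x80000000u); }

  /* Map the float encoding to an integer whose order matches the float
     order (for non-NaN values); +0 and -0 both map to 0. */
  static int32_t f32_key(float f)
  {
      int32_t i = (int32_t)f32_bits(f);
      return i >= 0 ? i : INT32_MIN - i;   /* flip the ordering of negatives */
  }

  /* Returns -1, 0, or +1, like a three-way FCMP. */
  static int fcmp_f32(float a, float b)
  {
      int32_t ka = f32_key(a), kb = f32_key(b);
      return (ka > kb) - (ka < kb);
  }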

Format conversion ops are also nice to have, ...

Just would probably leave out FDIV and FSQRT (at least beyond cheap
approximate versions).

>> FSCALE works great with decimal float in numeric
>> to string conversions for multiplying by 10 and dividing by 10. There is a
>> minimum level of supported functions required for IEEE maybe worth looking
>> into.
>
> Well, when BGB said "If it were me", he almost certainly didn't mean decimal FP.
>

Yes, true.

Decimal FP is the extreme opposite of "cheap FPU"...

>> I happen to be working on DFP right now for a 68k compatible. I have got 96-bit
>> triple precision working, now I am thinking of reducing that to 64-bit double
>> precision. I think it does not make a lot of sense to support lower precision
>> decimal float. Better to use binary floats for lower precision > 64 bits.
>
> There are little reasons to include DFP hardware at all, but if nevertheless you
> decided to include DFP then it has to be IEEE-754 DFP.

Yeah.

I had at one point looked into trying to "borrow" the 128-bit format
used by .NET, but then decided against this and just sorta went with
Binary128 for the 128-bit floating point, as it ended up being both
faster and having more precision.

The MS format was, IIRC:
First 3 DWORDs, hold an integer value from 000000000 to 999999999.
Final DWORD, IIRC, holds a few more digits and the sign and exponent.

For a pure software implementation, it almost makes more sense than either
of the IEEE formats, since it can be implemented relatively
straightforwardly with 32-bit integer operations.

Albeit, it doesn't make the most efficient use of bits.

....

Re: Fantasy architecture: the 10-bit byte

<2022Dec20.191820@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=29672&group=comp.arch#29672

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 18:18:20 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 38
Message-ID: <2022Dec20.191820@mips.complang.tuwien.ac.at>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com> <tnnt73$251h$1@gal.iecc.com>
Injection-Info: reader01.eternal-september.org; posting-host="86ac993b0548a288995e829eb99d9390";
logging-data="815920"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/DfbX6aXQO4RlWWzxbPG8I"
Cancel-Lock: sha1:gi5FrLSO05fA6EYylviqb3aXg4M=
X-newsreader: xrn 10.11
 by: Anton Ertl - Tue, 20 Dec 2022 18:18 UTC

John Levine <johnl@taugh.com> writes:
>When IBM chose 8 bit bytes in 1964 for S/360 they really hit a
>sweet spot. I don't think there was ever another new architecture
>that didn't have power of 2 addresses other than some special
>purpose DSPs.

My thinking has been that the 16-bit-ness of the PDP-11, NOVA and
other 16-bit minis was due to the 4-bit bit-slice ALUs like the 74181
and the memory sizes of the time (12 address bits were too little, 20
bits more than necessary).

Many of these machines (e.g., the NOVA) are word-addressed, so 8-bit
bytes played a minor role.

And even the byte-addressed PDP-11 used little-endian byte order
(unlike IBM) and ASCII (unlike IBM), so apparently they did not feel a
need to be compatible with IBM's choices.

The IBM 1130 was too early for 4-bit bit-slice ALUs, and it was
word-addressed. Apparently it chose 16 bits due to the planned memory
size (4K-32K words).

Intel chose 8 bits for the 8008, because it was intended for a
terminal for IBM.

It's not obvious why Motorola chose 8 bits for the 6800. Was
compatibility with the 8-bit world already important? Was, e.g., 14
bits for the address already too close for comfort? Was it support
for BCD arithmetic (the 6800 has a DAA instruction)?

Maybe in the end we have to thank BCD arithmetic for the swift and
complete victory of powers-of-2 byte and word sizes. IIRC it was
also an important factor in IBM's decision to go for 8-bit bytes.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Fantasy architecture: the 10-bit byte

<2e275867-9119-497a-98d5-f438f33bdd34n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29673&group=comp.arch#29673

X-Received: by 2002:a05:620a:22b9:b0:702:e09:3af with SMTP id p25-20020a05620a22b900b007020e0903afmr819976qkh.691.1671563456820;
Tue, 20 Dec 2022 11:10:56 -0800 (PST)
X-Received: by 2002:a9d:6c81:0:b0:670:9f81:2457 with SMTP id
c1-20020a9d6c81000000b006709f812457mr1949843otr.384.1671563456510; Tue, 20
Dec 2022 11:10:56 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 11:10:56 -0800 (PST)
In-Reply-To: <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:b98a:605:cbf8:f0d2;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:b98a:605:cbf8:f0d2
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <2e275867-9119-497a-98d5-f438f33bdd34n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Tue, 20 Dec 2022 19:10:56 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 9310
 by: MitchAlsup - Tue, 20 Dec 2022 19:10 UTC

On Tuesday, December 20, 2022 at 3:45:33 AM UTC-6, robf...@gmail.com wrote:
> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> > On 12/19/2022 12:10 PM, Anton Ertl wrote:
> > > BGB <cr8...@gmail.com> writes:
> > > [8087]
> > >> Granted, I guess it is possible Intel could have had markets for this
> > >> other than the IBM PC ?...
> > >
> > > Certainly.
> > >
> > > Intel started the 8087 project in 1977 and the 8087 was launched in
> > > 1980, before the IBM PC (1981). And the IBM PC (and its clones) were
> > > not an instant hit; e.g., in the early stages of the 386 project
> > > (started 1982) the 386 was just a minor project.
> > >
> > OK, wasn't aware the 8087 predated the PC. Sort of thought it was a
> > late-stage add-on.
> > > Interestingly, there were earlier math coprocessors: The Am9511 (1977,
> > > fixed point, but with, e.g., trigonometric functions) and Am9512
> > > (1979, floating-point, compatible with draft 754; binary32 and
> > > binary64), which Intel licensed as 8231 and 8232.
> > >
> > Yeah, don't know much about them.
> > >> Or, for x87, why it bothered with a bunch of complex operators (such as
> > >> FSIN and FCOS, ...), when presumably the FPU could have been simpler and
> > >> cheaper had they had people to do most of these in software ?...
> > >
> > > CISC was in full swing in 1977, RISC at the time limited to the IBM
> > > 801 group.
> > >
> > > There are also cases where a more complex instruction allows later
> > > improved hardware implementations that are not possible in software,
> > > although I think this kind of thinking was far more common in those
> > > times than waranted, and it's unclear to me whether the 8087 or its
> > > successors ever made use of that (e.g., additional precision in
> > > intermediate results).
> > >
> > Dunno there.
> >
> >
> > I think the eventual result was that SSE came along, and then by x86-64
> > people had mostly abandoned x87 in favor of SSE, and typically doing the
> > math operations in software rather than using x87 ops (given SSE doesn't
> > have a lot of this stuff either).
> >
> > If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> > leave nearly everything else to software.
> >
> >
> > Did eventually re-add FDIV, in my case, which is accurate but not
> > particularly fast (ended up routing it though a Shift-ADD divider,
> > noting as it didn't seem like too huge of a jump to make it do FP divide
> > in addition to integer divide).
> >
> > I had also re-added an FSQRT instruction, which is neither particularly
> > fast nor accurate. No real advantage over doing it in software.
> >
> > I have noted previously that N-R seemingly is unable to converge the
> > last 4 bits or so of the mantissa. Seemingly when it gets to this point,
> > it just sorta jumps around and doesn't actually reach the answer (it
> > seems one would need some sub-ULP bits to converge on an exact answer here).
> > > But thinking about it again (after reading up on the earlier
> > > co-processors), I think the main reason was that the 8087 ran
> > > asynchronously to the 8086/8088. So the benefit of FSIN was that the
> > > CPU could start an FSIN operation, then do a lot of other stuff, then
> > > deal with the result of the FSIN; by contrast, with an FSIN software
> > > function, you call it, and the CPU is blocked until it is finished.
> > >
> > Probably true.
> >
> > There is no async aspect in my case, as the FPU is effectively glued
> > onto the same logic as the integer ISA.
> > >> Or, if Binary16 itself would have added cost, why not have had an
> > >> instruction that truncated the Binary32 format to 16 bits on store, and
> > >> padded it with with zeroes on load.
> > >
> > > The 8087 converts everything into its internal 80+-bit format on
> > > loading and back on storing (plus some optional rounding of the
> > > mantissa to 53 or 23 bits on computations). Binary16 would certainly
> > > have added a cost, and, at the time, provided no benefit. People were
> > > not interesting in FP16 until a few years ago.
> > >
> > Probably true enough.
> >
> > I had been using it in my various 3D engines, mostly in the context of
> > vertex arrays and similar (was supported by OpenGL via GL_HALF_FLOAT and
> > similar).
> >
> > In my BJX2 project, its role is somewhat expanded, but it is still a
> > non-standard extension as far as C goes.
> >
> >
> > In my case, I use non-standard conversion rules, which can save some
> > cost but are a little funky. Cheaper cases don't bother with rounding
> > (since rounding is one of the more expensive parts of a narrowing
> > conversion).
> >
> > Say:
> > F32 -> F16:
> > { val[31],
> > (val[30]==val[29]) ||
> > ((val[29:27]!=3'b000) &&
> > (val[29:27]!=3'b111)) ?
> > (val[30] ? 5'h1F : 5'h00) :
> > { val[30], val[26:23] },
> > val[22:13] }
> >
> > F16 -> F32:
> > { val[15:14],
> > ((val[14] || (val[14:10]==5'h00)) && !(val[14:10]==5'h1F)) ?
> > 3'b000 : 3'b111,
> > val[13:10], val[9:0], 13'h00 }
> >
> > ...
> >
> >
> > Another semi-popular option is "BFloat16", or "S.E8.F7" (essentially a
> > truncated Binary32), but this is less well supported in my case.
> >
> > For hacking something onto the x87, this could make more sense. Could be
> > faked in software, but on x86, this would mean wrangling the values
> > around in memory.
> >
> > Say (assuming a GAS style syntax):
> > mov ax, [bx]
> > mov [sp+2], ax
> > xor ax, ax
> > mov [sp+0], ax
> > fld dword ptr [sp+0]
> >
> > As first, wrote it in GAS style syntax, but then noted Intel style would
> > be more era appropriate...
> >
> >
> > > - anton
> >If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >leave nearly everything else to software.
<
> Bare minimum is good if transistors are limited. Otherwise, might as well do
> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> in one instruction with one normalize and round. Might be less hardware than
> separate FADD/FSUB/FMUL.
<
It is fair to say FMAC is not much more HW, but it is not fair to say it is
less HW; it is not. The augend adder needs to be 52×3+50 bits long, and the
normalize shifter is bigger.
<
> I like to include a handful more instructions. Like
> FCMP, FNEG, FABS, FSCALE. FSCALE works great with decimal float in numeric
<
With a few special cases, FCMP is an integer compare. The others are 1 cycle
calculations. Probably not wise to run them through the 3-4-5 cycle FMAC unit.
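
For illustration, a software analogue of that observation (a minimal
sketch in C; the helper names are made up, and NaN, as one of the "few
special cases", is deliberately left unhandled):

  #include <stdint.h>
  #include <string.h>

  /* Map a binary32 bit pattern to a signed key whose integer ordering
     matches the floating-point ordering; +0 and -0 both map to 0. */
  static int32_t f32_key(float f)
  {
      int32_t i;
      memcpy(&i, &f, sizeof i);              /* reinterpret the bits */
      return (i < 0) ? INT32_MIN - i : i;    /* flip the negative half */
  }

  /* Returns <0, 0, >0 like strcmp; NaN inputs would need a separate
     unordered check before using the keys. */
  static int fcmp32(float a, float b)
  {
      int32_t ka = f32_key(a), kb = f32_key(b);
      return (ka > kb) - (ka < kb);
  }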
<
> to string conversions for multiplying by 10 and dividing by 10. There is a
> minimum level of supported functions required for IEEE maybe worth looking
> into.
> I happen to be working on DFP right now for a 68k compatible. I have got 96-bit
> triple precision working, now I am thinking of reducing that to 64-bit double
> precision. I think it does not make a lot of sense to support lower precision
> decimal float. Better to use binary floats for lower precision > 64 bits.

Re: Fantasy architecture: the 10-bit byte

<tnt3gu$9nn$1@gioia.aioe.org>

https://www.novabbs.com/devel/article-flat.php?id=29674&group=comp.arch#29674

Path: i2pn2.org!i2pn.org!aioe.org!rd9pRsUZyxkRLAEK7e/Uzw.user.46.165.242.91.POSTED!not-for-mail
From: terje.ma...@tmsw.no (Terje Mathisen)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 20:47:42 +0100
Organization: Aioe.org NNTP Server
Message-ID: <tnt3gu$9nn$1@gioia.aioe.org>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me>
<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="9975"; posting-host="rd9pRsUZyxkRLAEK7e/Uzw.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101
Firefox/68.0 SeaMonkey/2.53.14
X-Notice: Filtered by postfilter v. 0.9.2
 by: Terje Mathisen - Tue, 20 Dec 2022 19:47 UTC

Michael S wrote:
> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>>> leave nearly everything else to software.
>> Bare minimum is good if transistors are limited. Otherwise, might as well do
>> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
>> in one instruction with one normalize and round. Might be less hardware than
>> separate FADD/FSUB/FMUL.
>
> Compare apples to apples.
> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> But it is bigger than FPU that does either FADD or FMUL per n clocks.
> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> there exist a potential for HW savings.
>
For me, a sufficient reason to include FMA is the fact that with the FMA
wide normalizer we gain the ability to handle subnormal inputs and
outputs at zero cycle cost and very marginal gate cost.

Terje

--
- <Terje.Mathisen at tmsw.no>
"almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<7a15e6cb-f2c9-4741-91f3-b2d45ff74c69n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29675&group=comp.arch#29675

X-Received: by 2002:ac8:6896:0:b0:3a5:6aa1:7cd6 with SMTP id m22-20020ac86896000000b003a56aa17cd6mr71577809qtq.146.1671567206043;
Tue, 20 Dec 2022 12:13:26 -0800 (PST)
X-Received: by 2002:a05:6870:ebc3:b0:13c:97e9:5d40 with SMTP id
cr3-20020a056870ebc300b0013c97e95d40mr2222412oab.42.1671567205701; Tue, 20
Dec 2022 12:13:25 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 12:13:25 -0800 (PST)
In-Reply-To: <tnt3gu$9nn$1@gioia.aioe.org>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:6d05:5e47:7756:554b;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:6d05:5e47:7756:554b
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com> <tnt3gu$9nn$1@gioia.aioe.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <7a15e6cb-f2c9-4741-91f3-b2d45ff74c69n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Tue, 20 Dec 2022 20:13:26 +0000
Content-Type: text/plain; charset="UTF-8"
X-Received-Bytes: 3297
 by: MitchAlsup - Tue, 20 Dec 2022 20:13 UTC

On Tuesday, December 20, 2022 at 1:47:45 PM UTC-6, Terje Mathisen wrote:
> Michael S wrote:
> > On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> >> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> >>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >>> leave nearly everything else to software.
> >> Bare minimum is good if transistors are limited. Otherwise, might as well do
> >> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> >> in one instruction with one normalize and round. Might be less hardware than
> >> separate FADD/FSUB/FMUL.
> >
> > Compare apples to apples.
> > FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> > But it is bigger than FPU that does either FADD or FMUL per n clocks.
> > Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> > Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> > plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> > there exist a potential for HW savings.
> >
> For me, a sufficient reason to include FMA is the fact that with the FMA
> wide normalizer we gain the ability to handle subnormal inputs and
> outputs at zero cycle cost and very marginal gate cost.
<
For me, a sufficient reason to include FMAC is that IEEE 754 demands it.
The rest, as they say, is gravy.
>
> Terje
>
> --
> - <Terje.Mathisen at tmsw.no>
> "almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<tntio2$ql9h$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29677&group=comp.arch#29677

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 18:07:28 -0600
Organization: A noiseless patient Spider
Lines: 20
Message-ID: <tntio2$ql9h$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <2022Dec20.111018@mips.complang.tuwien.ac.at>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 21 Dec 2022 00:07:30 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="1cedb4eb961a5e82c67f3875b01dfb56";
logging-data="873777"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19dfShKLa1Eee3vt8rH8Eze"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:IPpDtKPhU0yTn3cNGqHmk9g5rpw=
Content-Language: en-US
In-Reply-To: <2022Dec20.111018@mips.complang.tuwien.ac.at>
 by: BGB - Wed, 21 Dec 2022 00:07 UTC

On 12/20/2022 4:10 AM, Anton Ertl wrote:
> BGB <cr88192@gmail.com> writes:
>> I think the eventual result was that SSE came along, and then by x86-64
>> people had mostly abandoned x87 in favor of SSE, and typically doing the
>> math operations in software rather than using x87 ops (given SSE doesn't
>> have a lot of this stuff either).
>
> They could use the 80387 instructions if they provide a benefit; they
> are still there. However, I just checked this, and on a Skylake with
> glibc-2.31 sin() calls __sin_fma, and I see AVX-128 instructions in
> the first part of this code, so it probably does not invoke FSIN.
>

It seems that once x86-64 came along, and code moved over to it, the use
of x87 ops was typically dropped entirely (even if one could still in
principle make use of x87 for FSIN/FCOS/etc).

> - anton

Re: Fantasy architecture: the 10-bit byte

<c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29678&group=comp.arch#29678

X-Received: by 2002:ac8:72d1:0:b0:3a8:15e1:757 with SMTP id o17-20020ac872d1000000b003a815e10757mr1047276qtp.194.1671581867231;
Tue, 20 Dec 2022 16:17:47 -0800 (PST)
X-Received: by 2002:a05:6808:5d9:b0:355:4eda:47e0 with SMTP id
d25-20020a05680805d900b003554eda47e0mr1172606oij.167.1671581866949; Tue, 20
Dec 2022 16:17:46 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 16:17:46 -0800 (PST)
In-Reply-To: <tnt3gu$9nn$1@gioia.aioe.org>
Injection-Info: google-groups.googlegroups.com; posting-host=2a0d:6fc2:55b0:ca00:11cd:d708:9b1:4a60;
posting-account=ow8VOgoAAAAfiGNvoH__Y4ADRwQF1hZW
NNTP-Posting-Host: 2a0d:6fc2:55b0:ca00:11cd:d708:9b1:4a60
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com> <tnt3gu$9nn$1@gioia.aioe.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: already5...@yahoo.com (Michael S)
Injection-Date: Wed, 21 Dec 2022 00:17:47 +0000
Content-Type: text/plain; charset="UTF-8"
X-Received-Bytes: 3345
 by: Michael S - Wed, 21 Dec 2022 00:17 UTC

On Tuesday, December 20, 2022 at 9:47:45 PM UTC+2, Terje Mathisen wrote:
> Michael S wrote:
> > On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> >> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> >>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >>> leave nearly everything else to software.
> >> Bare minimum is good if transistors are limited. Otherwise, might as well do
> >> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> >> in one instruction with one normalize and round. Might be less hardware than
> >> separate FADD/FSUB/FMUL.
> >
> > Compare apples to apples.
> > FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> > But it is bigger than FPU that does either FADD or FMUL per n clocks.
> > Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> > Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> > plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> > there exist a potential for HW savings.
> >
> For me, a sufficient reason to include FMA is the fact that with the FMA
> wide normalizer we gain the ability to handle subnormal inputs and
> outputs at zero cycle cost and very marginal gate cost.
>

If this is indeed the fact.
I am unconvinced that it is true in all possible circumstances.
There are many sensible ways to design FMA-capable FPU.

> Terje
>
> --
> - <Terje.Mathisen at tmsw.no>
> "almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<tntjmb$qogm$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29679&group=comp.arch#29679

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: paaroncl...@gmail.com (Paul A. Clayton)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 19:23:38 -0500
Organization: A noiseless patient Spider
Lines: 11
Message-ID: <tntjmb$qogm$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 21 Dec 2022 00:23:39 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="16ad1a4a1511737ed6851b3878784cce";
logging-data="877078"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18A7IRhXVQP9LF1FtgHvsvruecLzykbZUc="
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101
Thunderbird/78.0
Cancel-Lock: sha1:eNkwvLmUs4pS8kg8mPaDVu6u7xE=
In-Reply-To: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
 by: Paul A. Clayton - Wed, 21 Dec 2022 00:23 UTC

Russell Wallace wrote:
[snip]
> Suppose you could go back to the early seventies and design an architecture to try to outcompete the 8080 and become the long-term basis of the personal computer industry. You get to use everything we have learned between that day and this, with two caveats:
>
> 1. You only get to bring back ideas, not equipment, so it must be technically feasible to build the first implementation with the process technology of 1974.

You might be interested in the thread:
[just fun] Time travel destination choice with (micro)architecture
knowledge
(Google Groups link:
https://groups.google.com/g/comp.arch/c/XuxpaN5fTZI/m/n4y6HkwGBAAJ )

Re: Fantasy architecture: the 10-bit byte

<16e51bdc-8a62-4d4c-b85e-3f9e54ba5e35n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29680&group=comp.arch#29680

X-Received: by 2002:a05:620a:1014:b0:6ff:c9c:dde4 with SMTP id z20-20020a05620a101400b006ff0c9cdde4mr2532887qkj.18.1671585966849;
Tue, 20 Dec 2022 17:26:06 -0800 (PST)
X-Received: by 2002:a54:4518:0:b0:359:d97b:3f6f with SMTP id
l24-20020a544518000000b00359d97b3f6fmr1881655oil.298.1671585966593; Tue, 20
Dec 2022 17:26:06 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 17:26:06 -0800 (PST)
In-Reply-To: <c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:6d05:5e47:7756:554b;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:6d05:5e47:7756:554b
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com> <tnt3gu$9nn$1@gioia.aioe.org>
<c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <16e51bdc-8a62-4d4c-b85e-3f9e54ba5e35n@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Wed, 21 Dec 2022 01:26:06 +0000
Content-Type: text/plain; charset="UTF-8"
X-Received-Bytes: 4583
 by: MitchAlsup - Wed, 21 Dec 2022 01:26 UTC

On Tuesday, December 20, 2022 at 6:17:48 PM UTC-6, Michael S wrote:
> On Tuesday, December 20, 2022 at 9:47:45 PM UTC+2, Terje Mathisen wrote:
> > Michael S wrote:
> > > On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> > >> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> > >>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> > >>> leave nearly everything else to software.
> > >> Bare minimum is good if transistors are limited. Otherwise, might as well do
> > >> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> > >> in one instruction with one normalize and round. Might be less hardware than
> > >> separate FADD/FSUB/FMUL.
> > >
> > > Compare apples to apples.
> > > FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> > > But it is bigger than FPU that does either FADD or FMUL per n clocks.
> > > Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> > > Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> > > plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> > > there exist a potential for HW savings.
> > >
> > For me, a sufficient reason to include FMA is the fact that with the FMA
> > wide normalizer we gain the ability to handle subnormal inputs and
> > outputs at zero cycle cost and very marginal gate cost.
> >
> If this is indeed the fact.
> I am unconvinced that it is true in all possible circumstances.
> There are many sensible ways to design FMA-capable FPU.
<
All FMACs have to be able to produce a 106-bit product and to add to it
a 53-bit fraction, handling both the case where the product lies to the
right of the fraction and the case where the fraction lies to the right
of the product, with all bits considered when rounding is performed.
<
With such requirements, the product operands only have to suppress
the hidden bit when their exponents are 0, as does the augend.
At this point the proper result is produced in all circumstances.
<
One has to be able to normalize across this 106-bit fraction, but
there is a circuit trick whereby one inserts a "fake" 1 in the position
that would become the hidden bit of the denormal (should one occur)
and then simply lets the normalization transpire. Presto, denorms
are properly created and handled. Overall cost: less than 2% gate
count.
<
So, yes, it is indeed a fact, and is not dependent on the 5-20 ways
one can build an FMAC unit, just that the unit is capable of IEEE
rounded-result accuracy. The rest, as they say, falls out for free.
<
> > Terje
> >
> > --
> > - <Terje.Mathisen at tmsw.no>
> > "almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<tntnl7$r1m1$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29681&group=comp.arch#29681

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 19:31:17 -0600
Organization: A noiseless patient Spider
Lines: 82
Message-ID: <tntnl7$r1m1$1@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me>
<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
<tnt3gu$9nn$1@gioia.aioe.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 21 Dec 2022 01:31:20 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="1cedb4eb961a5e82c67f3875b01dfb56";
logging-data="886465"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX188vtP4rJu6iuBjbW3X9iyf"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:xpfb1mnTr6soUG79Tg0Us0MJCF0=
Content-Language: en-US
In-Reply-To: <tnt3gu$9nn$1@gioia.aioe.org>
 by: BGB - Wed, 21 Dec 2022 01:31 UTC

On 12/20/2022 1:47 PM, Terje Mathisen wrote:
> Michael S wrote:
>> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com
>> wrote:
>>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
>>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>>>> leave nearly everything else to software.
>>> Bare minimum is good if transistors are limited. Otherwise, might as
>>> well do
>>> other functions in hardware. FMA (fused multiply-add) does
>>> FADD/FSUB/FMUL
>>> in one instruction with one normalize and round. Might be less
>>> hardware than
>>> separate FADD/FSUB/FMUL.
>>
>> Compare apples to apples.
>> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with
>> throughput of 1 per n clocks *each*.
>> But it is bigger than FPU that does either FADD or FMUL per n clocks.
>> Renormalization after FMA is a lot more costly than after FMUL and
>> somewhat more costly than after FADD.
>> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other
>> hand, only needs 54 MS bits of result
>> plus wired or of the rest of the bits. I am not an expert in
>> multipliers HW, but it seems to me that
>> there exist a potential for HW savings.
>>
> For me, a sufficient reason to include FMA is the fact that with the FMA
> wide normalizer we gain the ability to handle subnormal inputs and
> outputs at zero cycle cost and very marginal gate cost.
>

This is one possible merit.

I started at one point looking into doing a combined FMA unit, but this
effort stalled out when I noted it would be expensive. Tried out a "less
extreme" intermediate option, but it was still expensive.

Could have done it for the low-precision FPU (partly leveraging that the
DSP48 does an X*Y+Z operation already), but it would initially have still
needed ~ 4 cycles, which would not work with my existing pipeline (and
not being able to pipeline it would be worse than not having FMA, and
would kinda defeat much of the gain vs the main FPU).

Say:
Main FPU:
Scalar FADD/FSUB/FMUL: 6 cycle (6C 6T)
FMAC (double rounded): 12 cycle (12C 12T)
SIMD FADD/FSUB/FMUL: 10 cycle (10C 10T)
Low Precision (at present, *):
SIMD FADD/FSUB/FMUL: 3 cycle (3C 1T)

*: Includes both Binary16 and Binary32 operations.

Though, besides the operations themselves, one area of concern for
denormals is the converters, where one either needs to pay a full FADD
latency cost for format conversions, or the converters need to be fairly
expensive.

I had fudged the rules slightly in a way that allowed keeping converter
cost low, while still keeping some other useful properties.

Normal conversion between FP and integer types is still an issue though.
For the general case, it needs to be routed through the FADD unit, which
requires having a mantissa larger than the largest supported integer type.
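
For illustration, the classic software form of that trick (a minimal
sketch in C; it relies only on the double's 52-bit mantissa being wider
than the 32-bit integer being converted, and the function name is made up):

  #include <stdint.h>
  #include <string.h>

  /* Convert a 32-bit unsigned integer to double by dropping it into the
     mantissa of 2^52 and letting an ordinary FP subtract renormalize it. */
  static double u32_to_double_via_fadd(uint32_t n)
  {
      uint64_t bits = 0x4330000000000000ULL | n;   /* bit pattern of 2^52 + n */
      double d;
      memcpy(&d, &bits, sizeof d);
      return d - 4503599627370496.0;               /* subtract 2^52 exactly */
  }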

The SIMD converters use a scheme which avoids needing to use the FADD
units for this, but puts some steep restrictions on what can be converted.

Similarly, an Int<->FP converter that can only directly handle 16 or 20
bit values (and needing to fall back to emulation software for anything
bigger) would be... Kinda lame.

> Terje
>

Re: Fantasy architecture: the 10-bit byte

<c82583ee-2c34-4ab4-9e02-aee8e1a7dffcn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29682&group=comp.arch#29682

X-Received: by 2002:a37:a891:0:b0:6ff:9543:d534 with SMTP id r139-20020a37a891000000b006ff9543d534mr1204432qke.676.1671590134915;
Tue, 20 Dec 2022 18:35:34 -0800 (PST)
X-Received: by 2002:a05:6808:3a8d:b0:354:9da8:98a9 with SMTP id
fb13-20020a0568083a8d00b003549da898a9mr6534oib.9.1671590134596; Tue, 20 Dec
2022 18:35:34 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Tue, 20 Dec 2022 18:35:34 -0800 (PST)
In-Reply-To: <tntnl7$r1m1$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=99.251.79.92; posting-account=QId4bgoAAABV4s50talpu-qMcPp519Eb
NNTP-Posting-Host: 99.251.79.92
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com> <tnt3gu$9nn$1@gioia.aioe.org>
<tntnl7$r1m1$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <c82583ee-2c34-4ab4-9e02-aee8e1a7dffcn@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: robfi...@gmail.com (robf...@gmail.com)
Injection-Date: Wed, 21 Dec 2022 02:35:34 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 6979
 by: robf...@gmail.com - Wed, 21 Dec 2022 02:35 UTC

On Tuesday, December 20, 2022 at 8:31:23 PM UTC-5, BGB wrote:
> On 12/20/2022 1:47 PM, Terje Mathisen wrote:
> > Michael S wrote:
> >> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com
> >> wrote:
> >>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> >>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >>>> leave nearly everything else to software.
> >>> Bare minimum is good if transistors are limited. Otherwise, might as
> >>> well do
> >>> other functions in hardware. FMA (fused multiply-add) does
> >>> FADD/FSUB/FMUL
> >>> in one instruction with one normalize and round. Might be less
> >>> hardware than
> >>> separate FADD/FSUB/FMUL.
> >>
> >> Compare apples to apples.
> >> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with
> >> throughput of 1 per n clocks *each*.
> >> But it is bigger than FPU that does either FADD or FMUL per n clocks.
> >> Renormalization after FMA is a lot more costly than after FMUL and
> >> somewhat more costly than after FADD.
> >> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other
> >> hand, only needs 54 MS bits of result
> >> plus wired or of the rest of the bits. I am not an expert in
> >> multipliers HW, but it seems to me that
> >> there exist a potential for HW savings.
> >>
> > For me, a sufficient reason to include FMA is the fact that with the FMA
> > wide normalizer we gain the ability to handle subnormal inputs and
> > outputs at zero cycle cost and very marginal gate cost.
> >
> This is one possible merit.
>
>
> I started at one point looking into doing a combine FMA unit, but this
> effort stalled out when I noted it would be expensive. Tried out a "less
> extreme" intermediate option, but was still expensive.
>
> Could have done it for the low-precision FPU (partly leveraging that the
> DSP48 does an X*Y+Z operation already), but initially would have still
> needed ~ 4 cycles, which would not work with my existing pipeline (and
> not being able to pipeline it would be worse than not having FMA, and
> would kinda defeat much of the gain vs the main FPU).
>
>
> Say:
> Main FPU:
> Scalar FADD/FSUB/FMUL: 6 cycle (6C 6T)
> FMAC (double rounded): 12 cycle (12C 12T)
> SIMD FADD/FSUB/FMUL: 10 cycle (10C 10T)
> Low Precision (at present, *):
> SIMD FADD/FSUB/FMUL: 3 cycle (3C 1T)
>
> *: Includes both Binary16 and Binary32 operations.
>
>
>
> Though, besides the operations themselves, one are of concern for
> denormals is converters, where one either needs to pay a full FADD
> latency cost for format conversions, or the converters need to be fairly
> expensive.
>
> I had fudged the rules slightly in a way that allowed keeping converter
> cost low, while still keeping some other useful properties.
>
>
> Normal conversion between FP and integer types is still an issue though.
> For general-case, needs to be routed through the FADD unit, which
> requires having a mantissa larger than the largest supported integer type..
>
> The SIMD converters use a scheme which avoids needing to use the FADD
> units for this, but puts some steep restrictions on what can be converted..
>
> Similarly, an Int<->FP converter that can only directly handle 16 or 20
> bit values (and needing to fall back to emulation software for anything
> bigger) would be... Kinda lame.
>
>
> > Terje
> >

There are a lot of merits to the 10-bit byte. It is somewhat of a sweet spot for
character data and memory sizes. Back when programming early micros, I wished that
more programmable characters were available on-screen to allow graphics. A 40×25
screen could use 1024 character tiles.

In the past I have done some thinking, and sketched out a design, on using 10/11-bit
bytes, because ECC with single-bit error correction can be had if 16-bit-wide memory
is used to hold each byte. Wanting to get error correction with a standard 16-bit-wide
memory leads to an odd byte size.
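
For concreteness, a minimal sketch in C of that kind of code, assuming a
standard even-parity Hamming layout (check bits at positions 1, 2, 4, 8,
overall parity in bit 0; the function name is made up): an 11-bit byte
encodes into one 16-bit memory word with single-error correction and
double-error detection.

  #include <stdint.h>

  /* Encode an 11-bit "byte" into a 16-bit word: Hamming SEC with check
     bits at positions 1, 2, 4, 8, plus an overall parity bit in bit 0
     for double-error detection. Decode/correct is not shown. */
  static uint16_t hamming16_encode(uint16_t data11)
  {
      uint16_t code = 0;
      int d = 0;

      /* data bits go to the non-power-of-two positions 3,5,6,7,9..15 */
      for (int pos = 3; pos <= 15; pos++) {
          if ((pos & (pos - 1)) == 0)
              continue;                       /* skip check-bit positions */
          if (data11 & (1u << d++))
              code |= (uint16_t)(1u << pos);
      }
      /* each check bit covers the positions whose index has that bit set */
      for (int p = 1; p <= 8; p <<= 1) {
          int parity = 0;
          for (int pos = 3; pos <= 15; pos++)
              if (pos & p)
                  parity ^= (code >> pos) & 1;
          if (parity)
              code |= (uint16_t)(1u << p);
      }
      /* overall parity over bits 1..15 goes into bit 0 */
      int overall = 0;
      for (int pos = 1; pos <= 15; pos++)
          overall ^= (code >> pos) & 1;
      return (uint16_t)(code | overall);
  }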

One of the issues with the byte size is getting stuck in a rut of computers designed
around it. If 10-bit bytes had been adopted, then we might not have progressed as quickly
to 64-bit machines, because 20 bits would have been "good enough" for a much longer
time. 40-bit machines may have become popular, with little demand to upgrade from 40
to 80 bits, and would probably be adequate today. We are now likely stuck in a 64-bit
machine rut, while those who chose 10-bit bytes would be using 80-bit machines.

> A 10-bit byte will be good for a few hundred characters of extended ASCII. A 20-bit
> word will be good for Unicode, avoiding the trouble that happened OTL when the world
> tried to squeeze Unicode into a 16-bit word.

Even 20 bits may not be enough for Unicode. I would prefer a larger size for this, say
24 bits, as I think there are a lot of Unicode characters. A UTF-21 format, three
characters per 64-bit word, works okay but might be too restrictive in the long run.
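
For what it's worth, the packing itself is trivial (a minimal sketch in C;
"UTF-21" here just means three 21-bit code points per 64-bit word as
described above, and the names are made up):

  #include <stdint.h>

  /* Pack/unpack three 21-bit Unicode code points into one 64-bit word;
     the top bit of the word is left unused. */
  static uint64_t utf21_pack(uint32_t c0, uint32_t c1, uint32_t c2)
  {
      return ((uint64_t)(c0 & 0x1FFFFF))
           | ((uint64_t)(c1 & 0x1FFFFF) << 21)
           | ((uint64_t)(c2 & 0x1FFFFF) << 42);
  }

  static void utf21_unpack(uint64_t w, uint32_t cp[3])
  {
      cp[0] = (uint32_t)( w        & 0x1FFFFF);
      cp[1] = (uint32_t)((w >> 21) & 0x1FFFFF);
      cp[2] = (uint32_t)((w >> 42) & 0x1FFFFF);
  }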

Re: Fantasy architecture: the 10-bit byte

<tntuvm$uidt$2@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=29684&group=comp.arch#29684

Path: i2pn2.org!i2pn.org!eternal-september.org!reader01.eternal-september.org!.POSTED!not-for-mail
From: cr88...@gmail.com (BGB)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Tue, 20 Dec 2022 21:36:19 -0600
Organization: A noiseless patient Spider
Lines: 137
Message-ID: <tntuvm$uidt$2@dont-email.me>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me>
<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
<tnt3gu$9nn$1@gioia.aioe.org>
<c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>
<16e51bdc-8a62-4d4c-b85e-3f9e54ba5e35n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 21 Dec 2022 03:36:22 -0000 (UTC)
Injection-Info: reader01.eternal-september.org; posting-host="1cedb4eb961a5e82c67f3875b01dfb56";
logging-data="1001917"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+pmo2a5tiSKnr9lOi/xbdW"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.6.0
Cancel-Lock: sha1:tLBTXK3lOYBJI9j0gjOfs1YQLgQ=
Content-Language: en-US
In-Reply-To: <16e51bdc-8a62-4d4c-b85e-3f9e54ba5e35n@googlegroups.com>
 by: BGB - Wed, 21 Dec 2022 03:36 UTC

On 12/20/2022 7:26 PM, MitchAlsup wrote:
> On Tuesday, December 20, 2022 at 6:17:48 PM UTC-6, Michael S wrote:
>> On Tuesday, December 20, 2022 at 9:47:45 PM UTC+2, Terje Mathisen wrote:
>>> Michael S wrote:
>>>> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
>>>>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
>>>>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>>>>>> leave nearly everything else to software.
>>>>> Bare minimum is good if transistors are limited. Otherwise, might as well do
>>>>> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
>>>>> in one instruction with one normalize and round. Might be less hardware than
>>>>> separate FADD/FSUB/FMUL.
>>>>
>>>> Compare apples to apples.
>>>> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
>>>> But it is bigger than FPU that does either FADD or FMUL per n clocks.
>>>> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
>>>> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
>>>> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
>>>> there exist a potential for HW savings.
>>>>
>>> For me, a sufficient reason to include FMA is the fact that with the FMA
>>> wide normalizer we gain the ability to handle subnormal inputs and
>>> outputs at zero cycle cost and very marginal gate cost.
>>>
>> If this is indeed the fact.
>> I am unconvinced that it is true in all possible circumstances.
>> There are many sensible ways to design FMA-capable FPU.
> <
> All FMACs have to be able to produce a 106-bit product and to all to it
> a 53-bit fraction such that the product is to the right of the fraction,
> the fraction is to the right of the product, and that all bits are considered
> when rounding is performed.
> <
> With such requirements, the product operands only have to avoid
> producing the hidden bit when the exponents are 0 as does the augend.
> At this point the proper result is produced in all circumstances.
> <
> One has to be able to normalize across this 106-bit fraction, but
> there is a circuit trick whereby one inserts a "fake" 1 in the position
> that would become the hidden bit of the denormal (should on occur)
> and then simply let the normalization transpire. Presto, denorms
> are properly created and handled. Overall cost less than 2% gate
> count.
> <
> So, yes, it is indeed a fact, and is not dependent on the 5-20 ways
> one has to build a FMAC unit, just that the unit is capable of IEEE
> rounded result accuracy. The rest, as they say, fall out for free.
> <

Yeah, just... That first part...

This is a big reason why BJX2 does not have single-rounded FMAC, nor any
immediate plan to implement it in "hardware".

There is an optional FMAC instruction, but it is double-rounded, for the
main reason that double rounding is a lot cheaper (but the tricks used
to implement higher-precision math via FMAC ops will not work in this
case).
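
For illustration, the difference is easy to see in software (a minimal
sketch in C, assuming the platform's fma() is genuinely fused; the values
are chosen so the exact product needs more than 53 bits):

  #include <math.h>
  #include <stdio.h>

  int main(void)
  {
      double a = 1.0 + 0x1p-30;
      double b = 1.0 + 0x1p-29;

      double p   = a * b;            /* rounded product */
      double err = fma(a, b, -p);    /* exact low part of a*b: the trick that
                                        needs single rounding and breaks with
                                        a double-rounded FMAC */
      double fused   = fma(a, b, -1.0);   /* one rounding */
      double twostep = a * b - 1.0;       /* two roundings */

      printf("err     = %a\n", err);      /* nonzero: 0x1p-59 */
      printf("fused   = %a\n", fused);
      printf("twostep = %a\n", twostep);  /* differs from fused by 0x1p-59 */
      return 0;
  }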

....

Otherwise, recently bought an OrangeCrab board, which I could in theory
try porting stuff to (though, IO is a big concern; it uses a Lattice ECP5
FPGA which "seems decently large on paper", but I have little idea how it
compares with an Artix or Spartan in use).

Have been trying to think out how to wire a VGA output to the thing.

Currently looks like I might need to do it RGBI style (~ 6 IO pins), but
possibly feed it through some 220pF capacitors or similar (thinking is
that 220pF could potentially allow me to PWM the output signal, while
still having a fast enough response so as to not turn the output into a
blurry mess).

Ideally, should be able to smooth out multiple pulses within a single
pixel, while allowing a "reasonably quick" transition between adjacent
pixels (otherwise, I would be left with either 16-color for lower
capacitance, or blurred output for higher capacitance; basically as part
of a low-pass RC-filter).

Though, 68 or 100pF capacitors might also work, ideally want around a 15
to 20 MHz cutoff (say, with 2K to 10K for R, then fed into a 2n3904 (as
a driver/amplifier), which then goes to the output along with the
voltage-limiting LED, with probably a 100 ohm resistor on the NPN, ...).

Could maybe also use trimpots to keep the cutoff as adjustable.

Don't really have enough IO pins for much beyond RGBI; nor enough
clock-cycles per pixel for effective PWM/PDM ...

Unclear how many mA the IO pins can drive (might need to use a few
2n3904 transistors or similar; also to step up the H/V sync from 3.3v to
5v, though could skip if the monitor will accept 3.3v H/V sync).

Could maybe use some LEDs as voltage limiters, since if the voltage goes
too far over the 0.7v white level, the LED will dissipate the excess.

Otherwise, if I could fit a PS2 connector into the mix, that would be
good as well (would need ~ 2 or 4 more data wires, 4 for mouse +
keyboard, and source for 5v).

The board seemingly also has a connection for (I assume) a LiON/LiPO
cell. Just less IO than I initially thought as the 'A' pins are
apparently input-only. Could run off this, but would only have 3.3v.

If I want batteries and 5v, could use a NiMH pack for 5v (likely 4x 1.2v
cells), but could not use the built-in charger circuit (and would need
6v to charge the pack). Would need a 6-cell pack to run a 7805 (5V)
regulator (with ~ 9v for the charger).

Though, 6v does fall within the standard +/- 25% tolerance for TTL, so
should be OK in theory to run stuff directly off the batteries (a 4-cell
NiMH pack would be ~ 4.0 to 5.6 v depending on charge level).

Still some uncertainty about hardware mapping differences, or specifics
of using the Yosys toolchain, ...

Could be an experiment though to see if I could fit the BJX2 core onto it.

>>> Terje
>>>
>>> --
>>> - <Terje.Mathisen at tmsw.no>
>>> "almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<tnv23d$1vk1$1@gioia.aioe.org>

https://www.novabbs.com/devel/article-flat.php?id=29693&group=comp.arch#29693

Path: i2pn2.org!i2pn.org!aioe.org!rd9pRsUZyxkRLAEK7e/Uzw.user.46.165.242.91.POSTED!not-for-mail
From: terje.ma...@tmsw.no (Terje Mathisen)
Newsgroups: comp.arch
Subject: Re: Fantasy architecture: the 10-bit byte
Date: Wed, 21 Dec 2022 14:35:40 +0100
Organization: Aioe.org NNTP Server
Message-ID: <tnv23d$1vk1$1@gioia.aioe.org>
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at>
<e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me>
<39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com>
<tnt3gu$9nn$1@gioia.aioe.org>
<c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="65153"; posting-host="rd9pRsUZyxkRLAEK7e/Uzw.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101
Firefox/68.0 SeaMonkey/2.53.14
X-Notice: Filtered by postfilter v. 0.9.2
 by: Terje Mathisen - Wed, 21 Dec 2022 13:35 UTC

Michael S wrote:
> On Tuesday, December 20, 2022 at 9:47:45 PM UTC+2, Terje Mathisen wrote:
>> Michael S wrote:
>>> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
>>>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
>>>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
>>>>> leave nearly everything else to software.
>>>> Bare minimum is good if transistors are limited. Otherwise, might as well do
>>>> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
>>>> in one instruction with one normalize and round. Might be less hardware than
>>>> separate FADD/FSUB/FMUL.
>>>
>>> Compare apples to apples.
>>> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
>>> But it is bigger than FPU that does either FADD or FMUL per n clocks.
>>> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
>>> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
>>> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
>>> there exist a potential for HW savings.
>>>
>> For me, a sufficient reason to include FMA is the fact that with the FMA
>> wide normalizer we gain the ability to handle subnormal inputs and
>> outputs at zero cycle cost and very marginal gate cost.
>>
>
> If this is indeed the fact.
> I am unconvinced that it is true in all possible circumstances.
> There are many sensible ways to design FMA-capable FPU.

This is an interesting statement!

Can you give at least one example that doesn't include a normalizer wide
enough to handle FMUL of two subnormal numbers without first normalizing
them?

Having that normalizer means that we can basically ignore normal vs
subnormal on input, and the output normalization is the same as
otherwise required for FMA with (near-)maximal cancellation, right?

Terje

--
- <Terje.Mathisen at tmsw.no>
"almost all programming can be viewed as an exercise in caching"

Re: Fantasy architecture: the 10-bit byte

<8e266293-fa76-4e6e-bfa8-849cc8a1eb0cn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29706&group=comp.arch#29706

X-Received: by 2002:ac8:71da:0:b0:3a9:80b6:4ca0 with SMTP id i26-20020ac871da000000b003a980b64ca0mr110207qtp.304.1671659928279;
Wed, 21 Dec 2022 13:58:48 -0800 (PST)
X-Received: by 2002:a05:6870:cc81:b0:144:5572:4aeb with SMTP id
ot1-20020a056870cc8100b0014455724aebmr351320oab.186.1671659927726; Wed, 21
Dec 2022 13:58:47 -0800 (PST)
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!feed1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Wed, 21 Dec 2022 13:58:47 -0800 (PST)
In-Reply-To: <tnv23d$1vk1$1@gioia.aioe.org>
Injection-Info: google-groups.googlegroups.com; posting-host=136.50.14.162; posting-account=AoizIQoAAADa7kQDpB0DAj2jwddxXUgl
NNTP-Posting-Host: 136.50.14.162
References: <41a60986-1046-4318-823e-e07a9f175e70n@googlegroups.com>
<2022Dec18.175256@mips.complang.tuwien.ac.at> <e8b88b4b-409d-41af-a98b-1487a3dbf91fn@googlegroups.com>
<tnovcp$79fh$1@dont-email.me> <tnp38h$160bb$1@newsreader4.netcologne.de>
<tnq4um$d0c9$1@dont-email.me> <2022Dec19.191001@mips.complang.tuwien.ac.at>
<tnrfd1$jt2c$1@dont-email.me> <39a88253-607b-483b-87c1-4a0008235051n@googlegroups.com>
<a8d8b7cc-cdf5-44aa-917b-012509d5438en@googlegroups.com> <tnt3gu$9nn$1@gioia.aioe.org>
<c0fffdda-789f-49a2-9d74-569ccde10976n@googlegroups.com> <tnv23d$1vk1$1@gioia.aioe.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <8e266293-fa76-4e6e-bfa8-849cc8a1eb0cn@googlegroups.com>
Subject: Re: Fantasy architecture: the 10-bit byte
From: jim.brak...@ieee.org (JimBrakefield)
Injection-Date: Wed, 21 Dec 2022 21:58:48 +0000
Content-Type: text/plain; charset="UTF-8"
X-Received-Bytes: 4413
 by: JimBrakefield - Wed, 21 Dec 2022 21:58 UTC

On Wednesday, December 21, 2022 at 7:35:45 AM UTC-6, Terje Mathisen wrote:
> Michael S wrote:
> > On Tuesday, December 20, 2022 at 9:47:45 PM UTC+2, Terje Mathisen wrote:
> >> Michael S wrote:
> >>> On Tuesday, December 20, 2022 at 11:45:33 AM UTC+2, robf...@gmail.com wrote:
> >>>> On Monday, December 19, 2022 at 11:58:13 PM UTC-5, BGB wrote:
> >>>>> If it were me, I would just assume an FPU that does FADD/FSUB/FMUL and
> >>>>> leave nearly everything else to software.
> >>>> Bare minimum is good if transistors are limited. Otherwise, might as well do
> >>>> other functions in hardware. FMA (fused multiply-add) does FADD/FSUB/FMUL
> >>>> in one instruction with one normalize and round. Might be less hardware than
> >>>> separate FADD/FSUB/FMUL.
> >>>
> >>> Compare apples to apples.
> >>> FMA with throughput of 1 per n clocks is smaller that FADD + FMUL with throughput of 1 per n clocks *each*.
> >>> But it is bigger than FPU that does either FADD or FMUL per n clocks.
> >>> Renormalization after FMA is a lot more costly than after FMUL and somewhat more costly than after FADD.
> >>> Also, FMA needs full 53x53=>106 bit multiplier. FMUL, on the other hand, only needs 54 MS bits of result
> >>> plus wired or of the rest of the bits. I am not an expert in multipliers HW, but it seems to me that
> >>> there exist a potential for HW savings.
> >>>
> >> For me, a sufficient reason to include FMA is the fact that with the FMA
> >> wide normalizer we gain the ability to handle subnormal inputs and
> >> outputs at zero cycle cost and very marginal gate cost.
> >>
> >
> > If this is indeed the fact.
> > I am unconvinced that it is true in all possible circumstances.
> > There are many sensible ways to design FMA-capable FPU.
> This is an interesting statement!
>
> Can you give at least one example that don't include a normalizer wide
> enough to handle FMUL of two subnormal numbers without first normalizing
> them?
>
> Having that normalizer means that we can basically ignore normal vs
> subnormal on input, and the output normalization is the same as
> otherwise required for FMA with (near-)maximal cancellation, right?
> Terje
>
>
> --
> - <Terje.Mathisen at tmsw.no>
> "almost all programming can be viewed as an exercise in caching"
|>
|>Can you give at least one example that don't include a normalizer wide
|>enough to handle FMUL of two subnormal numbers without first normalizing
|>them?
Rant on:
There is a way to keep subnormal numbers normalized; it works best with round-to-odd,
using a trailing-zeros fraction/mantissa code and using the exponent to identify the value
as subnormal. The interpretation of this method is that subnormal numbers are inexact.
Rant off

Re: Fantasy architecture: the 10-bit byte

<13dd8ec8-d946-4b35-a4ed-d6dc93766e9dn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=29709&group=comp.arch#29709

From: already5...@yahoo.com (Michael S)
 by: Michael S - Wed, 21 Dec 2022 23:03 UTC

On Wednesday, December 21, 2022 at 3:35:45 PM UTC+2, Terje Mathisen wrote:
> [snip; quoted in full in the previous post]
> Can you give at least one example that doesn't include a normalizer wide
> enough to handle FMUL of two subnormal numbers without first normalizing
> them?
>
> Having that normalizer means that we can basically ignore normal vs
> subnormal on input, and the output normalization is the same as
> otherwise required for FMA with (near-)maximal cancellation, right?
> Terje

I can't answer your question right now.
Instead, I can give you the result of a measurement on real-world hardware.
Intel Skylake (has FMA on both FP execution pipes):
1e-307 - 1.01e-307 (a subtraction whose result is subnormal) takes ~138 clocks (latency).
That's pretty close to the ~152 clocks the same operation takes on Intel Ivy Bridge,
which does not have FMA.
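
A rough sketch of how such a number can be measured, assuming x86 with GCC or Clang; the loop structure, names and iteration count are mine, not the actual test behind the ~138/~152 clock figures. It times a serial chain of subtractions whose results are subnormal and compares it with the same chain producing normal results:

#include <stdio.h>
#include <x86intrin.h>   /* __rdtsc() */

/* One serial chain: each iteration does a subtraction (subnormal result
   when a=1e-307, b=1.01e-307) plus an addition that feeds the result
   back into the next iteration, so iterations cannot overlap. */
static double chain(double a, double b, long n)
{
    double sink = 0.0;
    for (long i = 0; i < n; i++) {
        double d = a - b;  /* ~ -1e-309 in the subnormal case */
        sink += d;
        a = b + d;         /* restores a, but depends on d */
    }
    return sink;
}

int main(void)
{
    const long n = 1000000;

    unsigned long long t0 = __rdtsc();
    volatile double s1 = chain(1e-307, 1.01e-307, n);  /* subnormal results */
    unsigned long long t1 = __rdtsc();
    volatile double s2 = chain(1.0, 1.01, n);          /* normal results    */
    unsigned long long t2 = __rdtsc();

    printf("subnormal chain: %.1f cycles/iter\n", (double)(t1 - t0) / n);
    printf("normal chain:    %.1f cycles/iter\n", (double)(t2 - t1) / n);
    printf("(sinks: %g %g)\n", s1, s2);
    return 0;
}

Compile without -ffast-math, which would typically enable flush-to-zero/denormals-are-zero and hide the penalty; also note that __rdtsc() counts TSC ticks, which only match core clocks when the frequency is pinned.
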

