devel / comp.arch / Re: Why separate 32-bit arithmetic on a 64-bit architecture?

Subject / Author
* Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?BGB
|`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?David Brown
| +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?BGB
| `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
|  `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
|`- Re: Why separate 32-bit arithmetic on a 64-bit architecture?BGB
+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
|`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Marcus
| +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Marcus
| `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?EricP
|+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
||`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
|| `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?EricP
||  `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
||   `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||    +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Terje Mathisen
||    |`* The cost of gradual underflow (was: Why separate 32-bit arithmetic on a 64-bit aStefan Monnier
||    | `- Re: The cost of gradual underflowTerje Mathisen
||    +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||    `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?antispam
||     +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Terje Mathisen
||     |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Terje Mathisen
||     | |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | | +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | | |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | | | `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | | |  `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | | |   +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | | |   `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
||     | | `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Terje Mathisen
||     | |  `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | |   `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | |    `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | |     `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | |      `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | |       `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     | |        `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     | |         +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
||     | |         |+- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     | |         |`- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     | |         `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Terje Mathisen
||     | `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  |+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||`- Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  |+* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  ||`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  || `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  ||  `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  ||   |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Thomas Koenig
||     |  ||   | `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     |  ||   |  +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
||     |  ||   |  |+- Re: Why separate 32-bit arithmetic on a 64-bit architecture?EricP
||     |  ||   |  |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
||     |  ||   |  | `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     |  ||   |  `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |   +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     |  ||   |   +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |   |`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?George Neuner
||     |  ||   |   | `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |   `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |    +- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |    +- Spectre ane EPIC (was: Why separate 32-bit arithmetic...)Anton Ertl
||     |  ||   |    +* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     |  ||   |    |`* Spectre (was: Why separate 32-bit arithmetic ...)Anton Ertl
||     |  ||   |    | +* Re: Spectre (was: Why separate 32-bit arithmetic ...)Michael S
||     |  ||   |    | |+* Re: SpectreEricP
||     |  ||   |    | ||+* Re: SpectreMitchAlsup
||     |  ||   |    | |||`* Re: SpectreEricP
||     |  ||   |    | ||| `- Re: SpectreMitchAlsup
||     |  ||   |    | ||`- Re: SpectreAnton Ertl
||     |  ||   |    | |`* Re: Spectre (was: Why separate 32-bit arithmetic ...)Anton Ertl
||     |  ||   |    | | +* Re: Spectre (was: Why separate 32-bit arithmetic ...)MitchAlsup
||     |  ||   |    | | |`- Re: Spectre (was: Why separate 32-bit arithmetic ...)Thomas Koenig
||     |  ||   |    | | `- Re: Spectre (was: Why separate 32-bit arithmetic ...)Anton Ertl
||     |  ||   |    | +* Re: SpectreEricP
||     |  ||   |    | |`* Re: SpectreAnton Ertl
||     |  ||   |    | | +* Memory encryption (was: Spectre)Thomas Koenig
||     |  ||   |    | | |`* Re: Memory encryption (was: Spectre)Anton Ertl
||     |  ||   |    | | | `* Re: Memory encryption (was: Spectre)Elijah Stone
||     |  ||   |    | | |  +- Re: Memory encryption (was: Spectre)Michael S
||     |  ||   |    | | |  `* Re: Memory encryption (was: Spectre)Anton Ertl
||     |  ||   |    | | |   +- Re: Memory encryption (was: Spectre)MitchAlsup
||     |  ||   |    | | |   `* Re: Memory encryption (was: Spectre)Thomas Koenig
||     |  ||   |    | | |    `- Re: Memory encryption (was: Spectre)Anton Ertl
||     |  ||   |    | | `* Re: SpectreTerje Mathisen
||     |  ||   |    | |  `* Re: SpectreThomas Koenig
||     |  ||   |    | |   +* Re: SpectreAnton Ertl
||     |  ||   |    | |   |`* Re: SpectreThomas Koenig
||     |  ||   |    | |   | +- Re: SpectreAnton Ertl
||     |  ||   |    | |   | `- Re: SpectreMichael S
||     |  ||   |    | |   `- Re: SpectreMitchAlsup
||     |  ||   |    | `* Re: Spectre (was: Why separate 32-bit arithmetic ...)MitchAlsup
||     |  ||   |    |  `- Re: Spectre (was: Why separate 32-bit arithmetic ...)Anton Ertl
||     |  ||   |    `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   |     `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc
||     |  ||   `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
||     |  |+- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Bill Findlay
||     |  |`* Re: Imprecision, was Why separate 32-bit arithmetic on a 64-bit architecture?John Levine
||     |  `- Re: Why separate 32-bit arithmetic on a 64-bit architecture?Michael S
||     `* Re: Why separate 32-bit arithmetic on a 64-bit architecture?MitchAlsup
|`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Anton Ertl
`* Re: Why separate 32-bit arithmetic on a 64-bit architecture?Quadibloc

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<2022Jan27.184124@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=23175&group=comp.arch#23175

 by: Anton Ertl - Thu, 27 Jan 2022 17:41 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>Anton Ertl wrote:
>> MitchAlsup <MitchAlsup@aol.com> writes:
>>> On Wednesday, January 26, 2022 at 12:13:38 PM UTC-6, Thomas Koenig wrote:
>>>> EricP <ThatWould...@thevillage.com> schrieb:
>>>>> They left out the byte and word load and stores, supposedly because
>>>>> that would require the byte shifter network on the critical path.
>>
>> The story I have heard is that they required ECC for write-back caches
>> (and parity for write-through caches); while the first implementations
>> had write-through D-caches, they designed the architecture for
>> implementations with write-back D-caches, and there byte stores would
>> have cost either ECC on the byte level (50% overhead compared to ~20%
>> for ECC on 32-bit units) or implementing stores by performing a
>> read-modify-write of a larger unit.
>
>That seems reasonable too, except Alpha has 32 bit stores and,
>assuming they used the standard off-the-shelf 64+8 bit ECC DRAM,
>those dword stores would require a read-modify-write memory cycle
>and recomputing SECDED ECC.

I don't know how they implemented it, but they had a write-back L2
cache on the 21064 and 21164, and already a write-back L1 on the
21264. For Alpha without BWX I would then store 32-bit units + 5 bits
ECC in the write-back cache, and when a cache line is evicted from the
write-back cache, check and correct the ECC of these units and
generate ECC for 64-bit units, resulting in stuff that fits in
standard ECC DRAM.
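
The check-bit counts behind the overhead figures quoted above follow from
the Hamming SECDED bound: the smallest r with 2^r >= d + r + 1, plus one
bit for double-error detection. A quick back-of-the-envelope sketch in C,
nothing Alpha-specific:

#include <stdio.h>

/* Hamming SEC needs the smallest r with 2^r >= d + r + 1;
   one extra check bit turns it into SECDED. */
static int secded_bits(int d)
{
    int r = 0;
    while ((1 << r) < d + r + 1)
        r++;
    return r + 1;
}

int main(void)
{
    int widths[] = { 8, 32, 64 };
    for (int i = 0; i < 3; i++) {
        int d = widths[i], c = secded_bits(d);
        printf("%2d data bits: %d check bits, %.1f%% overhead\n",
               d, c, 100.0 * c / d);
    }
    return 0;   /* 8 -> 5 (62.5%), 32 -> 7 (21.9%), 64 -> 8 (12.5%) */
}

SEC alone needs 4 bits per byte (the 50% figure); adding double-error
detection makes it 5. For 32-bit units it is 7 bits (~20%), and the
standard 64+8 layout is the 12.5% case.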

I did not look closely at the RAM of our Alphas, but my impression was
that they used pretty standard PC stuff. Just looked up information
for our first Alpha generation, which used an AlphaPC64 board
<https://www.manualslib.com/manual/667049/Digital-Equipment-Alphapc64.html?page=9#manual>:

|The AlphaPC64 memory subsystem supports DRAM memory arrays of 16MB to
|512MB with a 128-bit data bus. The memory is contained in two banks of
|four commodity single inline memory modules (SIMMs). Each SIMM is 36
|bits wide, with 32 data bits, 1 parity bit, and 3 unused bits with
|70-ns or less access.

So they did not go to the complications I imagined, but they also did
not give us ECC (it was not a DEC machine, so I guess the ECC
requirement did not apply).

>From the text below, it would seem
>that they used a different ECC applied to 32 bit words,
>which implies they did not used standard 72 bit DRAM DIMMs
>(which in turn would have hit them on main memory cost).

At least the AlphaPC64 was apparently too early for DIMMs.

>> Funnily, they introduced byte stores before they introduced an
>> implementation with a write-back D-cache.
>
>And according to DEC benchmarks, code got faster and smaller
>so it would seem that many of the assumptions used to
>justify leaving out byte/word were not correct.
>
>Maybe later implementations switched to using 72 bit DIMMs
>so it was doing the RMW for 32-bit dwords anyway.

Looking at <https://www.omnistep.com/~advantag/matrix.htm>, the
AlphaPC164 used parity FPM (fast page mode) DRAM in the form of
SIMMs, while the AlphaPC164LX used ECC SDRAM DIMMs.

My design for byte stores in the 21164A (EV56) would be: if the cache
line is not in L1, load it into L1 (and have byte parity there), and
the store waits until the cache line is there (but the common case is
that there is no need to wait). Then replace the stored byte(s), and
write the cache line (or at least 64-bit parcels) into the L2,
generating ECC on the way. When a cache line is evicted from L2, you
have the 64-bit parcels ready, including ECC. So essentially the
write-through D-cache provides the RMW functionality necessary for
ECC.

What they apparently did in the AlphaPC164 is a memory controller that
worked with 32-bit parcels on the DRAM side and therefore threw the
ECC away (or checked it) and used parity instead on writing to DRAM;
on reading from DRAM the memory controller checked the parity and then
faked the ECC data.

On the AlphaPC164LX they changed to 64-bit ECC SDRAM DIMMs, which is
better aligned to (my guess of) the L2 representation of the data, and
therefore they now used ECC functionality in DRAM.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<jwvwnilui61.fsf-monnier+comp.arch@gnu.org>

https://www.novabbs.com/devel/article-flat.php?id=23176&group=comp.arch#23176

 by: Stefan Monnier - Thu, 27 Jan 2022 18:52 UTC

> and recomputing SECDED ECC. From the text below, it would seem
> that they used a different ECC applied to 32 bit words,
> which implies they did not used standard 72 bit DRAM DIMMs
> (which in turn would have hit them on main memory cost).

IIRC they use 64bit-bundle ECCs in the DRAM, and only supported the
32bit granularity in cache accesses, not at the DRAM interface.

Stefan

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<ssuqde$cek$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23177&group=comp.arch#23177

 by: Ivan Godard - Thu, 27 Jan 2022 19:05 UTC

On 1/27/2022 7:24 AM, EricP wrote:
> Anton Ertl wrote:
>> MitchAlsup <MitchAlsup@aol.com> writes:
>>> On Wednesday, January 26, 2022 at 12:13:38 PM UTC-6, Thomas Koenig
>>> wrote:
>>>> EricP <ThatWould...@thevillage.com> schrieb:
>>>>> They left out the byte and word load and stores, supposedly because
>>>>> that would require the byte shifter network on the critical path.
>>
>> The story I have heard is that they required ECC for write-back caches
>> (and parity for write-through caches); while the first implementations
>> had write-through D-caches, they designed the architecture for
>> implementations with write-back D-caches, and there byte stores would
>> have cost either ECC on the byte level (50% overhead compared to ~20%
>> for ECC on 32-bit units) or implementing stores by performing a
>> read-modify-write of a larger unit.
>
> That seems reasonable too, except Alpha has 32 bit stores and,
> assuming they used the standard off-the-shelf 64+8 bit ECC DRAM,
> those dword stores would require a read-modify-write memory cycle
> and recomputing SECDED ECC. From the text below, it would seem
> that they used a different ECC applied to 32 bit words,
> which implies they did not used standard 72 bit DRAM DIMMs
> (which in turn would have hit them on main memory cost).
>
> Richard Sites, credited as an Alpha co-architect, gives reasons in
> "Alpha AXP Architecture" 1993 and supports both views:
>
> "The byte load/store instructions and unaligned accesses found in
> some RISC architectures can be a performance bottleneck. They require
> an extra byte shifter in the speed-critical load and store paths,
> and they force a hard choice in fast cache design."
> ...
> "On a previous project involving a MIPS implementation, we found the
> shifter for the load-left/load-right instructions to be a direct
> cycle-time bottleneck. Also, the VAX 8700 implementation (circa 1986)
> removed the byte shifter in the load/store hardware in favor of a faster
> microcycle, with 2 cycles for a byte load and 6 cycles for an unaligned
> 32-bit access. This decision achieved a net performance gain.
> Our experience encouraged us to avoid byte load/store.
>
> An additional problem with byte stores is that an implementer may easily
> choose only two of the three design features: fast write-back cache,
> single-bit error correction code (ECC), or byte stores.
>
> Byte stores are straightforward in simple byte parity write-through cache
> implementations. Except for the expensive design of four or five ECC bits
> for every eight bits of data, a byte store to a fast ECC write-back cache
> involves
> 1. Reading an entire cache word
> 2. Checking the ECC bits and correcting any single bit error
> 3. Modifying the byte
> 4. Calculating the new ECC bits
> 5. Writing the entire cache word
>
> This read-modify-write sequence requires hidden sequencer hardware and
> hidden state to hold the cache word temporarily. The sequencer tends to
> slow down ordinary full-cache-word stores. The need for byte stores tends
> to ripple throughout the memory subsystem design, making each piece a
> little more complicated and a little slower. With non-replicated
> hidden state, it is difficult to issue another byte store until the
> first one finishes."
>
>> Funnily, they introduced byte stores before they introduced an
>> implementation with a write-back D-cache.
>
> And according to DEC benchmarks, code got faster and smaller
> so it would seem that many of the assumptions used to
> justify leaving out byte/word were not correct.
>
> Maybe later implementations switched to using 72 bit DIMMs
> so it was doing the RMW for 32-bit dwords anyway.
>
>> I guess that these days, with a relatively large write-combining store
>> buffer, the RMW cost is acceptable.
>
> x86 has its virtually tagged write combine buffers.
> I don't know if anyone else does.

<sticks up hand>

falls out of cache-in-virtual

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<ssvo55$a3i$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23188&group=comp.arch#23188

 by: Brett - Fri, 28 Jan 2022 03:32 UTC

MitchAlsup <MitchAlsup@aol.com> wrote:
> On Wednesday, January 26, 2022 at 7:38:07 PM UTC-6, gg...@yahoo.com wrote:
>> EricP <ThatWould...@thevillage.com> wrote:
>>> Anton Ertl wrote:
>>>> EricP <ThatWould...@thevillage.com> writes:
>>>>> Alpha didn't have SEXT/ZEXT sign/zero extend instructions
>>>>> so it would have required a pair of shifts.
>>>>
>>>> Alpha uses addl for sign extension and some zap instruction for zero
>>>> extension. If they had not added addl, they could have added an sext
>>>> instruction instead.
>>>
>>> There were lots of instructions they could have had, but they seemed
>>> almost obsessive that the R in RISC means reduced instruction _count_.
>>>
>>> They left out the byte and word load and stores, supposedly because
>>> that would require the byte shifter network on the critical path.
>>> I think that was a disastrous decision that contributed greatly
>>> to porting difficulties and lack of wide market acceptance.
>>> Once it is established in people's minds that it is difficult to work with,
>>> or more trouble than it's worth, it is tough to come back from.
>>> In effect, they created a market barrier to themselves.
>>>
>>> And when they finally did add byte and word load and store,
>>> load byte and word only did zero extend not sign extend
>>> supposedly because they did not want to put the sign extension
>>> logic into the load critical path. But still, WTF!
> <
>> Loads are so variable that they should have bit the bullet and supported 2
>> cycle signed loads in addition to 1 cycle unsigned loads. RISC religion
>> stupidity.
> <
> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?

I rounded up to integer cycle counts, the same as two instructions (load and
extend), except now as one instruction, saving instruction cache. So you can
crack it, or split the load path at the extender unit. Perhaps only one of
the two load units would support extension if you split the load. If that
load port is full you can do a late crack.

>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
>>> a bit position would not have affected the cycle time.
>
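
For reference, the extension idioms being discussed, in portable C rather
than Alpha assembly (the shift pair stands in for the missing sign-extending
loads; the mask is the zap-style zero extension; the 32-bit case is what the
sign-extending addl with a zero operand gives you, per Anton's note quoted
above):

#include <stdint.h>
#include <stdio.h>

static int64_t sext8_shiftpair(uint64_t x)
{
    return (int64_t)(x << 56) >> 56;    /* shift left, then arithmetic right */
}

static uint64_t zext8_mask(uint64_t x)
{
    return x & 0xFF;                    /* keep byte 0, clear the rest */
}

static int64_t sext32(uint64_t x)
{
    return (int64_t)(int32_t)x;         /* 32-bit result sign-extended to 64 */
}

int main(void)
{
    uint64_t v = 0xF0;                  /* 0xF0 is -16 as a signed byte */
    printf("%lld %llu %lld\n", (long long)sext8_shiftpair(v),
           (unsigned long long)zext8_mask(v), (long long)sext32(0xFFFFFFFFull));
    return 0;                           /* prints -16 240 -1 */
}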

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<DdXIJ.50253$gX.47600@fx40.iad>

https://www.novabbs.com/devel/article-flat.php?id=23198&group=comp.arch#23198

 by: EricP - Fri, 28 Jan 2022 19:11 UTC

Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> Anton Ertl wrote:
>>> MitchAlsup <MitchAlsup@aol.com> writes:
>>>> On Wednesday, January 26, 2022 at 12:13:38 PM UTC-6, Thomas Koenig wrote:
>>>>> EricP <ThatWould...@thevillage.com> schrieb:
>>>>>> They left out the byte and word load and stores, supposedly because
>>>>>> that would require the byte shifter network on the critical path.
>>> The story I have heard is that they required ECC for write-back caches
>>> (and parity for write-through caches); while the first implementations
>>> had write-through D-caches, they designed the architecture for
>>> implementations with write-back D-caches, and there byte stores would
>>> have cost either ECC on the byte level (50% overhead compared to ~20%
>>> for ECC on 32-bit units) or implementing stores by performing a
>>> read-modify-write of a larger unit.
>> That seems reasonable too, except Alpha has 32 bit stores and,
>> assuming they used the standard off-the-shelf 64+8 bit ECC DRAM,
>> those dword stores would require a read-modify-write memory cycle
>> and recomputing SECDED ECC.
>
> I don't know how they implemented it, but they had a write-back L2
> cache on the 21064 and 21164, and already a write-back L1 on the
> 21264. For Alpha without BWX I would then store 32-bit units + 5 bits
> ECC in the write-back cache, and when a cache line is evicted from the
> write-back cache, check and correct the ECC of these units and
> generate ECC for 64-bit units, resulting in stuff that fits in
> standard ECC DRAM.

I was trying to figure out what kind of cache design could handle
aligned dword merges into caches lines but not handle byte merges,
thus justifying the original decision to elide byte/word load/store.
After all, if they have the cache line sitting right there,
what's the big deal about merging a byte into it.

I realized overnight that the L1 cache on the 21064 model must be
read allocate - it only loads a line into cache on read.
So stores can't depend on the line being present to merge bytes into,
so byte/word writes could require an extra read cycle.

Looking at the 21064 Hardware Reference Manual, a 32-bit aligned
dword store that misses the cache would write-thru
to perform just the necessary aligned dword write to DRAM,
with the parity or 7-bit ECC generated per dword by the processor.
There is also a 4 entry 32-byte write combine circular buffer.

Whole 32-byte write lines are transferred to cache in one or two
128-bit bus cycles, with 4 parity or 7 ECC bits per dword,
plus a 4-bit mask indicating which dwords are valid.
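
A toy C model of that write-combine buffer (entry count and line size from
the HRM description above; everything else invented for illustration): an
aligned dword store either merges into an open 32-byte entry or claims a new
one, and the per-dword valid mask is what accompanies the line when it is
transferred.

#include <stdint.h>
#include <string.h>
#include <stdio.h>

#define WCB_ENTRIES 4
#define LINE_BYTES  32
#define DWORDS      (LINE_BYTES / 4)

struct wcb_entry {
    int      in_use;
    uint64_t line_addr;      /* 32-byte-aligned address */
    uint32_t dword[DWORDS];
    uint8_t  valid_mask;     /* one bit per dword; goes out as two 4-bit
                                halves with the two 128-bit bus cycles */
};

static struct wcb_entry wcb[WCB_ENTRIES];

static void wcb_store32(uint64_t addr, uint32_t data)
{
    uint64_t line = addr & ~(uint64_t)(LINE_BYTES - 1);
    int idx = (int)((addr >> 2) & (DWORDS - 1));

    for (int i = 0; i < WCB_ENTRIES; i++) {
        if (wcb[i].in_use && wcb[i].line_addr == line) {
            wcb[i].dword[idx] = data;                  /* merge into open entry */
            wcb[i].valid_mask |= (uint8_t)(1u << idx);
            return;
        }
    }
    for (int i = 0; i < WCB_ENTRIES; i++) {
        if (!wcb[i].in_use) {                          /* claim a free entry */
            memset(&wcb[i], 0, sizeof wcb[i]);
            wcb[i].in_use = 1;
            wcb[i].line_addr = line;
            wcb[i].dword[idx] = data;
            wcb[i].valid_mask = (uint8_t)(1u << idx);
            return;
        }
    }
    /* a real buffer would drain the oldest entry here and retry */
}

int main(void)
{
    wcb_store32(0x1000, 0x11111111);
    wcb_store32(0x1004, 0x22222222);                   /* merges into same entry */
    printf("line 0x1000 valid mask = 0x%02x\n", (unsigned)wcb[0].valid_mask);
    return 0;                                          /* prints 0x03 */
}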

If this had used a write-allocate write-thru cache or write-back
then I think any advantage of dropping byte/word instruction disappears.

> I did not look closely at the RAM of our Alphas, but my impression was
> that they used pretty standard PC stuff. Just looked up information
> for our first Alpha generation, which used an AlphaPC64 board
> <https://www.manualslib.com/manual/667049/Digital-Equipment-Alphapc64.html?page=9#manual>:
>
> |The AlphaPC64 memory subsystem supports DRAM memory arrays of 16MB to
> |512MB with a 128-bit data bus. The memory is contained in two banks of
> |four commodity single inline memory modules (SIMMs). Each SIMM is 36
> |bits wide, with 32 data bits, 1 parity bit, and 3 unused bits with
> |70-ns or less access.
>
> So they did not go to the complications I imagined, but they also did
> not give us ECC (it was not a DEC machine, so I guess the ECC
> requirement did not apply).
>
>>From the text below, it would seem
>> that they used a different ECC applied to 32 bit words,
>> which implies they did not used standard 72 bit DRAM DIMMs
>> (which in turn would have hit them on main memory cost).
>
> At least the AlphaPC64 was apparently too early for DIMMs.
>
>>> Funnily, they introduced byte stores before they introduced an
>>> implementation with a write-back D-cache.
>> And according to DEC benchmarks, code got faster and smaller
>> so it would seem that many of the assumptions used to
>> justify leaving out byte/word were not correct.
>>
>> Maybe later implementations switched to using 72 bit DIMMs
>> so it was doing the RMW for 32-bit dwords anyway.
>
> Looking at <https://www.omnistep.com/~advantag/matrix.htm>, the
> AlphaPC164 used parity FPM (IIRC fast path memory) DRAM in the form of
> SIMMs, while the AlphaPC164LX used ECC SDRAM DIMMs.
>
> My design for byte stores in the 21164A (EV56) would be: if the cache
> line is not in L1, load it into L1 (and have byte parity there), and
> the store waits until the cache line is there (but the common case is
> that there is no need to wait). Then replace the stored byte(s), and
> write the cache line (or at least 64-bit parcels) into the L2,
> generating ECC on the way. When a cache line is evicted from L2, you
> have the 64-bit parcels ready, including ECC. So essentially the
> write-through D-cache provides the RMW functionality necessary for
> ECC.

Yes, it works because you are using a write-allocate cache
so you always have data to merge bytes into.

There is also the extra logic for handling misaligned load/store
which I would put mostly in the LSQ (because it may straddle pages).
So what the cache sees from LSQ are one or two aligned 64-bit load/store
physical addresses with an 8 bit byte select mask.
The cache takes care of merging selected bytes into lines.
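
For what it's worth, that merge is only a few lines when written out; a C
sketch of "aligned 64-bit write under an 8-bit byte-select mask" (purely
illustrative, names invented):

#include <stdint.h>
#include <stdio.h>

/* Bit i of byte_mask selects byte lane i of the aligned qword. */
static uint64_t merge_bytes(uint64_t old_q, uint64_t new_q, uint8_t byte_mask)
{
    uint64_t lanes = 0;
    for (int i = 0; i < 8; i++)
        if (byte_mask & (1u << i))
            lanes |= 0xFFull << (8 * i);
    return (old_q & ~lanes) | (new_q & lanes);
}

int main(void)
{
    uint64_t line_q = 0x1122334455667788ull;
    /* a byte store of 0xAB at byte offset 3 of this qword */
    printf("%016llx\n",
           (unsigned long long)merge_bytes(line_q, 0xABull << 24, 0x08));
    return 0;                                   /* 11223344ab667788 */
}

A store that straddles two qwords (or two pages) is then just two such
merges, which is the one-or-two-accesses case above.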

> What they apparently did in the AlphaPC164 is a memory controller that
> worked with 32-bit parcels on the DRAM side and therefore threw the
> ECC away (or checked it) and used parity instead on writing to DRAM;
> on reading from DRAM the memory controller checked the parity and then
> faked the ECC data.

Yes, looks that way. Actually they say they only store 1 parity bit
per dword so it looks like they merge 4 parity to 1, store that and
leave 3 bits unused. For some unknown reason.

> On the AlphaPC164LX they changed to 64-bit ECC SDRAM DIMMs, wich is
> better aligned to (my guess of) the L2 representation of the data, and
> therefore they now used ECC functionality in DRAM.
>
> - anton

Their choice of doing 7 bits ECC per 32 bit dword doubles the overhead
and makes it incompatible with standard 72 bit SIMMs.
But it does allow them to sidestep the whole issue of merging
dword stores into qwords.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<2022Jan28.231944@mips.complang.tuwien.ac.at>

https://www.novabbs.com/devel/article-flat.php?id=23204&group=comp.arch#23204

 by: Anton Ertl - Fri, 28 Jan 2022 22:19 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>If this had used a write-allocate write-thru cache or write-back
>then I think any advantage of dropping byte/word instruction disappears.

For write-back it still means either ECC on the byte level or reading
the rest of the unit to produce the combined ECC.

Write allocation was not that common in the early 1990s. E.g., the
first generation K6-2 had write-back, but without write allocation; a
later revision, the K6-2 CXT, added (or enabled) write allocation (in
1998).

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<st27or$gvk$2@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23206&group=comp.arch#23206

 by: Ivan Godard - Sat, 29 Jan 2022 02:11 UTC

On 1/28/2022 2:19 PM, Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> If this had used a write-allocate write-thru cache or write-back
>> then I think any advantage of dropping byte/word instruction disappears.
>
> For write-back it still means either ECC on the byte level or reading
> the rest of the unit to produce the combined ECC.
>
> Write allocation was not that common in the early 1990s. E.g., the
> first generation K6-2 had write-back, but without write allocation; a
> later revision, the K6-2 CXT, added (or enabled) write allocation (in
> 1998).
>
> - anton

@Mitch - what cache policy is my66 using?

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<488d6998-13fc-4062-8663-8f97b146a24dn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=23207&group=comp.arch#23207

 by: MitchAlsup - Sat, 29 Jan 2022 03:45 UTC

On Friday, January 28, 2022 at 8:11:42 PM UTC-6, Ivan Godard wrote:
> On 1/28/2022 2:19 PM, Anton Ertl wrote:
> > EricP <ThatWould...@thevillage.com> writes:
> >> If this had used a write-allocate write-thru cache or write-back
> >> then I think any advantage of dropping byte/word instruction disappears.
> >
> > For write-back it still means either ECC on the byte level or reading
> > the rest of the unit to produce the combined ECC.
> >
> > Write allocation was not that common in the early 1990s. E.g., the
> > first generation K6-2 had write-back, but without write allocation; a
> > later revision, the K6-2 CXT, added (or enabled) write allocation (in
> > 1998).
> >
> > - anton
> @Mitch - what cache policy is my66 using?
<
Officially "every implementation gets to make its own choices", but TLBs
and table-walk accelerators are coherent. {Like Mill, there is no supervisor
state or instructions. If the MMU tables allow access you can "do it".
Certain pages accessed by HW have the TLB marked RWE=000 and Present=1
LDs, STs, and Fetches take page faults, ENTER and EXTRACT can access
data on the safe stack, HW can access machine and thread configuration
information,... And all control registers are memory mapped. }
<
However, I am, in general, a fan of exclusive caches at least at the L1 and
L2 levels. DRAM->L1 modified in L1->L2 modified in L2->DRAM.
<
Beyond the data routing, I prefer write allocate caches, using byte parity
at the L1 level and SECDED ECC at L2; configured for write-back operation.
However, the ST buffers would hold the data so it appears to have been written
while waiting on the line to arrive.
<
Back in 1993 the phrase was "If it is spelled with a D it gets ECC" D=DRAM.
At this current level of technology, "If it is spelled xRAM it should get ECC".
On a GBOoO implementation I am contemplating, entire cache lines are
brought into L0 (My conditional Cache == Memory reorder Buffer), played
around with for a while, then sent back to L1 with ECC--the same ECC used
throughout the entire fabric {DRAM, device RAM, PCIe message protection,
system interconnect,...}. The CC is fully associative and can deal with an
entire execution window of associativity. This puts it in a place to make
better routing decisions--such as the location in the L1 it came from was
displaced, so when you write it back, write it to L2 instead of L1. This takes
pressure off the amount of associativity needed in the L1 caches.
<
We had situations in MATRIX300 (back in 1991) where "where" the write
back data got written changed every other clock cycle in the CC; oscillating
between L1 and DRAM (no L2 back then, but Hitachi pseudo static DRAM
was only 5 cycles of bus latency at 100 MHz)
<
One of the things I learned in doing the Samsung GPU design was to use
the multi-layer metal now available as bus wires. There is no particular
reason that the 512-bits of a cache line take more than 1 beat from flip-
flop to flip-flop (and at double data rate so 256 wires in and 256 wires out.)
Also, if you have/get all 512 bits in a single cycle, it is a lot more feasible
to use a really protective ECC 3EC5ED on a per 64-bit DoubleWord.
<
In my small 1.3-wide implementation, I am using 24KB 3-way set caches because
I can fit 3 tags and states in a single SRAM word with 46-bit physical address.
My larger designs will likely be 4-way L1s sized to balance L1 perf with L2 latency.
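
Assuming 64-byte lines (an assumption, not stated above), the tag arithmetic
works out; a quick check in C:

#include <stdio.h>

static int log2i(int x) { int n = 0; while ((1 << n) < x) n++; return n; }

int main(void)
{
    const int cache_bytes = 24 * 1024, ways = 3, line_bytes = 64, pa_bits = 46;
    int sets     = cache_bytes / ways / line_bytes;            /* 128 */
    int tag_bits = pa_bits - log2i(sets) - log2i(line_bytes);  /* 46 - 7 - 6 = 33 */
    printf("%d sets, %d-bit tags, %d bits for 3 tags + state per SRAM word\n",
           sets, tag_bits, ways * tag_bits);
    return 0;   /* 128 sets, 33-bit tags, 99 tag bits leaves room for state */
}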

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<h6xJJ.33518$N31.21257@fx45.iad>

https://www.novabbs.com/devel/article-flat.php?id=23219&group=comp.arch#23219

 by: EricP - Sun, 30 Jan 2022 14:14 UTC

Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> If this had used a write-allocate write-thru cache or write-back
>> then I think any advantage of dropping byte/word instruction disappears.
>
> For write-back it still means either ECC on the byte level or reading
> the rest of the unit to produce the combined ECC.

Right, exactly. And byte level ECC means custom SIMMs and we don't
want that. So ECC implies 64 bit quadword writes so we can use standard
64+8 SIMMs, which implies reading the whole unit for dword writes,
which requires reading the whole 32 byte cache line.

(It could read part of a cache line, but that would inefficiently
use the DRAM RAS cycle which is the bulk of its access time,
and a partial bus transfer would waste much bus bandwidth.)

It would be wasteful to read the cache line into the cpu in order to do
that merge, write the changed line back, and then toss the cache line.
So for efficiency the cache retains the line, and that is write allocate.

And since it's got the whole line, there is little extra cost difference
to support byte/word accesses (misaligned accesses still do have
an extra cost).

To summarize, potential support for ECC memory and desire for efficiency
basically forces cache to write allocate, and means there is no reason
to elide the byte/word access instructions as it gains little or nothing.
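
Once the line is resident, the per-store sequence is the one quoted from
Sites earlier in the thread; a C sketch with the ECC code itself stubbed out
(illustration only, not a real SECDED implementation):

#include <stdint.h>
#include <stdio.h>

static uint8_t ecc_encode(uint64_t d)
{
    uint8_t e = 0;                       /* placeholder: XOR of the bytes,   */
    for (int i = 0; i < 8; i++)          /* standing in for real 64+8 SECDED */
        e ^= (uint8_t)(d >> (8 * i));
    return e;
}

static uint64_t ecc_check_correct(uint64_t d, uint8_t e)
{
    (void)e;                             /* placeholder: a real implementation
                                            would correct a single-bit error */
    return d;
}

/* Byte store into an ECC-protected cache word. */
static void byte_store(uint64_t *word, uint8_t *ecc, int offset, uint8_t value)
{
    uint64_t w = *word;                          /* 1. read the whole word    */
    w = ecc_check_correct(w, *ecc);              /* 2. check/correct ECC      */
    w &= ~(0xFFull << (8 * offset));             /* 3. modify the byte        */
    w |= (uint64_t)value << (8 * offset);
    *ecc = ecc_encode(w);                        /* 4. compute new check bits */
    *word = w;                                   /* 5. write the whole word   */
}

int main(void)
{
    uint64_t w = 0x1122334455667788ull;
    uint8_t  e = ecc_encode(w);
    byte_store(&w, &e, 0, 0xAA);
    printf("%016llx ecc=%02x\n", (unsigned long long)w, (unsigned)e);
    return 0;
}

The hidden sequencer and temporary state Sites warns about are the hardware
cost of steps 1 through 5; write allocate is what guarantees step 1 has
something to read.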

> Write allocation was not that common in the early 1990s. E.g., the
> first generation K6-2 had write-back, but without write allocation; a
> later revision, the K6-2 CXT, added (or enabled) write allocation (in
> 1998).
>
> - anton

Yes and those were mostly PC's which (a) already had byte/word instructions
and (b) very few had even parity memory. Doing individual byte writes
is less efficient but does not cost extra logic in the DRAM controller
provided SIMMs allow individual byte writes to be enabled,
and a write combine buffer helps with the efficiency.
So write-thru cache with read allocate is the least complex design
that supports the system functional requirements.
But add potential ECC to a PC and it changes that calculation.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<jwvpmo9qgcb.fsf-monnier+comp.arch@gnu.org>

https://www.novabbs.com/devel/article-flat.php?id=23223&group=comp.arch#23223

 by: Stefan Monnier - Sun, 30 Jan 2022 17:38 UTC

> I was trying to figure out what kind of cache design could handle
> aligned dword merges into caches lines but not handle byte merges,
> thus justifying the original decision to elide byte/word load/store.

The original Alpha announcements insisted heavily on the architecture
being designed for the long term. So whether they could implement it
cheaply enough on the first implementation was definitely not the only
requirement for a feature to make the cut.

Based on the quote sent earlier from one of the designers, I think it's
pretty clear that they had had difficulty making byte-operations cheap
in the past (for a VAX CPU), so they apparently decided to resist the
temptation to add byte operations until they were proved to be really worth
the potential trouble. Same for precise exceptions.

Stefan

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<9ce2e83b-3370-495c-a4a1-e4074327e04dn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=23227&group=comp.arch#23227

 by: MitchAlsup - Sun, 30 Jan 2022 18:33 UTC

On Sunday, January 30, 2022 at 8:17:52 AM UTC-6, EricP wrote:
> Anton Ertl wrote:
> > EricP <ThatWould...@thevillage.com> writes:
> >> If this had used a write-allocate write-thru cache or write-back
> >> then I think any advantage of dropping byte/word instruction disappears.
> >
> > For write-back it still means either ECC on the byte level or reading
> > the rest of the unit to produce the combined ECC.
> Right, exactly. And byte level ECC means custom SIMMs and we don't
> want that. So ECC implies 64 bit quadword writes so we can use standard
> 64+8 SIMMs, which implies reading the whole unit for dword writes,
> which requires reading the whole 32 byte cache line.
>
> (It could read part of a cache line, but that would inefficiently
> use the DRAM RAS cycle which is the bulk of its access time,
> and a partial bus transfer would waste much bus bandwidth.)
>
> It would be wasteful to read the cache line into the cpu in order to do
> that merge, write the changed line back, and then toss the cache line.
> So for efficiency the cache retains the line, and that is write allocate.
>
> And since its got the whole line, there is little extra cost difference
> to support byte/word accesses (misaligned accesses still do have
> an extra cost).
>
> To summarize, potential support for ECC memory and desire for efficiency
> basically forces cache to write allocate, and means there is no reason
> to elide the byte/word access instructions as it gains little or nothing.
<
But, if the Memory+DRAM Controller has the ability to repair ECC read
errors from the DRAM accesses, you can write through on a miss and
let the M+D controller do its thing.
<
I am NOT suggesting that this is the proper course of events (I like
Write Back Caches) but it remains an option for certain scenarios
{Boot before caches are enabled,...}
<
> > Write allocation was not that common in the early 1990s. E.g., the
> > first generation K6-2 had write-back, but without write allocation; a
> > later revision, the K6-2 CXT, added (or enabled) write allocation (in
> > 1998).
> >
> > - anton
> Yes and those were mostly PC's which (a) already had byte/word instructions
> and (b) very few had even parity memory. Doing individual byte writes
> is less efficient but does not cost extra logic in the DRAM controller
> provided SIMMs allow individual byte writes to be enabled,
> and a write combine buffer helps with the efficiency.
> So write-thru cache with read allocate is the least complex design
> that supports the system functional requirements.
> But add potential ECC to a PC and it changes that calculation.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<T0TJJ.23588$jb1.13706@fx46.iad>

https://www.novabbs.com/devel/article-flat.php?id=23243&group=comp.arch#23243

 by: EricP - Mon, 31 Jan 2022 15:13 UTC

MitchAlsup wrote:
> On Sunday, January 30, 2022 at 8:17:52 AM UTC-6, EricP wrote:
>> Anton Ertl wrote:
>>> EricP <ThatWould...@thevillage.com> writes:
>>>> If this had used a write-allocate write-thru cache or write-back
>>>> then I think any advantage of dropping byte/word instruction disappears.
>>> For write-back it still means either ECC on the byte level or reading
>>> the rest of the unit to produce the combined ECC.
>> Right, exactly. And byte level ECC means custom SIMMs and we don't
>> want that. So ECC implies 64 bit quadword writes so we can use standard
>> 64+8 SIMMs, which implies reading the whole unit for dword writes,
>> which requires reading the whole 32 byte cache line.
>>
>> (It could read part of a cache line, but that would inefficiently
>> use the DRAM RAS cycle which is the bulk of its access time,
>> and a partial bus transfer would waste much bus bandwidth.)
>>
>> It would be wasteful to read the cache line into the cpu in order to do
>> that merge, write the changed line back, and then toss the cache line.
>> So for efficiency the cache retains the line, and that is write allocate.
>>
>> And since its got the whole line, there is little extra cost difference
>> to support byte/word accesses (misaligned accesses still do have
>> an extra cost).
>>
>> To summarize, potential support for ECC memory and desire for efficiency
>> basically forces cache to write allocate, and means there is no reason
>> to elide the byte/word access instructions as it gains little or nothing.
> <
> But, if the Memory+DRAM Controller have the ability to repair ECC read
> errors from the DRAM accesses, you can write through on a miss and
> let the M+D controller do its thing.

Actually I've always thought the best place for ECC was on the
DRAM chip itself so that error scavenging integrates with refresh.
When the refresh cycle reads a row, it would keep a copy and spend
some internal cycles checking each field of the 2kb or 4kb row.
Of course this requires a completely different DRAM interface,
and historically DRAM processes don't mix well with logic,
and package pin out was traditionally a limiting factor.

Other than that, one needs the refresh, scavenge, and normal access logic,
pending op queues, open page scheduler, and ECC all in the same place.
And since the memory controller is integrated on the processor chip,
that is where it all winds up.

That avoids crossing more clock domains and avoids going through more
metastability synchronizers. Also it allows very wide internal buses.

> I am NOT suggesting that this is the proper course of events (I like
> Write Back Caches) but it remains an option for certain scenarios
> {Boot before caches are enabled,...}

If I understand your scenario correctly, this would only partially
utilize the bus between the (disabled) cache and the memory controller,
moving 8 bytes in a 16- or 32-byte packet, which is fine for boot.
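
As a rough sketch of the read-modify-write merge being described above
(the ecc8() below is only a toy stand-in for a real 64+8 SECDED code;
the shape of the merge is the point):

#include <stdint.h>

static uint8_t ecc8(uint64_t d)          /* toy stand-in for 64+8 SECDED */
{
    uint8_t e = 0;
    for (int i = 0; i < 8; i++)
        e ^= (uint8_t)(d >> (8 * i));
    return e;
}

static void store_byte(uint64_t *mem, uint8_t *ecc,
                       uint64_t addr, uint8_t val)
{
    uint64_t u      = addr >> 3;                 /* 64-bit ECC unit       */
    unsigned shift  = (unsigned)(addr & 7) * 8;  /* byte lane in the unit */
    uint64_t merged = (mem[u] & ~(0xFFull << shift))  /* read whole unit  */
                    | ((uint64_t)val << shift);       /* merge the byte   */
    mem[u] = merged;                             /* write back the unit   */
    ecc[u] = ecc8(merged);                       /* recompute its ECC     */
}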

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<stf6qk$5bs$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23261&group=comp.arch#23261

Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ggt...@yahoo.com (Brett)
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Thu, 3 Feb 2022 00:15:21 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 53
Message-ID: <stf6qk$5bs$1@dont-email.me>
References: <sso6aq$37b$1@newsreader4.netcologne.de>
<UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at>
<5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me>
<c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 3 Feb 2022 00:15:21 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="038a3f35258ef44ad59c4ed08da60de2";
logging-data="5500"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/v+ANSNPowKA7hxxyyjPIh"
User-Agent: NewsTap/5.5 (iPad)
Cancel-Lock: sha1:Tnj8NUurwNMBJzM5jEtfklDtrh0=
sha1:RKpU+t/QWCSRezMdD9prVbJI2oQ=
 by: Brett - Thu, 3 Feb 2022 00:15 UTC

Brett <ggtgp@yahoo.com> wrote:
> MitchAlsup <MitchAlsup@aol.com> wrote:
>> On Wednesday, January 26, 2022 at 7:38:07 PM UTC-6, gg...@yahoo.com wrote:
>>> EricP <ThatWould...@thevillage.com> wrote:
>>>> Anton Ertl wrote:
>>>>> EricP <ThatWould...@thevillage.com> writes:
>>>>>> Alpha didn't have SEXT/ZEXT sign/zero extend instructions
>>>>>> so it would have required a pair of shifts.
>>>>>
>>>>> Alpha uses addl for sign extension and some zap instruction for zero
>>>>> extension. If they had not added addl, they could have added an sext
>>>>> instruction instead.
>>>>
>>>> There were lots of instructions they could have had, but they seemed
>>>> almost obsessive that the R in RISC means reduced instruction _count_.
>>>>
>>>> They left out the byte and word load and stores, supposedly because
>>>> that would require the byte shifter network on the critical path.
>>>> I think that was a disastrous decision that contributed greatly
>>>> to porting difficulties and lack of wide market acceptance.
>>>> Once it established in peoples minds that it is difficult to work with
>>>> or more trouble than its worth, it is tough to come back from.
>>>> In effect, they created a market barrier to themselves.
>>>>
>>>> And when they finally did add byte and word load and store,
>>>> load byte and word only did zero extend not sign extend
>>>> supposedly because they did not want to put the sign extension
>>>> logic into the load critical path. But still, WTF!
>> <
>>> Loads are so variable that they should have bit the bullet and supported 2
>>> cycle signed loads in addition to 1 cycle unsigned loads. RISC religion
>>> stupidity.
>> <
>> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
>> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
>
> I rounded up to integer cycle counts, same as two instructions- load and
> extend, except now as one instruction saving instruction cache. So you can
> crack it, or split the load path at the extender unit. Perhaps only one of
> the two load units would support extension if you split the load. If that
> load port is full you can do a late crack.

Just realized that loads come back out of order and you have no control
over which load path is used, so you would split both load paths. This
causes some issues with needing more write ports to the ALU, but you
could queue these loads instead.

Is any of this reasonable, or am I too far out of my league?

>>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
>>>> a bit position would not have affected the cycle time.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=23262&group=comp.arch#23262

X-Received: by 2002:a05:620a:3181:: with SMTP id bi1mr21960892qkb.691.1643847915854;
Wed, 02 Feb 2022 16:25:15 -0800 (PST)
X-Received: by 2002:a05:6808:689:: with SMTP id k9mr6197849oig.281.1643847915574;
Wed, 02 Feb 2022 16:25:15 -0800 (PST)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Wed, 2 Feb 2022 16:25:15 -0800 (PST)
In-Reply-To: <stf6qk$5bs$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:80d5:8b42:92ed:f97b;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:80d5:8b42:92ed:f97b
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at> <5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me> <c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me> <stf6qk$5bs$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Thu, 03 Feb 2022 00:25:15 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 29
 by: MitchAlsup - Thu, 3 Feb 2022 00:25 UTC

On Wednesday, February 2, 2022 at 6:15:25 PM UTC-6, gg...@yahoo.com wrote:
> Brett <gg...@yahoo.com> wrote:
> > MitchAlsup <Mitch...@aol.com> wrote:

> >> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
> >> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
> >
> > I rounded up to integer cycle counts, same as two instructions- load and
> > extend, except now as one instruction saving instruction cache. So you can
> > crack it, or split the load path at the extender unit. Perhaps only one of
> > the two load units would support extension if you split the load. If that
> > load port is full you can do a late crack.
<
> Just realized that loads come back out of order and you have no control
> over which load path used, so you would split both load paths. This causes
> some issues with more write ports to the ALU. But you could queue these
> loads instead.
>
> Is any of this reasonable, or am I too far out of my league?
<
For the same reason you do not split FADD into 3 instructions, you should
not split LDs into 2 instructions {to say nothing of the code density problems
you will face when making this choice.}
<
In order to split a LD into a Fetch and a Decompose instruction, you need
the LD instruction to create a bit extraction specifier that gets passed to
the decompose instruction, and this adds to register write port pressure.
<
> >>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
> >>>> a bit position would not have affected the cycle time.
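
A sketch of what such SEXT/ZEXT-from-a-bit-position instructions would
compute (shifts are just the easiest way to write it in C; hardware would
use a mask/mux network, and the arithmetic right shift of a negative value
assumes the usual two's-complement behaviour):

#include <stdint.h>

static int64_t sext(uint64_t x, unsigned b)   /* sign extend from bit b */
{
    unsigned s = 63u - b;
    return (int64_t)(x << s) >> s;            /* replicate bit b upward */
}

static uint64_t zext(uint64_t x, unsigned b)  /* zero extend from bit b */
{
    unsigned s = 63u - b;
    return (x << s) >> s;                     /* clear bits above bit b */
}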

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<stk5c8$oga$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23281&group=comp.arch#23281

Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ggt...@yahoo.com (Brett)
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Fri, 4 Feb 2022 21:21:13 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 44
Message-ID: <stk5c8$oga$1@dont-email.me>
References: <sso6aq$37b$1@newsreader4.netcologne.de>
<UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at>
<5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me>
<c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me>
<stf6qk$5bs$1@dont-email.me>
<6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 4 Feb 2022 21:21:13 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="530527cc50553f594de4d7083533974c";
logging-data="25098"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/BFx6zbNkIOlZMm/deHjEa"
User-Agent: NewsTap/5.5 (iPad)
Cancel-Lock: sha1:mhy/z31hx3lQMHGouqMfAbcS9w8=
sha1:2vh9NRSE8mZmei1fUL+5yBfGGTA=
 by: Brett - Fri, 4 Feb 2022 21:21 UTC

MitchAlsup <MitchAlsup@aol.com> wrote:
> On Wednesday, February 2, 2022 at 6:15:25 PM UTC-6, gg...@yahoo.com wrote:
>> Brett <gg...@yahoo.com> wrote:
>>> MitchAlsup <Mitch...@aol.com> wrote:
>
>>>> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
>>>> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
>>>
>>> I rounded up to integer cycle counts, same as two instructions- load and
>>> extend, except now as one instruction saving instruction cache. So you can
>>> crack it, or split the load path at the extender unit. Perhaps only one of
>>> the two load units would support extension if you split the load. If that
>>> load port is full you can do a late crack.
> <
>> Just realized that loads come back out of order and you have no control
>> over which load path used, so you would split both load paths. This causes
>> some issues with more write ports to the ALU. But you could queue these
>> loads instead.
>>
>> Is any of this reasonable, or am I too far out of my league?
> <
> For the same reason you do not split FADD into 3 instructions, you should
> not split LDs into 2 instructions {to say nothing of the code density problems
> you will face when making this choice.}

Yes, that is why I support signed loads; it saves an instruction.

> In order to split a LD into a Fetch and a Decompose instruction, you need
> the LD instruction to create a bit extraction specifier that gets passed to
> the decompose instruction, and this adds to register write port pressure.

I assume part of the reason Intel has 3-cycle L1 access is sign extension.
There may not be much benefit in also allowing unsigned access in 2 cycles
at the same time?

POWER has some fancy extract instructions; load versions of these
instructions could also make use of the bit extraction specifier in the
load unit.

>>>>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
>>>>>> a bit position would not have affected the cycle time.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<d03fb26e-3719-486d-b1e9-2ac4dccddfc6n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=23286&group=comp.arch#23286

X-Received: by 2002:a05:6214:d0c:: with SMTP id 12mr3693563qvh.93.1644015926997;
Fri, 04 Feb 2022 15:05:26 -0800 (PST)
X-Received: by 2002:a05:6870:d396:: with SMTP id k22mr1311158oag.327.1644015926810;
Fri, 04 Feb 2022 15:05:26 -0800 (PST)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!border1.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Fri, 4 Feb 2022 15:05:26 -0800 (PST)
In-Reply-To: <stk5c8$oga$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:302e:7005:d193:7915;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:302e:7005:d193:7915
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at> <5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me> <c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me> <stf6qk$5bs$1@dont-email.me> <6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>
<stk5c8$oga$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <d03fb26e-3719-486d-b1e9-2ac4dccddfc6n@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Fri, 04 Feb 2022 23:05:26 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 47
 by: MitchAlsup - Fri, 4 Feb 2022 23:05 UTC

On Friday, February 4, 2022 at 3:21:16 PM UTC-6, gg...@yahoo.com wrote:
> MitchAlsup <Mitch...@aol.com> wrote:
> > On Wednesday, February 2, 2022 at 6:15:25 PM UTC-6, gg...@yahoo.com wrote:
> >> Brett <gg...@yahoo.com> wrote:
> >>> MitchAlsup <Mitch...@aol.com> wrote:
> >
> >>>> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
> >>>> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
> >>>
> >>> I rounded up to integer cycle counts, same as two instructions- load and
> >>> extend, except now as one instruction saving instruction cache. So you can
> >>> crack it, or split the load path at the extender unit. Perhaps only one of
> >>> the two load units would support extension if you split the load. If that
> >>> load port is full you can do a late crack.
> > <
> >> Just realized that loads come back out of order and you have no control
> >> over which load path used, so you would split both load paths. This causes
> >> some issues with more write ports to the ALU. But you could queue these
> >> loads instead.
> >>
> >> Is any of this reasonable, or am I too far out of my league?
> > <
> > For the same reason you do not split FADD into 3 instructions, you should
> > not split LDs into 2 instructions {to say nothing of the code density problems
> > you will face when making this choice.}
> Yes, that is why I support signed loads, saves an instruction.
> > In order to split a LD into a Fetch and a Decompose instruction, you need
> > the LD instruction to create a bit extraction specifier that gets passed to
> > the decompose instruction, and this adds to register write port pressure.

> I assume part of the reason Intel has 3 cycle L1 access is sign extension.
<
I would label the problem as "byte extraction"* of which sign extension is a
minor complication.
<
(*) or Load Alignment.
<
> There may not be much benefit from allowing unsigned access in 2 cycles at
> the same time?
<
When you look into passing properly sized and aligned data without going through
the Byte Extractor, you complicate the memory/cache pipeline design.
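
A sketch of the load-alignment ("byte extraction") step in question:
select a 1/2/4/8-byte field out of a 64-bit beat using the low address
bits (little-endian lanes and an access that does not cross the beat are
assumed; sign extension, the "minor complication", would then just
replicate the top bit of the selected field):

#include <stdint.h>

static uint64_t extract_field(uint64_t beat, unsigned addr, unsigned size)
{
    unsigned shift = (addr & 7u) * 8u;       /* which byte lane          */
    unsigned bits  = size * 8u;              /* field width: 8..64 bits  */
    uint64_t v     = beat >> shift;          /* the crossbar/mux step    */
    return bits == 64 ? v : v & ((1ull << bits) - 1);
}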
>
> POWER has some fancy extract instructions, load versions of these
> instructions could also make use of the bit extraction specifier on the
> load unit.
> >>>>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
> >>>>>> a bit position would not have affected the cycle time.

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<stl8u5$db0$1@dont-email.me>

https://www.novabbs.com/devel/article-flat.php?id=23290&group=comp.arch#23290

Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ggt...@yahoo.com (Brett)
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Sat, 5 Feb 2022 07:28:06 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 65
Message-ID: <stl8u5$db0$1@dont-email.me>
References: <sso6aq$37b$1@newsreader4.netcologne.de>
<UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at>
<5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me>
<c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me>
<stf6qk$5bs$1@dont-email.me>
<6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>
<stk5c8$oga$1@dont-email.me>
<d03fb26e-3719-486d-b1e9-2ac4dccddfc6n@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sat, 5 Feb 2022 07:28:06 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="47f5ff748bfdbe9356228569b173b42e";
logging-data="13664"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX181oMlBkRwGamUfq/7vjlQA"
User-Agent: NewsTap/5.5 (iPad)
Cancel-Lock: sha1:LxXUlD6+M1C2bmwyyFH+er+tG0A=
sha1:OVL6dzu5x9Qg72B+9jZr1o4yOzY=
 by: Brett - Sat, 5 Feb 2022 07:28 UTC

MitchAlsup <MitchAlsup@aol.com> wrote:
> On Friday, February 4, 2022 at 3:21:16 PM UTC-6, gg...@yahoo.com wrote:
>> MitchAlsup <Mitch...@aol.com> wrote:
>>> On Wednesday, February 2, 2022 at 6:15:25 PM UTC-6, gg...@yahoo.com wrote:
>>>> Brett <gg...@yahoo.com> wrote:
>>>>> MitchAlsup <Mitch...@aol.com> wrote:
>>>
>>>>>> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
>>>>>> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
>>>>>
>>>>> I rounded up to integer cycle counts, same as two instructions- load and
>>>>> extend, except now as one instruction saving instruction cache. So you can
>>>>> crack it, or split the load path at the extender unit. Perhaps only one of
>>>>> the two load units would support extension if you split the load. If that
>>>>> load port is full you can do a late crack.
>>> <
>>>> Just realized that loads come back out of order and you have no control
>>>> over which load path used, so you would split both load paths. This causes
>>>> some issues with more write ports to the ALU. But you could queue these
>>>> loads instead.
>>>>
>>>> Is any of this reasonable, or am I too far out of my league?
>>> <
>>> For the same reason you do not split FADD into 3 instructions, you should
>>> not split LDs into 2 instructions {to say nothing of the code density problems
>>> you will face when making this choice.}
>> Yes, that is why I support signed loads, saves an instruction.
>>> In order to split a LD into a Fetch and a Decompose instruction, you need
>>> the LD instruction to create a bit extraction specifier that gets passed to
>>> the decompose instruction, and this adds to register write port pressure.
>
>> I assume part of the reason Intel has 3 cycle L1 access is sign extension.
> <
> I would label the problem as "byte extraction"* of which sign extension is a
> minor complication.
> <
> (*) or Load Alignment.

Sign extension is two gate delays; extraction is half a dozen or so, plus
lots of crossing wires. Clearly extraction is the harder, limiting task.

>> There may not be much benefit from allowing unsigned access in 2 cycles at
>> the same time?
> <
> When you look into passing properly sized and aligned data without going through
> the Byte Extractor, you complicate the memory/cache pipeline design.

How about the Apple M1, which I hear does two loads a cycle from cache, but
pulls full lines and can support many more than two results a cycle from
the cache?

I love this idea: a cheap way to get enough loads for a six-wide design,
and it saves power.

I would guess this costs a gate delay and 50% more transistors?

>> POWER has some fancy extract instructions, load versions of these
>> instructions could also make use of the bit extraction specifier on the
>> load unit.
>>>>>>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
>>>>>>>> a bit position would not have affected the cycle time.
>

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<f108750f-3fab-40aa-8a65-2eb3149ccd82n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=23292&group=comp.arch#23292

X-Received: by 2002:a05:6214:400f:: with SMTP id kd15mr5824861qvb.69.1644080136835;
Sat, 05 Feb 2022 08:55:36 -0800 (PST)
X-Received: by 2002:a9d:d10:: with SMTP id 16mr1559819oti.142.1644080136559;
Sat, 05 Feb 2022 08:55:36 -0800 (PST)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 5 Feb 2022 08:55:36 -0800 (PST)
In-Reply-To: <stl8u5$db0$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:60d3:1a7f:3d93:7e73;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:60d3:1a7f:3d93:7e73
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<2022Jan25.235313@mips.complang.tuwien.ac.at> <5NdIJ.15873$mS1.13257@fx10.iad>
<ssst1s$pk9$1@dont-email.me> <c7537b99-c3f7-45ad-a40f-2c38904b2a4cn@googlegroups.com>
<ssvo55$a3i$1@dont-email.me> <stf6qk$5bs$1@dont-email.me> <6ba7592a-33a6-4c6b-ae7e-9d804c43c31fn@googlegroups.com>
<stk5c8$oga$1@dont-email.me> <d03fb26e-3719-486d-b1e9-2ac4dccddfc6n@googlegroups.com>
<stl8u5$db0$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <f108750f-3fab-40aa-8a65-2eb3149ccd82n@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Sat, 05 Feb 2022 16:55:36 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 70
 by: MitchAlsup - Sat, 5 Feb 2022 16:55 UTC

On Saturday, February 5, 2022 at 1:28:10 AM UTC-6, gg...@yahoo.com wrote:
> MitchAlsup <Mitch...@aol.com> wrote:
> > On Friday, February 4, 2022 at 3:21:16 PM UTC-6, gg...@yahoo.com wrote:
> >> MitchAlsup <Mitch...@aol.com> wrote:
> >>> On Wednesday, February 2, 2022 at 6:15:25 PM UTC-6, gg...@yahoo.com wrote:
> >>>> Brett <gg...@yahoo.com> wrote:
> >>>>> MitchAlsup <Mitch...@aol.com> wrote:
> >>>
> >>>>>> How do you do this when an ADD takes 3/4 of a cycle and an SRAM
> >>>>>> access takes 1+1/8 cycle, and the sign extension aligner takes 3/8 cycle ?
> >>>>>
> >>>>> I rounded up to integer cycle counts, same as two instructions- load and
> >>>>> extend, except now as one instruction saving instruction cache. So you can
> >>>>> crack it, or split the load path at the extender unit. Perhaps only one of
> >>>>> the two load units would support extension if you split the load. If that
> >>>>> load port is full you can do a late crack.
> >>> <
> >>>> Just realized that loads come back out of order and you have no control
> >>>> over which load path used, so you would split both load paths. This causes
> >>>> some issues with more write ports to the ALU. But you could queue these
> >>>> loads instead.
> >>>>
> >>>> Is any of this reasonable, or am I too far out of my league?
> >>> <
> >>> For the same reason you do not split FADD into 3 instructions, you should
> >>> not split LDs into 2 instructions {to say nothing of the code density problems
> >>> you will face when making this choice.}
> >> Yes, that is why I support signed loads, saves an instruction.
> >>> In order to split a LD into a Fetch and a Decompose instruction, you need
> >>> the LD instruction to create a bit extraction specifier that gets passed to
> >>> the decompose instruction, and this adds to register write port pressure.
> >
> >> I assume part of the reason Intel has 3 cycle L1 access is sign extension.
> > <
> > I would label the problem as "byte extraction"* of which sign extension is a
> > minor complication.
> > <
> > (*) or Load Alignment.
<
> Sign extension is two gate delays, extraction is half a dozen or so and
<
It takes 2 gate delays to buffer up the bit selected as the sign so it can be
multiplexed into the result (1:9 multiplexer is 2 gates of delay plus wire).
<
> lots of crossing wires. Clearly extraction is the harder and limiting task.
<
Wire delay and the fan-out to drive the wires cost more than the gate delays.
<
<
> >> There may not be much benefit from allowing unsigned access in 2 cycles at
> >> the same time?
> > <
> > When you look into passing properly sized and aligned data without going through
> > the Byte Extractor, you complicate the memory/cache pipeline design.
<
> How about the Apple M1 which I hear does two loads a cycle from cache, but
> pulls full lines and can support many more than two results a cycle from
> cache.
<
Athlon was doing 2 LDs per cycle to different banks of its cache in 1999.
>
> I love this idea, a cheap way to get enough loads for a six wide design,
> and saves power.
>
> I would guess this costs a gate delay and 50% more transistors?
> >> POWER has some fancy extract instructions, load versions of these
> >> instructions could also make use of the bit extraction specifier on the
> >> load unit.
> >>>>>>>> Anyway, SEXT and ZEXT to sign or zero extend a register from
> >>>>>>>> a bit position would not have affected the cycle time.
> >

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<t10mvq$4oe$1@gioia.aioe.org>

https://www.novabbs.com/devel/article-flat.php?id=24299&group=comp.arch#24299

Path: i2pn2.org!i2pn.org!aioe.org!NZ87pNe1TKxNDknVl4tZhw.user.46.165.242.91.POSTED!not-for-mail
From: antis...@math.uni.wroc.pl
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Fri, 18 Mar 2022 01:24:10 -0000 (UTC)
Organization: Aioe.org NNTP Server
Message-ID: <t10mvq$4oe$1@gioia.aioe.org>
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad> <ssplm7$1sv$1@newsreader4.netcologne.de> <sspnd8$3d6$1@newsreader4.netcologne.de> <tJZHJ.10626$8Q.353@fx19.iad> <ssqrr7$ptr$1@newsreader4.netcologne.de> <0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com>
Injection-Info: gioia.aioe.org; logging-data="4878"; posting-host="NZ87pNe1TKxNDknVl4tZhw.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-9-amd64 (x86_64))
Cancel-Lock: sha1:gcl2RGIcke2TYizzE+QWPWnhHcE=
X-Notice: Filtered by postfilter v. 0.9.2
 by: antis...@math.uni.wroc.pl - Fri, 18 Mar 2022 01:24 UTC

Quadibloc <jsavard@ecn.ab.ca> wrote:
> On Wednesday, January 26, 2022 at 12:05:13 AM UTC-7, Thomas Koenig wrote:
>
> > I especially like the sentence
> >
> > DEC's main advocate on the IEEE p754 Committee was a Mathematician
> > and Numerical Analyst Dr. Mary H. Payne. She was experienced,
> > fully competent,
> >
> > and misled by DEC's hardware engineers
> >
> > when they assured her that Gradual Underflow was unimplementable
> > as the default (no trap) of any computer arithmetic aspiring to
> > high performance.
>
> That is true, but, although I may be mistaken, it appeared to me that
> while one certainly _could_ have a high-performance floating-point
> implementation which fully supported IEEE 754 gradual underflow,
>
> one would have to do it by representing floating-point numbers in an
> "internal form" when they're in registers, like the way the 8087 did it,
> where the internal form was like an old-style plain floating-point format,
> but with a range larger than the architectural floating-point format, so
> that it covered all the gradual underflow territory.
>
> And, of course, if you do it that way, you can't properly and accurately
> trap when a register-to-register calculation underflows, because that isn't
> easily visible any longer. Instead, you find out when you store the result
> in memory.

Well, "internal format" really means one extra exponent bit.
Detecting underflow with such format does not present extra
difficulty. Real problem is that one has to properly round
result so that it does not have more accuracy than allowed
by denormal representation. AFAICS rounding after addition
does not present problem. But rounding after multiplication
of division may be problematic, basically instead of rounding
in fixed binary position one need rounding at variable position.

> So if you _exclude_ the idea of doing calculations on an internal form
> of numbers, because it's problematic, then indeed gradual underflow will
> obstruct high performance.

Well, there is a need for extra hardware in the data path. Depending
on the organization of the FPU, almost all of the hardware needed for
denormals may already be there. Probably the worst case is an FPU with
a separate load/store unit, adder and multiplier. When operating
internally on normalized numbers, such an FPU needs an extra exponent
bit, a shifter in the load/store unit (not needed otherwise), and extra
rounding circuitry after multiplication. This may increase load/store
latency and multiplication latency. AFAICS allowing denormals
internally means slightly modified arithmetic units, to avoid inserting
the hidden bit for denormals. For multiplication and division there is
again the problem of rounding, plus the need to shift (to get the
denormal representation).

Either case does not look like a very big problem. OTOH the extra
gates may lower the clock frequency by a few percent or add an extra
clock of latency.

I find denormals of almost no use, so even a low extra cost seems too
high. But other folks may prefer a different tradeoff.
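
As a worked illustration of that variable rounding position (binary64
assumed, the exponents below are just examples): the further a result
falls below the normal range, the fewer fraction bits the denormal
encoding keeps, so the rounding point moves with the exponent.

#include <stdio.h>

/* fraction bits kept below the leading bit of a result with
   unbiased exponent e, once it is encoded as a binary64 denormal */
static int kept_fraction_bits(int e)
{
    int shortfall = -1022 - e;         /* distance below the normal range */
    if (shortfall <= 0) return 52;     /* normal: fixed rounding position */
    if (shortfall > 52) return 0;      /* at or below the smallest denorm */
    return 52 - shortfall;             /* denormal: variable position     */
}

int main(void)
{
    int examples[] = { -1020, -1030, -1050, -1074 };
    for (int i = 0; i < 4; i++)
        printf("exponent %5d keeps %2d fraction bits\n",
               examples[i], kept_fraction_bits(examples[i]));
    return 0;
}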

--
Waldek Hebisch

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<t11h9a$g5v$1@gioia.aioe.org>

https://www.novabbs.com/devel/article-flat.php?id=24300&group=comp.arch#24300

Path: i2pn2.org!i2pn.org!usenet.goja.nl.eu.org!aioe.org!EhtdJS5E9ITDZpJm3Uerlg.user.46.165.242.91.POSTED!not-for-mail
From: terje.ma...@tmsw.no (Terje Mathisen)
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Fri, 18 Mar 2022 09:52:56 +0100
Organization: Aioe.org NNTP Server
Message-ID: <t11h9a$g5v$1@gioia.aioe.org>
References: <sso6aq$37b$1@newsreader4.netcologne.de>
<UPXHJ.6202$9O.4300@fx12.iad> <ssplm7$1sv$1@newsreader4.netcologne.de>
<sspnd8$3d6$1@newsreader4.netcologne.de> <tJZHJ.10626$8Q.353@fx19.iad>
<ssqrr7$ptr$1@newsreader4.netcologne.de>
<0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com>
<t10mvq$4oe$1@gioia.aioe.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="16575"; posting-host="EhtdJS5E9ITDZpJm3Uerlg.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101
Firefox/68.0 SeaMonkey/2.53.11
X-Notice: Filtered by postfilter v. 0.9.2
 by: Terje Mathisen - Fri, 18 Mar 2022 08:52 UTC

antispam@math.uni.wroc.pl wrote:
> Quadibloc <jsavard@ecn.ab.ca> wrote:
>> On Wednesday, January 26, 2022 at 12:05:13 AM UTC-7, Thomas Koenig wrote:
>>
>>> I especially like the sentence
>>>
>>> DEC's main advocate on the IEEE p754 Committee was a Mathematician
>>> and Numerical Analyst Dr. Mary H. Payne. She was experienced,
>>> fully competent,
>>>
>>> and misled by DEC's hardware engineers
>>>
>>> when they assured her that Gradual Underflow was unimplementable
>>> as the default (no trap) of any computer arithmetic aspiring to
>>> high performance.
>>
>> That is true, but, although I may be mistaken, it appeared to me that
>> while one certainly _could_ have a high-performance floating-point
>> implementation which fully supported IEEE 754 gradual underflow,
>>
>> one would have to do it by representing floating-point numbers in an
>> "internal form" when they're in registers, like the way the 8087 did it,
>> where the internal form was like an old-style plain floating-point format,
>> but with a range larger than the architectural floating-point format, so
>> that it covered all the gradual underflow territory.
>>
>> And, of course, if you do it that way, you can't properly and accurately
>> trap when a register-to-register calculation underflows, because that isn't
>> easily visible any longer. Instead, you find out when you store the result
>> in memory.
>
> Well, "internal format" really means one extra exponent bit.
> Detecting underflow with such format does not present extra
> difficulty. Real problem is that one has to properly round
> result so that it does not have more accuracy than allowed
> by denormal representation. AFAICS rounding after addition
> does not present problem. But rounding after multiplication
> of division may be problematic, basically instead of rounding
> in fixed binary position one need rounding at variable position.
>
>> So if you _exclude_ the idea of doing calculations on an internal form
>> of numbers, because it's problematic, then indeed gradual underflow will
>> obstruct high performance.
>
> Well, there is need for extra hardware in data path. Depending
> on organization of FPU almost all hardware needed for denormals
> may be already there. Probably worst case is FPU with separate
> load/store unit, adder and multiplier. When operating internally
> on normalized numbers such FPU needs extra exponenent bit,
> shifter in load/store unit (not needed otherwise), and extra
> rounding circuitry after multiplication. This may increase
> load/store latency and multiplication latency. AFAICS allowing
> denormals internaly means slightly modified arithemtic units
> to avoid inserting hideden bit for denormals. For multiplication
> and division again there is problem of rounding plus need to
> shift (to get denormal representation).
>
> Either case does not look like a very bing problem. OTOH
> extra gates may lower clock frequentcy by few percent or
> add extra clock of latency.
>
> I find denormals of almost no use, so even low extra cost
> seem too high. But other folks may prefer different
> tradeof.

I seldom _need_ subnormals myself, but I have been told that there exist
solution solvers/zero finders that fail to stabilize (you end up
flipping back & forth between two values) unless you have subnormals.

That, plus the fact (which you mention above) that any FMAC-supporting
fp unit already has all the required circuitry to let you skip
input normalization, means that I, like Mitch Alsup, strongly believe all
fp units should support subnormal values at zero cycle cost.
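
A small concrete case of that failure mode (illustrative values only):
the property a "stop when the difference is exactly zero" test relies on
is that x != y implies x - y != 0, and that holds only with gradual
underflow.

#include <stdio.h>

int main(void)
{
    double x = 3.0e-308;   /* both are normal binary64 values, just above  */
    double y = 2.5e-308;   /* the smallest normal (~2.225e-308)            */
    double d = x - y;      /* 5e-309 is representable only as a subnormal; */
                           /* flush-to-zero would make it exactly 0        */
    printf("x != y : %d\n", x != y);
    printf("x - y  = %g\n", d);
    return 0;
}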

Terje

--
- <Terje.Mathisen at tmsw.no>
"almost all programming can be viewed as an exercise in caching"

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<2e234d6c-be42-4df7-a8e7-3345e22e6acdn@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=24302&group=comp.arch#24302

X-Received: by 2002:a05:620a:410b:b0:67d:d59c:13b8 with SMTP id j11-20020a05620a410b00b0067dd59c13b8mr6025884qko.449.1647613748436;
Fri, 18 Mar 2022 07:29:08 -0700 (PDT)
X-Received: by 2002:a05:6870:5829:b0:c8:9f42:f919 with SMTP id
r41-20020a056870582900b000c89f42f919mr3793349oap.54.1647613748108; Fri, 18
Mar 2022 07:29:08 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Fri, 18 Mar 2022 07:29:07 -0700 (PDT)
In-Reply-To: <t10mvq$4oe$1@gioia.aioe.org>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:bc9d:792b:1cbb:a18b;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:bc9d:792b:1cbb:a18b
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<ssplm7$1sv$1@newsreader4.netcologne.de> <sspnd8$3d6$1@newsreader4.netcologne.de>
<tJZHJ.10626$8Q.353@fx19.iad> <ssqrr7$ptr$1@newsreader4.netcologne.de>
<0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com> <t10mvq$4oe$1@gioia.aioe.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <2e234d6c-be42-4df7-a8e7-3345e22e6acdn@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Fri, 18 Mar 2022 14:29:08 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 82
 by: MitchAlsup - Fri, 18 Mar 2022 14:29 UTC

On Thursday, March 17, 2022 at 8:24:14 PM UTC-5, anti...@math.uni.wroc.pl wrote:
> Quadibloc <jsa...@ecn.ab.ca> wrote:
> > On Wednesday, January 26, 2022 at 12:05:13 AM UTC-7, Thomas Koenig wrote:
> >
> > > I especially like the sentence
> > >
> > > DEC's main advocate on the IEEE p754 Committee was a Mathematician
> > > and Numerical Analyst Dr. Mary H. Payne. She was experienced,
> > > fully competent,
> > >
> > > and misled by DEC's hardware engineers
> > >
> > > when they assured her that Gradual Underflow was unimplementable
> > > as the default (no trap) of any computer arithmetic aspiring to
> > > high performance.
> >
> > That is true, but, although I may be mistaken, it appeared to me that
> > while one certainly _could_ have a high-performance floating-point
> > implementation which fully supported IEEE 754 gradual underflow,
> >
> > one would have to do it by representing floating-point numbers in an
> > "internal form" when they're in registers, like the way the 8087 did it,
> > where the internal form was like an old-style plain floating-point format,
> > but with a range larger than the architectural floating-point format, so
> > that it covered all the gradual underflow territory.
> >
> > And, of course, if you do it that way, you can't properly and accurately
> > trap when a register-to-register calculation underflows, because that isn't
> > easily visible any longer. Instead, you find out when you store the result
> > in memory.
>
> Well, "internal format" really means one extra exponent bit.
> Detecting underflow with such format does not present extra
> difficulty. Real problem is that one has to properly round
> result so that it does not have more accuracy than allowed
> by denormal representation. AFAICS rounding after addition
> does not present problem. But rounding after multiplication
> of division may be problematic, basically instead of rounding
> in fixed binary position one need rounding at variable position.
>
> > So if you _exclude_ the idea of doing calculations on an internal form
> > of numbers, because it's problematic, then indeed gradual underflow will
> > obstruct high performance.
>
> Well, there is need for extra hardware in data path. Depending
> on organization of FPU almost all hardware needed for denormals
> may be already there. Probably worst case is FPU with separate
> load/store unit, adder and multiplier. When operating internally
> on normalized numbers such FPU needs extra exponenent bit,
> shifter in load/store unit (not needed otherwise), and extra
> rounding circuitry after multiplication.
<
Once your FPU design is centered around FMAC, the added circuitry
is about 1.2 gates per bit of accumulator width and 1 gate delay of
added latency.
<
The FMUL unit in Opteron was ~60 gates of delay.
The FADD unit of Opteron was ~50 gates of delay.
With a cycle time of 16 gates of logic per cycle.
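(Taking the quoted numbers at face value, as a back-of-the-envelope check:
60 gates / 16 gates per cycle ~= 3.75, i.e. a 4-cycle FMUL pipeline, and
50 / 16 ~= 3.1, a 3-4 cycle FADD, so one extra gate of delay is only about
1/16 of a cycle.)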
<
<
> This may increase
> load/store latency and multiplication latency. AFAICS allowing
> denormals internaly means slightly modified arithemtic units
> to avoid inserting hideden bit for denormals. For multiplication
> and division again there is problem of rounding plus need to
> shift (to get denormal representation).
>
> Either case does not look like a very bing problem. OTOH
> extra gates may lower clock frequentcy by few percent or
> add extra clock of latency.
>
> I find denormals of almost no use, so even low extra cost
> seem too high. But other folks may prefer different
> tradeof.
<
That was my position in 1985. I have since been edumacated to the
fact that denorms are not "expensive" and that it is easier to obey
the std than to hand-wave around not needing to fully comply.
<
>
> --
> Waldek Hebisch

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<1cb8bb2d-4e59-43f3-9992-ef658ec5ecden@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=24304&group=comp.arch#24304

X-Received: by 2002:a05:622a:181:b0:2e1:e70a:ec2a with SMTP id s1-20020a05622a018100b002e1e70aec2amr7930023qtw.42.1647617253300;
Fri, 18 Mar 2022 08:27:33 -0700 (PDT)
X-Received: by 2002:a9d:5e15:0:b0:5b2:5125:fd09 with SMTP id
d21-20020a9d5e15000000b005b25125fd09mr3438618oti.129.1647617253104; Fri, 18
Mar 2022 08:27:33 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Fri, 18 Mar 2022 08:27:32 -0700 (PDT)
In-Reply-To: <t11h9a$g5v$1@gioia.aioe.org>
Injection-Info: google-groups.googlegroups.com; posting-host=2a0d:6fc2:55b0:ca00:9c02:dcf9:eab4:49a;
posting-account=ow8VOgoAAAAfiGNvoH__Y4ADRwQF1hZW
NNTP-Posting-Host: 2a0d:6fc2:55b0:ca00:9c02:dcf9:eab4:49a
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<ssplm7$1sv$1@newsreader4.netcologne.de> <sspnd8$3d6$1@newsreader4.netcologne.de>
<tJZHJ.10626$8Q.353@fx19.iad> <ssqrr7$ptr$1@newsreader4.netcologne.de>
<0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com> <t10mvq$4oe$1@gioia.aioe.org>
<t11h9a$g5v$1@gioia.aioe.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <1cb8bb2d-4e59-43f3-9992-ef658ec5ecden@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: already5...@yahoo.com (Michael S)
Injection-Date: Fri, 18 Mar 2022 15:27:33 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 89
 by: Michael S - Fri, 18 Mar 2022 15:27 UTC

On Friday, March 18, 2022 at 10:53:03 AM UTC+2, Terje Mathisen wrote:
> anti...@math.uni.wroc.pl wrote:
> > Quadibloc <jsa...@ecn.ab.ca> wrote:
> >> On Wednesday, January 26, 2022 at 12:05:13 AM UTC-7, Thomas Koenig wrote:
> >>
> >>> I especially like the sentence
> >>>
> >>> DEC's main advocate on the IEEE p754 Committee was a Mathematician
> >>> and Numerical Analyst Dr. Mary H. Payne. She was experienced,
> >>> fully competent,
> >>>
> >>> and misled by DEC's hardware engineers
> >>>
> >>> when they assured her that Gradual Underflow was unimplementable
> >>> as the default (no trap) of any computer arithmetic aspiring to
> >>> high performance.
> >>
> >> That is true, but, although I may be mistaken, it appeared to me that
> >> while one certainly _could_ have a high-performance floating-point
> >> implementation which fully supported IEEE 754 gradual underflow,
> >>
> >> one would have to do it by representing floating-point numbers in an
> >> "internal form" when they're in registers, like the way the 8087 did it,
> >> where the internal form was like an old-style plain floating-point format,
> >> but with a range larger than the architectural floating-point format, so
> >> that it covered all the gradual underflow territory.
> >>
> >> And, of course, if you do it that way, you can't properly and accurately
> >> trap when a register-to-register calculation underflows, because that isn't
> >> easily visible any longer. Instead, you find out when you store the result
> >> in memory.
> >
> > Well, "internal format" really means one extra exponent bit.
> > Detecting underflow with such format does not present extra
> > difficulty. Real problem is that one has to properly round
> > result so that it does not have more accuracy than allowed
> > by denormal representation. AFAICS rounding after addition
> > does not present problem. But rounding after multiplication
> > of division may be problematic, basically instead of rounding
> > in fixed binary position one need rounding at variable position.
> >
> >> So if you _exclude_ the idea of doing calculations on an internal form
> >> of numbers, because it's problematic, then indeed gradual underflow will
> >> obstruct high performance.
> >
> > Well, there is need for extra hardware in data path. Depending
> > on organization of FPU almost all hardware needed for denormals
> > may be already there. Probably worst case is FPU with separate
> > load/store unit, adder and multiplier. When operating internally
> > on normalized numbers such FPU needs extra exponenent bit,
> > shifter in load/store unit (not needed otherwise), and extra
> > rounding circuitry after multiplication. This may increase
> > load/store latency and multiplication latency. AFAICS allowing
> > denormals internaly means slightly modified arithemtic units
> > to avoid inserting hideden bit for denormals. For multiplication
> > and division again there is problem of rounding plus need to
> > shift (to get denormal representation).
> >
> > Either case does not look like a very bing problem. OTOH
> > extra gates may lower clock frequentcy by few percent or
> > add extra clock of latency.
> >
> > I find denormals of almost no use, so even low extra cost
> > seem too high. But other folks may prefer different
> > tradeof.
> I seldom _need_ subnormal myself, but I have been told that there exists
> solution solvers/zero finders that fail to stabilize (you end up
> flipping back & forth between two values) unless you have subnormals.
>

That's what I expect for a "chord & tangent" solver, the one I was taught back in high
school, even before we were "officially" taught derivatives in math class.
Probably the same applies to the pure tangent (i.e. Newton) method, but here I'm less sure.

> That, plus the fact (which you mention above) that any FMAC-supporting
> fp unit already have all the required circuitry to allow you to skip
> input normalization, means that I like Mitch Alsup strongly believe all
> fp units should support subnormal values at zero cycle cost.

There are different scenarios for one or more denormal inputs or outputs.
In some of them a microtrap could be the best practical solution.
In particular, I am thinking about designs where the latency of a regular ("normal") FADD is lower than the latency of FMUL and of FMADD.
Mitch does not see a point in such designs, but I do see it.

> Terje
>
> --
> - <Terje.Mathisen at tmsw.no>
> "almost all programming can be viewed as an exercise in caching"

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<t12aif$182v$1@gioia.aioe.org>

https://www.novabbs.com/devel/article-flat.php?id=24305&group=comp.arch#24305

Path: i2pn2.org!i2pn.org!aioe.org!EhtdJS5E9ITDZpJm3Uerlg.user.46.165.242.91.POSTED!not-for-mail
From: terje.ma...@tmsw.no (Terje Mathisen)
Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
Date: Fri, 18 Mar 2022 17:04:29 +0100
Organization: Aioe.org NNTP Server
Message-ID: <t12aif$182v$1@gioia.aioe.org>
References: <sso6aq$37b$1@newsreader4.netcologne.de>
<UPXHJ.6202$9O.4300@fx12.iad> <ssplm7$1sv$1@newsreader4.netcologne.de>
<sspnd8$3d6$1@newsreader4.netcologne.de> <tJZHJ.10626$8Q.353@fx19.iad>
<ssqrr7$ptr$1@newsreader4.netcologne.de>
<0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com>
<t10mvq$4oe$1@gioia.aioe.org> <t11h9a$g5v$1@gioia.aioe.org>
<1cb8bb2d-4e59-43f3-9992-ef658ec5ecden@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="41055"; posting-host="EhtdJS5E9ITDZpJm3Uerlg.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101
Firefox/68.0 SeaMonkey/2.53.11
X-Notice: Filtered by postfilter v. 0.9.2
 by: Terje Mathisen - Fri, 18 Mar 2022 16:04 UTC

Michael S wrote:
> On Friday, March 18, 2022 at 10:53:03 AM UTC+2, Terje Mathisen wrote:
>> I seldom _need_ subnormal myself, but I have been told that there exists
>> solution solvers/zero finders that fail to stabilize (you end up
>> flipping back & forth between two values) unless you have subnormals.
>>
>
> That's what I expect for "chord&tangent" solver, the one I was taught back in high school
> even before we "officially" were taught derivatives in the math class.
> Probably, the same applies to pure tangent (i.e. Newton) method, but here I'm less sure.
>
>> That, plus the fact (which you mention above) that any FMAC-supporting
>> fp unit already have all the required circuitry to allow you to skip
>> input normalization, means that I like Mitch Alsup strongly believe all
>> fp units should support subnormal values at zero cycle cost.
>
> There are different scenarios for one or more denormal input or output.
> In some of them microtrap could be the best practical solution.
> In particular, I am thinking about designs where latency of regular ("normal") FADD is lower than latency of FMUL and of FMADD.
> Mitch does not see a point in such designs, but I do see it.

After spending 2016-2019 on the IEEE 754 update working group I strongly
support Mitch's viewpoint here: just optimize the heck out of FMAC and
make that the basic building block for all your operations, including
FDIV/FSQRT and all the transcendentals.

The possibility of getting a pure FADD one cycle faster, but only when
none of the potentially many parallel SIMD operations hits a subnormal
input or output, fails to excite me.
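
A sketch of what "FMAC as the basic building block" looks like for
division (illustrative only: the float divide stands in for whatever
low-precision reciprocal estimate the hardware provides, and the fixed
iteration count is not tuned):

#include <math.h>
#include <stdio.h>

static double recip(double d)
{
    double r = 1.0f / (float)d;       /* stand-in for a hardware
                                         low-precision estimate    */
    for (int i = 0; i < 3; i++) {     /* Newton-Raphson refinement */
        double e = fma(-d, r, 1.0);   /* e = 1 - d*r, one rounding */
        r = fma(r, e, r);             /* r = r + r*e               */
    }
    return r;
}

int main(void)
{
    printf("1/3 ~= %.17g\n", recip(3.0));
    return 0;
}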

Terje

--
- <Terje.Mathisen at tmsw.no>
"almost all programming can be viewed as an exercise in caching"

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<ea3fec28-7635-450a-afa9-ad3d93baef97n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=24307&group=comp.arch#24307

X-Received: by 2002:a0c:e6c5:0:b0:42c:d5f:7e4c with SMTP id l5-20020a0ce6c5000000b0042c0d5f7e4cmr7855125qvn.93.1647629032485;
Fri, 18 Mar 2022 11:43:52 -0700 (PDT)
X-Received: by 2002:a05:6808:152b:b0:2ec:f48f:8120 with SMTP id
u43-20020a056808152b00b002ecf48f8120mr5175928oiw.58.1647629032102; Fri, 18
Mar 2022 11:43:52 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Fri, 18 Mar 2022 11:43:51 -0700 (PDT)
In-Reply-To: <1cb8bb2d-4e59-43f3-9992-ef658ec5ecden@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:dd3d:e99a:37f7:e0f7;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:dd3d:e99a:37f7:e0f7
References: <sso6aq$37b$1@newsreader4.netcologne.de> <UPXHJ.6202$9O.4300@fx12.iad>
<ssplm7$1sv$1@newsreader4.netcologne.de> <sspnd8$3d6$1@newsreader4.netcologne.de>
<tJZHJ.10626$8Q.353@fx19.iad> <ssqrr7$ptr$1@newsreader4.netcologne.de>
<0cf5023d-3458-46d2-ad3d-fa0e6ecb18dfn@googlegroups.com> <t10mvq$4oe$1@gioia.aioe.org>
<t11h9a$g5v$1@gioia.aioe.org> <1cb8bb2d-4e59-43f3-9992-ef658ec5ecden@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <ea3fec28-7635-450a-afa9-ad3d93baef97n@googlegroups.com>
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Fri, 18 Mar 2022 18:43:52 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 114
 by: MitchAlsup - Fri, 18 Mar 2022 18:43 UTC

On Friday, March 18, 2022 at 10:27:35 AM UTC-5, Michael S wrote:
> On Friday, March 18, 2022 at 10:53:03 AM UTC+2, Terje Mathisen wrote:
> > anti...@math.uni.wroc.pl wrote:
> > > Quadibloc <jsa...@ecn.ab.ca> wrote:
> > >> On Wednesday, January 26, 2022 at 12:05:13 AM UTC-7, Thomas Koenig wrote:
> > >>
> > >>> I especially like the sentence
> > >>>
> > >>> DEC's main advocate on the IEEE p754 Committee was a Mathematician
> > >>> and Numerical Analyst Dr. Mary H. Payne. She was experienced,
> > >>> fully competent,
> > >>>
> > >>> and misled by DEC's hardware engineers
> > >>>
> > >>> when they assured her that Gradual Underflow was unimplementable
> > >>> as the default (no trap) of any computer arithmetic aspiring to
> > >>> high performance.
> > >>
> > >> That is true, but, although I may be mistaken, it appeared to me that
> > >> while one certainly _could_ have a high-performance floating-point
> > >> implementation which fully supported IEEE 754 gradual underflow,
> > >>
> > >> one would have to do it by representing floating-point numbers in an
> > >> "internal form" when they're in registers, like the way the 8087 did it,
> > >> where the internal form was like an old-style plain floating-point format,
> > >> but with a range larger than the architectural floating-point format, so
> > >> that it covered all the gradual underflow territory.
> > >>
> > >> And, of course, if you do it that way, you can't properly and accurately
> > >> trap when a register-to-register calculation underflows, because that isn't
> > >> easily visible any longer. Instead, you find out when you store the result
> > >> in memory.
> > >
> > > Well, "internal format" really means one extra exponent bit.
> > > Detecting underflow with such format does not present extra
> > > difficulty. Real problem is that one has to properly round
> > > result so that it does not have more accuracy than allowed
> > > by denormal representation. AFAICS rounding after addition
> > > does not present problem. But rounding after multiplication
> > > of division may be problematic, basically instead of rounding
> > > in fixed binary position one need rounding at variable position.
> > >
> > >> So if you _exclude_ the idea of doing calculations on an internal form
> > >> of numbers, because it's problematic, then indeed gradual underflow will
> > >> obstruct high performance.
> > >
> > > Well, there is need for extra hardware in data path. Depending
> > > on organization of FPU almost all hardware needed for denormals
> > > may be already there. Probably worst case is FPU with separate
> > > load/store unit, adder and multiplier. When operating internally
> > > on normalized numbers such FPU needs extra exponenent bit,
> > > shifter in load/store unit (not needed otherwise), and extra
> > > rounding circuitry after multiplication. This may increase
> > > load/store latency and multiplication latency. AFAICS allowing
> > > denormals internaly means slightly modified arithemtic units
> > > to avoid inserting hideden bit for denormals. For multiplication
> > > and division again there is problem of rounding plus need to
> > > shift (to get denormal representation).
> > >
> > > Either case does not look like a very bing problem. OTOH
> > > extra gates may lower clock frequentcy by few percent or
> > > add extra clock of latency.
> > >
> > > I find denormals of almost no use, so even low extra cost
> > > seem too high. But other folks may prefer different
> > > tradeof.
> > I seldom _need_ subnormal myself, but I have been told that there exists
> > solution solvers/zero finders that fail to stabilize (you end up
> > flipping back & forth between two values) unless you have subnormals.
> >
> That's what I expect for "chord&tangent" solver, the one I was taught back in high school
> even before we "officially" were taught derivatives in the math class.
> Probably, the same applies to pure tangent (i.e. Newton) method, but here I'm less sure.
> > That, plus the fact (which you mention above) that any FMAC-supporting
> > fp unit already have all the required circuitry to allow you to skip
> > input normalization, means that I like Mitch Alsup strongly believe all
> > fp units should support subnormal values at zero cycle cost.
<
> There are different scenarios for one or more denormal input or output.
> In some of them microtrap could be the best practical solution.
> In particular, I am thinking about designs where latency of regular ("normal")
> FADD is lower than latency of FMUL and of FMADD.
<
For both FADD and FMUL and FMAC and for the numerator of FDIV, a denorm
as input simply does not have to generate the hidden bit::
<
if( operand.exponent = 0 )
then 0.fraction;
else 1.fraction;
<
For the output of FADD and FMUL and FMAC and FDIV, you inject a synthetic
1-bit at the position the hidden bit would have been in the largest possible denorm.
This prevents the normalizer from shifting too many positions to the left and gives
you the proper denormalized result.
{Note: you insert the synthetic 1-bit only with the value going into the find-first
circuit, not the value going into the shifter the find-first controls.}
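
In C terms, for raw binary64 operand bits, the input-side rule amounts to
the following (a sketch of the input side only; the synthetic-bit trick
for the normalizer is not shown):

#include <stdint.h>

static uint64_t significand(uint64_t bits)   /* raw binary64 bits */
{
    uint64_t frac = bits & ((1ull << 52) - 1);
    uint64_t exp  = (bits >> 52) & 0x7FF;
    return exp ? frac | (1ull << 52)         /* normal:   1.fraction */
               : frac;                       /* denormal: 0.fraction */
}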
<
> Mitch does not see a point in such designs, but I do see it.
<
Correction: Mitch sees no value in not fully obeying IEEE 754-2019, because
the cost of fully obeying it is so small.
<
25 years ago I was in the same camp you seem to be in today. I was shown
various circuitry tricks that made the denorm problem vanish into a cost so
small I would be foolish not to do the complete package.
<
It costs far less to satisfy the denorm problem in the FPU than it costs to solve
the precise exception problem when taking exceptions. Far, far less. Yet you
would never accept doing without the latter.
<
> > Terje
> >
> > --
> > - <Terje.Mathisen at tmsw.no>
> > "almost all programming can be viewed as an exercise in caching"

Re: Why separate 32-bit arithmetic on a 64-bit architecture?

<3d95c40a-41c8-48db-a983-98a2fc066023n@googlegroups.com>

https://www.novabbs.com/devel/article-flat.php?id=24309&group=comp.arch#24309

Newsgroups: comp.arch
Subject: Re: Why separate 32-bit arithmetic on a 64-bit architecture?
From: jsav...@ecn.ab.ca (Quadibloc)
 by: Quadibloc - Fri, 18 Mar 2022 20:16 UTC

On Friday, March 18, 2022 at 12:43:53 PM UTC-6, MitchAlsup wrote:

> It costs far less to satisfy the denorm problem in the FPU than it costs to solve
> the precise exception problem when taking exceptions. Far, far less. Yet you
> would never accept doing without the latter.

There is cost, and then there is value.

People mistakenly believe that solving the denormals problem in
IEEE 754 floating-point has a significant cost, and so they're
against bothering, because computers got along fine for years
without denormals.

But exceptions were always precise before IBM came up with
the Tomasulo algorithm and the Model 91. (Actually, not quite:
the 6600 had imprecise interrupts too.) So this was a _change_
that deprived people of something they were used to and had no
idea how to do without.

Initially, the Model 91 didn't have precise exceptions; the manual
explained that there was this limitation of the machine compared
to others.

Incidentally, on a page about the history of this computer, I saw
this quote from a memo at SLAC: "The Model 91 designers have
provided a switch that puts the machine into non-overlapped
mode, in which it runs at about Model 75 speed but with precise
interrupts. This switch will be important for program debugging,
and it can be set and reset by programming."

This was cheaper than providing precise interrupts and out-of-order
operation at the same time; a similar feature on modern machines
might be useful as a Spectre mitigation: let only the untrusted
code run slower, since trusted code doesn't need it.

But the Model 91 was a usable computer as it was. So it is
possible to have a computer capable of doing useful work, with
I/O done by interrupts, even if the interrupts are imprecise.

Actually, _that_ makes sense. Interrupts to service I/O should be
totally invisible to the programs that are running concurrently,
so they shouldn't perturb those programs in any way. So it doesn't matter
whether the interrupt happens after an instruction or in the middle of
a complicated pipeline state, since as far as the running user-mode
programs are concerned, the interrupt could have been serviced by
another core.

It's when _traps_ are imprecise, like a divide-by-zero error, that it's
a pain; then the program has no choice but to fail, whereas with
precise traps the handler could apply a customized
fixup and set the program running again.

John Savard
