Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

The clash of ideas is the sound of freedom.


devel / comp.arch / Paper about ISO C

SubjectAuthor
* Paper about ISO Cclamky
+- Re: Paper about ISO CBGB
+* Re: Paper about ISO CDavid Brown
|+* Re: Paper about ISO CBGB
||+* Re: Paper about ISO CMitchAlsup
|||+* Re: Paper about ISO CMitchAlsup
||||`* Re: Paper about ISO CBranimir Maksimovic
|||| `* Re: Paper about ISO CGeorge Neuner
||||  +- Re: Paper about ISO CBranimir Maksimovic
||||  `* Re: Paper about ISO CEricP
||||   `* Re: Paper about ISO CIvan Godard
||||    `* Re: Paper about ISO CEricP
||||     `* Re: Paper about ISO CMitchAlsup
||||      +* Re: Paper about ISO CBGB
||||      |`* Re: Paper about ISO CStephen Fuld
||||      | +* Re: Paper about ISO CIvan Godard
||||      | |+* Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | ||+* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |||+- Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | |||+* Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | ||||`- Re: addressing and protection, was Paper about ISO CQuadibloc
||||      | |||`- Re: addressing and protection, was Paper about ISO CBill Findlay
||||      | ||`* Re: addressing and protection, was Paper about ISO CQuadibloc
||||      | || +- Re: addressing and protection, was Paper about ISO CStephen Fuld
||||      | || `- Re: addressing and protection, was Paper about ISO CBranimir Maksimovic
||||      | |`* Re: addressing and protection, was Paper about ISO CEricP
||||      | | +* Re: addressing and protection, was Paper about ISO CStefan Monnier
||||      | | |`- Re: addressing and protection, was Paper about ISO CEricP
||||      | | `* Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | |  +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  |+* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  ||`* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  || `* Re: addressing and protection, was Paper about ISO CJohn Dallman
||||      | |  ||  +* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  ||  |+- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  ||  |+* Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  ||  ||`* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  ||  || `* Re: addressing and protection, was Paper about ISO CAnton Ertl
||||      | |  ||  ||  `- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  ||  |`* Re: addressing and protection, was Paper about ISO CQuadibloc
||||      | |  ||  | +- Re: addressing and protection, was Paper about ISO CQuadibloc
||||      | |  ||  | `- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  ||  `- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  |`* Re: addressing and protection, was Paper about ISO CJohn Dallman
||||      | |  | +* Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |`* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | | `* Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |  +* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |  |+* Re: addressing and protection, was Paper about ISO CStefan Monnier
||||      | |  | |  ||+- Re: addressing and protection, was Paper about ISO CChris M. Thomasson
||||      | |  | |  ||+* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |  |||`- Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |  ||+* Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |  |||`* Re: addressing and protection, was Paper about ISO CEricP
||||      | |  | |  ||| `- Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |  ||`- Re: addressing and protection, was Paper about ISO CJohn Dallman
||||      | |  | |  |`- Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |  `* Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | |  | |   +- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |   `* Re: addressing and protection, was Paper about ISO CStephen Fuld
||||      | |  | |    +* Address space consumption (was: addressing and protection, was Paper about ISO CStefan Monnier
||||      | |  | |    |`- Re: Address space consumption (was: addressing and protection, wasMitchAlsup
||||      | |  | |    +* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |    |`- Re: addressing and protection, was Paper about ISO CThomas Koenig
||||      | |  | |    `* Re: addressing and protection, was Paper about ISO CGeorge Neuner
||||      | |  | |     `* Re: addressing and protection, was Paper about ISO CStephen Fuld
||||      | |  | |      +* Re: addressing and protection, was Paper about ISO CMichael S
||||      | |  | |      |`* Re: addressing and protection, was Paper about ISO CTerje Mathisen
||||      | |  | |      | `- Re: addressing and protection, was Paper about ISO CMichael S
||||      | |  | |      `* Re: addressing and protection, was Paper about ISO CStefan Monnier
||||      | |  | |       +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       |`* Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | |  | |       | +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |`* Re: addressing and protection, was Paper about ISO CStephen Fuld
||||      | |  | |       | | `* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |  `* Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | |  | |       | |   +- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |   +- Re: addressing and protection, was Paper about ISO Cclamky
||||      | |  | |       | |   `* Re: addressing and protection, was Paper about ISO CGeorge Neuner
||||      | |  | |       | |    +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    |`* Re: addressing and protection, was Paper about ISO CGeorge Neuner
||||      | |  | |       | |    | +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | |+* Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | |  | |       | |    | ||+* Re: educational computation, was addressing and protection, was Paper about ISO John Levine
||||      | |  | |       | |    | |||`* Re: educational computation, was addressing and protection, was PaperIvan Godard
||||      | |  | |       | |    | ||| `- Re: educational computation, was addressing and protection, was PaperTerje Mathisen
||||      | |  | |       | |    | ||`* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | || `* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |       | |    | ||  +- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | ||  `- Re: addressing and protection, was Paper about ISO CDavid Brown
||||      | |  | |       | |    | |+- Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | |`* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |       | |    | | +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | | |+- Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | |  | |       | |    | | |`* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |       | |    | | | +* Re: addressing and protection, was Paper about ISO CIvan Godard
||||      | |  | |       | |    | | | |+* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |       | |    | | | ||`- Re: addressing and protection, was Paper about ISO CJimBrakefield
||||      | |  | |       | |    | | | |`* Re: addressing and protection, was Paper about ISO CTerje Mathisen
||||      | |  | |       | |    | | | | `* Re: addressing and protection, was Paper about ISO CTim Rentsch
||||      | |  | |       | |    | | | |  `- Re: addressing and protection, was Paper about ISO CTerje Mathisen
||||      | |  | |       | |    | | | +* Re: addressing and protection, was Paper about ISO CBGB
||||      | |  | |       | |    | | | `- Re: addressing and protection, was Paper about ISO CTerje Mathisen
||||      | |  | |       | |    | | `- Re: addressing and protection, was Paper about ISO CAnne & Lynn Wheeler
||||      | |  | |       | |    | `- Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | |       | |    `* Re: what is cheap these days, addressing and protection, was Paper about ISO CJohn Levine
||||      | |  | |       | +* Re: addressing and protection, was Paper about ISO CStephen Fuld
||||      | |  | |       | +* Re: addressing and protection, was Paper about ISO CMichael S
||||      | |  | |       | `* Re: addressing and protection, was Paper about ISO CMichael S
||||      | |  | |       +* RAM size (was: addressing and protection, was Paper about ISO C)Anton Ertl
||||      | |  | |       `- Re: addressing and protection, was Paper about ISO CTerje Mathisen
||||      | |  | +* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  | `* Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | |  +* Re: addressing and protection, was Paper about ISO CMitchAlsup
||||      | |  `* Re: addressing and protection, was Paper about ISO CJohn Levine
||||      | `* Re: Paper about ISO CBGB
||||      `- Re: Paper about ISO CEricP
|||+* Re: Paper about ISO CBranimir Maksimovic
|||+* Re: Paper about ISO CThomas Koenig
|||+* Re: Paper about ISO Cantispam
|||`- Re: Paper about ISO CQuadibloc
||+* Re: Paper about ISO CThomas Koenig
||`* Re: Paper about ISO CDavid Brown
|+* Re: Paper about ISO CThomas Koenig
|`* Re: Paper about ISO CVictor Yodaiken
`* Re: Paper about ISO CKent Dickey

Pages:123456789101112131415161718192021222324252627282930313233
Re: Inter-address space access, debugging [was Paper about ISO C]

<MDffJ.4738$6a3.1363@fx41.iad>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21775&group=comp.arch#21775

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!aioe.org!news.uzoreto.com!news-out.netnews.com!news.alt.net!fdc2.netnews.com!peer03.ams1!peer.ams1.xlned.com!news.xlned.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx41.iad.POSTED!not-for-mail
From: ThatWoul...@thevillage.com (EricP)
User-Agent: Thunderbird 2.0.0.24 (Windows/20100228)
MIME-Version: 1.0
Newsgroups: comp.arch
Subject: Re: Inter-address space access, debugging [was Paper about ISO C]
References: <87fstdumxd.fsf@hotmail.com> <cb6bbb41-398f-4e2a-9a19-08bc4582b291n@googlegroups.com> <sk437c$672$1@dont-email.me> <jwvee8qgxk0.fsf-monnier+comp.arch@gnu.org> <2021Oct12.185057@mips.complang.tuwien.ac.at> <jwvzgre88y4.fsf-monnier+comp.arch@gnu.org> <5f97b29e-e958-49e2-bb1c-c0e9870f9c2bn@googlegroups.com> <sku3dr$1hb$2@dont-email.me> <5d25afd4-0e2c-4e98-a457-a04be5ae88dbn@googlegroups.com> <sl3c2g$n4b$1@dont-email.me> <2021Oct25.195829@mips.complang.tuwien.ac.at> <sl79bl$jei$1@dont-email.me> <jwvh7d4twhd.fsf-monnier+comp.arch@gnu.org> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com> <sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad>
In-Reply-To: <IMEeJ.219$g81.51@fx19.iad>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 54
Message-ID: <MDffJ.4738$6a3.1363@fx41.iad>
X-Complaints-To: abuse@UsenetServer.com
NNTP-Posting-Date: Sat, 30 Oct 2021 17:52:44 UTC
Date: Sat, 30 Oct 2021 13:52:35 -0400
X-Received-Bytes: 4449
 by: EricP - Sat, 30 Oct 2021 17:52 UTC

EricP wrote:
> Michael S wrote:
>> On Thursday, October 28, 2021 at 7:19:19 PM UTC+3, EricP wrote:
>>> There are two other ways to do this kind of stuff which I quite like
>>> so I'll mention them (but I don't know if or how this could fit with
>>> your approaches).
>>> (1) First method is for threads to support what I call Thread
>>> Interrupt Procedures (TIP's), VMS calls them Asynchronous System
>>> Traps (AST's) and WinNT calls them Asynchronous Procedure Calls
>>> (APC's). It is all the same basic concept - a software interrupt to a
>>> thread.
>>> A TIP is a kernel packet that contains a pointer to a routine to call
>>> and 3 or 4 arguments to pass to that routine, and links for a list.
>>> Each thread has a list of pending TIPs. When a TIP is posted to a
>>> thread, if that thread is in a Wait state it wakes up to a Ready
>>> state. When the thread next runs, the first pending TIP is delivered
>>> in kernel mode and its routine is called passing the args,
>>> interrupting whatever the thread was doing, and blocking TIP delivery
>>> until it returns.
>>
>> I heard that AST on VMS works as you described. But APC on WNT does not.
>> Despite the name, WNT ASP is asynchronous only in a sense that
>> QueueUserAPC() call is non-blocking. But execution at destination
>> tread is synchronous and happens only after the target thread entered
>> an alertable state by SleepEx(), WaitForSingleObjectEx(),
>> WaitForMultipleObjectsEx() and couple of other similar syscalls.
>
> WinNT kernel mode APCs do work that way, as thread interrupts.
> Yes, the the _documented_ user mode "APCs" are actually synchronous,
> delivered only at certain points and only on request.
>
> There are undocumented user mode APCs that are asynchronous and delivered
> at the next user-kernel transition (which you can force so they behave
> like interrupts) but these are only available from within kernel mode
> and have been intentionally lobotomized so they can only deliver
> a program counter on MS approved list.

On a related note, I just noticed that Intel has added new instructions
for Adler Lake, a big-little architecture, some of which support
_user mode interrupts_.

Intel Architecture Instruction Set Extensions and Future Features May 2021
https://software.intel.com/content/dam/develop/external/us/en/documents-tps/architecture-instruction-set-extensions-programming-reference.pdf

They also add Advanced Matrix Extensions (AMX) that can do 2048 8-bit
multiply-adds per clock per core.

There is a video about Adler Lake microArch, first for Gracemont,
the little efficient one, and then for Golden Cove, the big
performance one. 2 hours 16 min
https://www.youtube.com/watch?v=3jU_YhZ1NQA&t=422s

Re: [OFFTOPIC] Re: Paper about ISO C

<slkicm$di8$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21776&group=comp.arch#21776

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: bak...@iitbombay.org (Bakul Shah)
Newsgroups: comp.arch
Subject: Re: [OFFTOPIC] Re: Paper about ISO C
Date: Sat, 30 Oct 2021 15:53:08 -0700
Organization: A noiseless patient Spider
Lines: 47
Message-ID: <slkicm$di8$1@dont-email.me>
References: <Gwh8J.222306$T_8.186929@fx48.iad>
<j6i3mg16rc2f6d99n7u64cmebnvq02d407@4ax.com>
<9a019b14-481d-4ae9-a8ae-8f0c2d94bf57n@googlegroups.com>
<vpc8mg9nie7kcpik5knob6po2rt4goftvd@4ax.com> <864k9mp8d2.fsf@linuxsc.com>
<sk4eht$b89$1@dont-email.me> <86ee8bibia.fsf@linuxsc.com>
<23a7aa84-1619-497c-b435-95d647cb0c62n@googlegroups.com>
<jwvmtmy75gb.fsf-monnier+comp.arch@gnu.org>
<249305d9-ef33-4751-a2b2-80afabc7e3a2n@googlegroups.com>
<sl5k9d$g20$1@newsreader4.netcologne.de>
<tanknght82p32476bocsneeaa985mgersh@4ax.com>
<0001HW.272B870700598F1B70000B74038F@news.individual.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Sat, 30 Oct 2021 22:53:11 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="24d824442b3cdf1c58debfe0aaf0ab97";
logging-data="13896"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/+jO+kKwBxXo5GJ/en6KaY"
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.16; rv:60.0)
Gecko/20100101 Firefox/60.0 SeaMonkey/2.53.9.1
Cancel-Lock: sha1:imFwe94sN+3JnYTmGxGhTueVVlo=
In-Reply-To: <0001HW.272B870700598F1B70000B74038F@news.individual.net>
 by: Bakul Shah - Sat, 30 Oct 2021 22:53 UTC

On 10/28/21 6:33 PM, Bill Findlay wrote:
> On 28 Oct 2021, George Neuner wrote
> (in article<tanknght82p32476bocsneeaa985mgersh@4ax.com>):
>
>> On Mon, 25 Oct 2021 08:53:31 +0200, Bernd Linsel
>> <bl1-removethis@gmx.com> wrote:
>>
>>> On 24.10.2021 19:38, MitchAlsup wrote:
>>>> On Sunday, October 24, 2021 at 11:31:59 AM UTC-5, Stefan Monnier wrote:
>>>> <
>>>> PL/1 had areas. If you deallocated an area, everything you allocated in
>>>> the area
>>>> "went away".
>>>
>>> Original Pascal: Mark(b); ... New(p); ... Dispose(b);
>>
>> Umm. My (perhaps faulty) recollection was that it was mark/release,
>> and that it was not in Wirth's Pascal originally, but rather was an
>> extension introduced by UCSD and then copied by others ... notably
>> Turbo.
>
> Correct.
>
> But truly original Pascal had "class" types, which were locally
> declarable areas within which new allocated storage on demand.
> Pointers were bound to classes, not to the types pointed-to.
> A class was completely deallocated on exit from its scope.

FYI: This was removed in about two years. According to Wirth's
recollections[1]:
Thereafter, the bootstrapping process could begin [Wirth, 1971b].
This compiler was completed by mid 1970, and at this time the
language definition was published. With the exception of some
slight revisions in 1972, it remained stable thereafter.

The revised report[2] came out in 1972 and mentions this:
The class structure is eliminated: pointer variables are bound
to a data type instead of a class variable.

I don't have a copy of Wirth's 1970 paper on Pascal so I don't quite
understand their semantics. I have a scanned listing of the original
CDC6000 pascal compiler but don't feel like wading through it to
figure out how class structures were implemented!

[1]
http://pascal.hansotten.com/niklaus-wirth/recollections-about-the-development-of-pascal/
[2] http://bernd-oppolzer.de/PascalReport.pdf

Re: Inter-address space access, debugging [was Paper about ISO C]

<b53ced53-a7f5-4133-8818-8367d24d9887n@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21777&group=comp.arch#21777

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a0c:8c81:: with SMTP id p1mr20244803qvb.7.1635651562553;
Sat, 30 Oct 2021 20:39:22 -0700 (PDT)
X-Received: by 2002:a05:6808:10c7:: with SMTP id s7mr20694745ois.30.1635651562365;
Sat, 30 Oct 2021 20:39:22 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sat, 30 Oct 2021 20:39:22 -0700 (PDT)
In-Reply-To: <MDffJ.4738$6a3.1363@fx41.iad>
Injection-Info: google-groups.googlegroups.com; posting-host=2001:56a:fa3a:e00:1df8:c2aa:1dbf:980d;
posting-account=1nOeKQkAAABD2jxp4Pzmx9Hx5g9miO8y
NNTP-Posting-Host: 2001:56a:fa3a:e00:1df8:c2aa:1dbf:980d
References: <87fstdumxd.fsf@hotmail.com> <cb6bbb41-398f-4e2a-9a19-08bc4582b291n@googlegroups.com>
<sk437c$672$1@dont-email.me> <jwvee8qgxk0.fsf-monnier+comp.arch@gnu.org>
<2021Oct12.185057@mips.complang.tuwien.ac.at> <jwvzgre88y4.fsf-monnier+comp.arch@gnu.org>
<5f97b29e-e958-49e2-bb1c-c0e9870f9c2bn@googlegroups.com> <sku3dr$1hb$2@dont-email.me>
<5d25afd4-0e2c-4e98-a457-a04be5ae88dbn@googlegroups.com> <sl3c2g$n4b$1@dont-email.me>
<2021Oct25.195829@mips.complang.tuwien.ac.at> <sl79bl$jei$1@dont-email.me>
<jwvh7d4twhd.fsf-monnier+comp.arch@gnu.org> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com>
<sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com>
<sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com>
<sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com>
<84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com>
<IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <b53ced53-a7f5-4133-8818-8367d24d9887n@googlegroups.com>
Subject: Re: Inter-address space access, debugging [was Paper about ISO C]
From: jsav...@ecn.ab.ca (Quadibloc)
Injection-Date: Sun, 31 Oct 2021 03:39:22 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 7
 by: Quadibloc - Sun, 31 Oct 2021 03:39 UTC

On Saturday, October 30, 2021 at 11:52:47 AM UTC-6, EricP wrote:

> They also add Advanced Matrix Extensions (AMX) that can do 2048 8-bit
> multiply-adds per clock per core.

This sounds exciting.

John Savard

Architectural extensions (was: Inter-address space access ...)

<2021Oct31.104940@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21779&group=comp.arch#21779

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Architectural extensions (was: Inter-address space access ...)
Date: Sun, 31 Oct 2021 09:49:40 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 43
Message-ID: <2021Oct31.104940@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com> <sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="10071"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/4bXdoyyPxFdqWV34Jz3fL"
Cancel-Lock: sha1:oHpqKCR/SFSJsm+7VtY2K1/5oWg=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 09:49 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>Intel Architecture Instruction Set Extensions and Future Features May 2021
>https://software.intel.com/content/dam/develop/external/us/en/documents-tps/architecture-instruction-set-extensions-programming-reference.pdf

A 214-page document describing various architectural features
(although lot all relevant for user-level code), some of them
additions that are designed to play with earlier extensions.

One problem is that with a base architecture and optional extensions
using the extensions means that you should write a base version and a
version with the extensions (and given that Intel disables extensions
for market segmentation reasons, yes, you really have to do that).
ARM is somewhat better here by defining architecture levels (ARM v8-A,
8.1 ..., ARM v9-A, ...), although the benefit is somewhat lessened by
having so many levels and so confusingly named (e.g., the headline
feature of ARM v8 seemed to be the Aarch64 or A64 instruction set, but
ARM v8-M does not include this instruction set).

Another problem is getting an overview. So you have these piecewise
documents and you have the big tome that lists the instructions
albhabetically, but no good overview of specific groups of
instructions (at least not for all); in particular, I recently wanted
to learn about AVX-512, because the Tiger Lake in my laptop has that.
I found some application notes from Intel that give an example of the
AVX-512 stuff in the Xeon Phi CPUs, but little else. My guess is that
AVX-512 in mainstream CPUs is mostly AVX2, plus the predicate stuff
(for which I did not find an overview), plus the instructions
discussed in the application note.

>There is a video about Adler Lake microArch, first for Gracemont,
>the little efficient one, and then for Golden Cove, the big
>performance one. 2 hours 16 min
>https://www.youtube.com/watch?v=3jU_YhZ1NQA&t=422s

I wonder if such a video is cheaper to produce than a written
document. I certainly find a written document much preferable, so
much that I will probably not watch this video, while I would read a
paper.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions (was: Inter-address space access ...)

<slltnv$ed1$1@newsreader4.netcologne.de>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21780&group=comp.arch#21780

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!paganini.bofh.team!news.dns-netz.com!news.freedyn.net!newsreader4.netcologne.de!news.netcologne.de!.POSTED.2001-4dd7-5748-0-7285-c2ff-fe6c-992d.ipv6dyn.netcologne.de!not-for-mail
From: tkoe...@netcologne.de (Thomas Koenig)
Newsgroups: comp.arch
Subject: Re: Architectural extensions (was: Inter-address space access ...)
Date: Sun, 31 Oct 2021 11:13:03 -0000 (UTC)
Organization: news.netcologne.de
Distribution: world
Message-ID: <slltnv$ed1$1@newsreader4.netcologne.de>
References: <87fstdumxd.fsf@hotmail.com>
<8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com>
<sl99b3$q2h$1@dont-email.me>
<a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com>
<sl9c25$jdu$1@dont-email.me>
<69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com>
<sla5tu$6sh$1@dont-email.me>
<3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com>
<84AeJ.12005$ya3.4258@fx38.iad>
<32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com>
<IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad>
<2021Oct31.104940@mips.complang.tuwien.ac.at>
Injection-Date: Sun, 31 Oct 2021 11:13:03 -0000 (UTC)
Injection-Info: newsreader4.netcologne.de; posting-host="2001-4dd7-5748-0-7285-c2ff-fe6c-992d.ipv6dyn.netcologne.de:2001:4dd7:5748:0:7285:c2ff:fe6c:992d";
logging-data="14753"; mail-complaints-to="abuse@netcologne.de"
User-Agent: slrn/1.0.3 (Linux)
 by: Thomas Koenig - Sun, 31 Oct 2021 11:13 UTC

Anton Ertl <anton@mips.complang.tuwien.ac.at> schrieb:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>>Intel Architecture Instruction Set Extensions and Future Features May 2021
>>https://software.intel.com/content/dam/develop/external/us/en/documents-tps/architecture-instruction-set-extensions-programming-reference.pdf
>
> A 214-page document describing various architectural features
> (although lot all relevant for user-level code), some of them
> additions that are designed to play with earlier extensions.
>
> One problem is that with a base architecture and optional extensions
> using the extensions means that you should write a base version and a
> version with the extensions (and given that Intel disables extensions
> for market segmentation reasons, yes, you really have to do that).

Which is a PITA for everybody involved. This sort of thing only
makes some sense for people who compile their own code, which very
few people do.

But even so, compilers then have to offer a host of options,
both for "tune for this architecture, but make sure it runs
everywhere" and for "you can assume you can use any feature of
this architecture". It is not surprising that this is a continuous
source of (performance) bugs.

Finally, for people who distribute binaries - they may decide
to do as you described above, but that they can hardly expect to
have a system with every microarchitecture around, and generating
x binaries from a single source and checking them for performance
is a bit of a nightmare.

I guess Intel is hoping for vendor lock here, but so far it does
not appear to have worked.

Re: Architectural extensions

<rCxfJ.4565$JZ3.1536@fx05.iad>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21783&group=comp.arch#21783

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!newsreader4.netcologne.de!news.netcologne.de!peer03.ams1!peer.ams1.xlned.com!news.xlned.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx05.iad.POSTED!not-for-mail
From: ThatWoul...@thevillage.com (EricP)
User-Agent: Thunderbird 2.0.0.24 (Windows/20100228)
MIME-Version: 1.0
Newsgroups: comp.arch
Subject: Re: Architectural extensions
References: <87fstdumxd.fsf@hotmail.com> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com> <sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at>
In-Reply-To: <2021Oct31.104940@mips.complang.tuwien.ac.at>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 29
Message-ID: <rCxfJ.4565$JZ3.1536@fx05.iad>
X-Complaints-To: abuse@UsenetServer.com
NNTP-Posting-Date: Sun, 31 Oct 2021 14:20:07 UTC
Date: Sun, 31 Oct 2021 10:19:57 -0400
X-Received-Bytes: 2213
 by: EricP - Sun, 31 Oct 2021 14:19 UTC

Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>
>> There is a video about Adler Lake microArch, first for Gracemont,
>> the little efficient one, and then for Golden Cove, the big
>> performance one. 2 hours 16 min
>> https://www.youtube.com/watch?v=3jU_YhZ1NQA&t=422s
>
> I wonder if such a video is cheaper to produce than a written
> document. I certainly find a written document much preferable, so
> much that I will probably not watch this video, while I would read a
> paper.
>
> - anton

I agree, I would prefer a document too but I stumbled on it
so just passing it along.

The above look like a collection of talks they would have given
if we were still having conferences and covered more topics than
just the processor cores.

Web page version:

Intel Architecture Day 2021:
Alder Lake, Golden Cove, and Gracemont Detailed, 19-Aug-2021
https://www.anandtech.com/show/16881/a-deep-dive-into-intels-alder-lake-microarchitectures

Re: Inter-address space access, debugging [was Paper about ISO C]

<de489717-d523-41a3-b09e-98bebd06838dn@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21786&group=comp.arch#21786

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:5bc4:: with SMTP id t4mr8328165qvt.3.1635692211498;
Sun, 31 Oct 2021 07:56:51 -0700 (PDT)
X-Received: by 2002:a05:6808:2106:: with SMTP id r6mr12646626oiw.110.1635692211274;
Sun, 31 Oct 2021 07:56:51 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!border1.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 31 Oct 2021 07:56:51 -0700 (PDT)
In-Reply-To: <b53ced53-a7f5-4133-8818-8367d24d9887n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=2001:56a:fa3a:e00:7902:a783:1983:ac76;
posting-account=1nOeKQkAAABD2jxp4Pzmx9Hx5g9miO8y
NNTP-Posting-Host: 2001:56a:fa3a:e00:7902:a783:1983:ac76
References: <87fstdumxd.fsf@hotmail.com> <cb6bbb41-398f-4e2a-9a19-08bc4582b291n@googlegroups.com>
<sk437c$672$1@dont-email.me> <jwvee8qgxk0.fsf-monnier+comp.arch@gnu.org>
<2021Oct12.185057@mips.complang.tuwien.ac.at> <jwvzgre88y4.fsf-monnier+comp.arch@gnu.org>
<5f97b29e-e958-49e2-bb1c-c0e9870f9c2bn@googlegroups.com> <sku3dr$1hb$2@dont-email.me>
<5d25afd4-0e2c-4e98-a457-a04be5ae88dbn@googlegroups.com> <sl3c2g$n4b$1@dont-email.me>
<2021Oct25.195829@mips.complang.tuwien.ac.at> <sl79bl$jei$1@dont-email.me>
<jwvh7d4twhd.fsf-monnier+comp.arch@gnu.org> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com>
<sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com>
<sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com>
<sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com>
<84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com>
<IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <b53ced53-a7f5-4133-8818-8367d24d9887n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <de489717-d523-41a3-b09e-98bebd06838dn@googlegroups.com>
Subject: Re: Inter-address space access, debugging [was Paper about ISO C]
From: jsav...@ecn.ab.ca (Quadibloc)
Injection-Date: Sun, 31 Oct 2021 14:56:51 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 13
 by: Quadibloc - Sun, 31 Oct 2021 14:56 UTC

On Saturday, October 30, 2021 at 9:39:23 PM UTC-6, Quadibloc wrote:
> On Saturday, October 30, 2021 at 11:52:47 AM UTC-6, EricP wrote:

> > They also add Advanced Matrix Extensions (AMX) that can do 2048 8-bit
> > multiply-adds per clock per core.

> This sounds exciting.

It's not as exciting as I hoped. Apparently, it only supports operations on
either bytes or short floating-point numbers in BF16 format.

Of course, I was hoping for FP64.

John Savard

Re: Architectural extensions

<tlzfJ.7101$lz3.1534@fx34.iad>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21788&group=comp.arch#21788

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!aioe.org!news.uzoreto.com!news-out.netnews.com!news.alt.net!fdc2.netnews.com!peer02.ams1!peer.ams1.xlned.com!news.xlned.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx34.iad.POSTED!not-for-mail
From: ThatWoul...@thevillage.com (EricP)
User-Agent: Thunderbird 2.0.0.24 (Windows/20100228)
MIME-Version: 1.0
Newsgroups: comp.arch
Subject: Re: Architectural extensions
References: <87fstdumxd.fsf@hotmail.com> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com> <sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at>
In-Reply-To: <2021Oct31.104940@mips.complang.tuwien.ac.at>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 34
Message-ID: <tlzfJ.7101$lz3.1534@fx34.iad>
X-Complaints-To: abuse@UsenetServer.com
NNTP-Posting-Date: Sun, 31 Oct 2021 16:18:33 UTC
Date: Sun, 31 Oct 2021 12:18:13 -0400
X-Received-Bytes: 3161
 by: EricP - Sun, 31 Oct 2021 16:18 UTC

Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> Intel Architecture Instruction Set Extensions and Future Features May 2021
>> https://software.intel.com/content/dam/develop/external/us/en/documents-tps/architecture-instruction-set-extensions-programming-reference.pdf
>
> A 214-page document describing various architectural features
> (although lot all relevant for user-level code), some of them
> additions that are designed to play with earlier extensions.
>
> One problem is that with a base architecture and optional extensions
> using the extensions means that you should write a base version and a
> version with the extensions (and given that Intel disables extensions
> for market segmentation reasons, yes, you really have to do that).
> ARM is somewhat better here by defining architecture levels (ARM v8-A,
> 8.1 ..., ARM v9-A, ...), although the benefit is somewhat lessened by
> having so many levels and so confusingly named (e.g., the headline
> feature of ARM v8 seemed to be the Aarch64 or A64 instruction set, but
> ARM v8-M does not include this instruction set).
>
> Another problem is getting an overview. So you have these piecewise
> documents and you have the big tome that lists the instructions
> albhabetically, but no good overview of specific groups of
> instructions (at least not for all); in particular, I recently wanted
> to learn about AVX-512, because the Tiger Lake in my laptop has that.
> I found some application notes from Intel that give an example of the
> AVX-512 stuff in the Xeon Phi CPUs, but little else. My guess is that
> AVX-512 in mainstream CPUs is mostly AVX2, plus the predicate stuff
> (for which I did not find an overview), plus the instructions
> discussed in the application note.

Adler Lake apparently disables AVX-512 for both desktop and mobile.
It's not clear why. It remains for "data centers", presumably Xeon.

Re: Architectural extensions

<slmhfs$rfa$2@newsreader4.netcologne.de>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21791&group=comp.arch#21791

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!paganini.bofh.team!news.dns-netz.com!news.freedyn.net!newsreader4.netcologne.de!news.netcologne.de!.POSTED.2001-4dd7-5748-0-7285-c2ff-fe6c-992d.ipv6dyn.netcologne.de!not-for-mail
From: tkoe...@netcologne.de (Thomas Koenig)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Sun, 31 Oct 2021 16:50:04 -0000 (UTC)
Organization: news.netcologne.de
Distribution: world
Message-ID: <slmhfs$rfa$2@newsreader4.netcologne.de>
References: <87fstdumxd.fsf@hotmail.com>
<8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com>
<sl99b3$q2h$1@dont-email.me>
<a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com>
<sl9c25$jdu$1@dont-email.me>
<69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com>
<sla5tu$6sh$1@dont-email.me>
<3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com>
<84AeJ.12005$ya3.4258@fx38.iad>
<32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com>
<IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad>
<2021Oct31.104940@mips.complang.tuwien.ac.at>
<tlzfJ.7101$lz3.1534@fx34.iad>
Injection-Date: Sun, 31 Oct 2021 16:50:04 -0000 (UTC)
Injection-Info: newsreader4.netcologne.de; posting-host="2001-4dd7-5748-0-7285-c2ff-fe6c-992d.ipv6dyn.netcologne.de:2001:4dd7:5748:0:7285:c2ff:fe6c:992d";
logging-data="28138"; mail-complaints-to="abuse@netcologne.de"
User-Agent: slrn/1.0.3 (Linux)
 by: Thomas Koenig - Sun, 31 Oct 2021 16:50 UTC

EricP <ThatWouldBeTelling@thevillage.com> schrieb:

> Adler Lake apparently disables AVX-512 for both desktop and mobile.
> It's not clear why. It remains for "data centers", presumably Xeon.

I assume they disable it because it has rather turned out to be
a fiasco because they had to clock down the whole CPU to use
it or risk overheating.

Maybe they assume better cooling on CPUs for data centers.

Re: Architectural extensions

<2021Oct31.184628@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21792&group=comp.arch#21792

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Sun, 31 Oct 2021 17:46:28 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 18
Message-ID: <2021Oct31.184628@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at> <tlzfJ.7101$lz3.1534@fx34.iad>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="22140"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+WbwNIMbWsCTviPwj5qM+8"
Cancel-Lock: sha1:zZ99q7cqVfX5t8twPlWbfMZBNwg=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 17:46 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>Adler Lake apparently disables AVX-512 for both desktop and mobile.
>It's not clear why.

The speculation I have read is that Gracemont (the efficiency core)
does not have AVX-512, so they also disabled it for Golden Cove (the
performance core).

>It remains for "data centers", presumably Xeon.

Either they disable Gracemont there (or, for those CPUs that don't use
the same die, don't have Gracemont), or the idea is that data center
OSs have schedulers that can deal with heterogeneous instruction sets.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<2021Oct31.185326@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21793&group=comp.arch#21793

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Sun, 31 Oct 2021 17:53:26 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 33
Distribution: world
Message-ID: <2021Oct31.185326@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at> <tlzfJ.7101$lz3.1534@fx34.iad> <slmhfs$rfa$2@newsreader4.netcologne.de>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="22060"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/YRnAOvU5qRFEmvTk4kEpU"
Cancel-Lock: sha1:SsP3d8nlcVcvsrO5koueWqNs1S8=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 17:53 UTC

Thomas Koenig <tkoenig@netcologne.de> writes:
>EricP <ThatWouldBeTelling@thevillage.com> schrieb:
>
>> Adler Lake apparently disables AVX-512 for both desktop and mobile.
>> It's not clear why. It remains for "data centers", presumably Xeon.
>
>I assume they disable it because it has rather turned out to be
>a fiasco because they had to clock down the whole CPU to use
>it or risk overheating.

Looking at the Ice Lake results on
<https://travisdowns.github.io/blog/2020/08/19/icl-avx512-freq.html>,
the downclocking seems to be not very pronounced on Ice Lake even for
low-power CPUs. I have to measure it on my Tiger Lake at some point.

>Maybe they assume better cooling on CPUs for data centers.

Certainly on Skylake things were the other way round: desktop CPUs
that used AVX2 did not downclock while server CPUs did. My guess is
that this is because server customers are more power-conscious than
desktop customers. E.g., you can see how AVX2 affects the Xeon Gold
5120 (a Skylake-server CPU) at
<https://stackoverflow.com/questions/56852812/simd-instructions-lowering-cpu-frequency/56861355#56861355>.
So if you have load on 4 cores, the Core i7-6700K client CPU runs it
at 4200MHz Turbo whether the code uses AVX2 or not, while the Xeon
Gold 5120 server CPU runs it at 3000MHz without AVX2, and at 2900MHz
if the code contains AVX2 instructions (in the last ms or so). The
AVX-512 slowdown is bigger, but the client CPU does not have AVX-512.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions (was: Inter-address space access ...)

<2021Oct31.192349@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21795&group=comp.arch#21795

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions (was: Inter-address space access ...)
Date: Sun, 31 Oct 2021 18:23:49 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 43
Message-ID: <2021Oct31.192349@mips.complang.tuwien.ac.at>
References: <slltnv$ed1$1@newsreader4.netcologne.de> <memo.20211031145426.12464o@jgd.cix.co.uk>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="31797"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19s8+oI9I49poHXUb66NIvQ"
Cancel-Lock: sha1:NsB5xqx6iLn5um7EyWDm2lR1Vyw=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 18:23 UTC

jgd@cix.co.uk (John Dallman) writes:
>In article <slltnv$ed1$1@newsreader4.netcologne.de>,
>tkoenig@netcologne.de (Thomas Koenig) wrote:
>> I guess Intel is hoping for vendor lock here, but so far it does
>> not appear to have worked.

Of course it does not work, because Intel fails to provide these
extensions on Intel CPUs. Buy a Pentium or Celeron, and AVX is
disabled, even if the die has it.

So I don't think that Intel is hoping for vendor lock-in, they are
trying to do market segmentation. But my impression is that it is not
working. Nobody buys a more expensive Core i3 over a Pentium because
the Core i3 has AVX and the Pentium does not. They buy the Core i3
because

1) Intel was very successful at marketing the names. Even people who
should be in the know act as if a Core i7 was better than a Core i5
even when comparing between generations (where it is definitely not
the case).

2) it offers more cores or a higher clock rate or
hyperthreading (or several of those).

3) they can afford it (this is actually reason 1 in dusguise).

>Intel seems to operate under some hardware-centric assumptions. These
>make good sense for firmware, but much less for application software:

I don't what they are doing makes any sense for PC firmware, either.
The firmware is on the mainboard, and you can put in anything from a
Celeron (without AVX) to a Core i9 (with AVX-512 on Rocket Lake), so
the firmware will not use AVX.

By contrast, AMD has had AVX support in all CPUs since Bulldozer
(2011) and Jaguar (2013) and AVX2 support since Excavator (2015). If
everybody had acted like AMD, AVX and AVX2 could become base features
at some point.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<2021Oct31.194638@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21796&group=comp.arch#21796

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Sun, 31 Oct 2021 18:46:38 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 34
Message-ID: <2021Oct31.194638@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at> <rCxfJ.4565$JZ3.1536@fx05.iad>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="31797"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18mhHL/gLew7gru3qKeDF5c"
Cancel-Lock: sha1:eBgSlKD7naFIxtFkpWz3BcpqoQU=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 18:46 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>Anton Ertl wrote:
>> EricP <ThatWouldBeTelling@thevillage.com> writes:
>>
>>> There is a video about Adler Lake microArch, first for Gracemont,
>>> the little efficient one, and then for Golden Cove, the big
>>> performance one. 2 hours 16 min
>>> https://www.youtube.com/watch?v=3jU_YhZ1NQA&t=422s
>>
>> I wonder if such a video is cheaper to produce than a written
>> document. I certainly find a written document much preferable, so
>> much that I will probably not watch this video, while I would read a
>> paper.
>>
>> - anton
>
>I agree, I would prefer a document too but I stumbled on it
>so just passing it along.

Thank you.

>Web page version:
>
>Intel Architecture Day 2021:
>Alder Lake, Golden Cove, and Gracemont Detailed, 19-Aug-2021
>https://www.anandtech.com/show/16881/a-deep-dive-into-intels-alder-lake-microarchitectures

And the video-averse people like me have to thank Anandtech for
putting it in writing.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions (was: Inter-address space access ...)

<b04d29a8-c454-4e98-8ec7-deca9996b5d3n@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21798&group=comp.arch#21798

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:620a:818:: with SMTP id s24mr19627481qks.395.1635707782782;
Sun, 31 Oct 2021 12:16:22 -0700 (PDT)
X-Received: by 2002:a05:6830:2b1f:: with SMTP id l31mr9181790otv.333.1635707782586;
Sun, 31 Oct 2021 12:16:22 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 31 Oct 2021 12:16:22 -0700 (PDT)
In-Reply-To: <2021Oct31.192349@mips.complang.tuwien.ac.at>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:1700:291:29f0:7d65:9bc5:f275:a261;
posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 2600:1700:291:29f0:7d65:9bc5:f275:a261
References: <slltnv$ed1$1@newsreader4.netcologne.de> <memo.20211031145426.12464o@jgd.cix.co.uk>
<2021Oct31.192349@mips.complang.tuwien.ac.at>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <b04d29a8-c454-4e98-8ec7-deca9996b5d3n@googlegroups.com>
Subject: Re: Architectural extensions (was: Inter-address space access ...)
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Sun, 31 Oct 2021 19:16:22 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 50
 by: MitchAlsup - Sun, 31 Oct 2021 19:16 UTC

On Sunday, October 31, 2021 at 1:44:10 PM UTC-5, Anton Ertl wrote:
> j...@cix.co.uk (John Dallman) writes:
> >In article <slltnv$ed1$1...@newsreader4.netcologne.de>,
> >tko...@netcologne.de (Thomas Koenig) wrote:
> >> I guess Intel is hoping for vendor lock here, but so far it does
> >> not appear to have worked.
> Of course it does not work, because Intel fails to provide these
> extensions on Intel CPUs. Buy a Pentium or Celeron, and AVX is
> disabled, even if the die has it.
>
> So I don't think that Intel is hoping for vendor lock-in, they are
> trying to do market segmentation. But my impression is that it is not
> working. Nobody buys a more expensive Core i3 over a Pentium because
> the Core i3 has AVX and the Pentium does not. They buy the Core i3
> because
>
> 1) Intel was very successful at marketing the names. Even people who
> should be in the know act as if a Core i7 was better than a Core i5
> even when comparing between generations (where it is definitely not
> the case).
>
> 2) it offers more cores or a higher clock rate or
> hyperthreading (or several of those).
>
> 3) they can afford it (this is actually reason 1 in dusguise).
> >Intel seems to operate under some hardware-centric assumptions. These
> >make good sense for firmware, but much less for application software:
<
> I don't what they are doing makes any sense for PC firmware, either.
> The firmware is on the mainboard, and you can put in anything from a
> Celeron (without AVX) to a Core i9 (with AVX-512 on Rocket Lake), so
> the firmware will not use AVX.
<
Firmware would simply::
a) ignore the extra instructions
b) use CPUID to figure out which instructions it can use.
More likely a than b.
>
> By contrast, AMD has had AVX support in all CPUs since Bulldozer
> (2011) and Jaguar (2013) and AVX2 support since Excavator (2015). If
> everybody had acted like AMD, AVX and AVX2 could become base features
> at some point.
<
This is like pointing out that every car-radio has the same electronics inside
(since about 1995) and they "customize" or "downgrade" based on (essentially
fuses inside the radio) what the model number is.
<
> - anton
> --
> 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
> Mitch Alsup, <c17fcd89-f024-40e7...@googlegroups.com>

Re: Architectural extensions

<2021Oct31.230648@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21800&group=comp.arch#21800

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Sun, 31 Oct 2021 22:06:48 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 35
Message-ID: <2021Oct31.230648@mips.complang.tuwien.ac.at>
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk>
Injection-Info: reader02.eternal-september.org; posting-host="fe77bbe53a44e15cab56ae824db7f3cb";
logging-data="13952"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/HxuO5atJ39N0mzgH3heZV"
Cancel-Lock: sha1:geV9DXTX1EvEVds+jC0d+tjG5c4=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Sun, 31 Oct 2021 22:06 UTC

jgd@cix.co.uk (John Dallman) writes:
>In article <2021Oct31.184628@mips.complang.tuwien.ac.at>,
>anton@mips.complang.tuwien.ac.at (Anton Ertl) wrote:
>
>> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> >It remains for "data centers", presumably Xeon.
>>
>> Either they disable Gracemont there (or, for those CPUs that don't
>> use the same die, don't have Gracemont), or the idea is that data
>> center OSs have schedulers that can deal with heterogeneous
>> instruction sets.
>
>The scheduler approach sets a very high bar for the scheduler in never,
>ever, letting a process that uses AVX-512 onto a Gracemont core, because
>SIGIL spoils your whole day.

Does not sound so difficult to me. I can tell Linux with taskset to
never, ever use core 3 or 4 for a certain process.

The more difficult thing in practice is to know whether a program uses
AVX-512. Either these programs are marked as requiring a separate
architecture, or the OS runs it until the CPU traps on an unsupported
instruction; then the OS sets the CPU affinity of the thread to Golden
Cove cores only, migrates the thread to an appropriate core, and lets
it continue.

Another problem is that AVX-512 state has more and longer registers,
so if you want to support Golden Cove -> Gracemont migrations, you
would need to know whether any of the additional state is in use (then
you probably don't want such a migration anyway)

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<b8c39eff-e9cd-44ec-aa05-1ff32ead902dn@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21801&group=comp.arch#21801

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ac8:5f54:: with SMTP id y20mr26963245qta.324.1635738167365; Sun, 31 Oct 2021 20:42:47 -0700 (PDT)
X-Received: by 2002:a05:6808:190d:: with SMTP id bf13mr18171370oib.7.1635738167139; Sun, 31 Oct 2021 20:42:47 -0700 (PDT)
Path: i2pn2.org!i2pn.org!aioe.org!news.uzoreto.com!tr1.eu1.usenetexpress.com!feeder.usenetexpress.com!tr2.iad1.usenetexpress.com!border1.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 31 Oct 2021 20:42:46 -0700 (PDT)
In-Reply-To: <tlzfJ.7101$lz3.1534@fx34.iad>
Injection-Info: google-groups.googlegroups.com; posting-host=2001:56a:fa3a:e00:7902:a783:1983:ac76; posting-account=1nOeKQkAAABD2jxp4Pzmx9Hx5g9miO8y
NNTP-Posting-Host: 2001:56a:fa3a:e00:7902:a783:1983:ac76
References: <87fstdumxd.fsf@hotmail.com> <8f6f1f13-9cbc-4142-ba79-8a0eba101523n@googlegroups.com> <sl99b3$q2h$1@dont-email.me> <a80e2b7f-fc38-49cf-8b66-86df4eb3643cn@googlegroups.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at> <tlzfJ.7101$lz3.1534@fx34.iad>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <b8c39eff-e9cd-44ec-aa05-1ff32ead902dn@googlegroups.com>
Subject: Re: Architectural extensions
From: jsav...@ecn.ab.ca (Quadibloc)
Injection-Date: Mon, 01 Nov 2021 03:42:47 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 18
 by: Quadibloc - Mon, 1 Nov 2021 03:42 UTC

On Sunday, October 31, 2021 at 10:18:36 AM UTC-6, EricP wrote:

> Adler Lake apparently disables AVX-512 for both desktop and mobile.
> It's not clear why. It remains for "data centers", presumably Xeon.

The problem with Alder Lake is that only the performance cores, and not
the efficiency cores, support AVX-512.
Thus, it is necessary to either disable the efficiency cores, or disable
AVX-512, for the computer to run properly, until the operating system is
updated to distinguish between programs that use AVX-512 and programs
that don't, restricting the former to the performance cores only.
I'm very disappointed in this, as I consider AVX-512 a very important
feature, as under the right circumstances it can almost double a program's
performance.
Since the Atom-based cores in the Xeon Phi were where AVX-512
originated, it should have been possible to include AVX-512 support
on the E-cores of Aldler Lake to avoid this horrendous compromise.

John Savard

Re: Architectural extensions

<328f1496-cff7-4928-8c7f-375416eafa86n@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21802&group=comp.arch#21802

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a37:8ec6:: with SMTP id q189mr20982072qkd.145.1635738555455;
Sun, 31 Oct 2021 20:49:15 -0700 (PDT)
X-Received: by 2002:a05:6808:1444:: with SMTP id x4mr17779290oiv.157.1635738555272;
Sun, 31 Oct 2021 20:49:15 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Sun, 31 Oct 2021 20:49:15 -0700 (PDT)
In-Reply-To: <memo.20211031195903.12464q@jgd.cix.co.uk>
Injection-Info: google-groups.googlegroups.com; posting-host=2001:56a:fa3a:e00:7902:a783:1983:ac76;
posting-account=1nOeKQkAAABD2jxp4Pzmx9Hx5g9miO8y
NNTP-Posting-Host: 2001:56a:fa3a:e00:7902:a783:1983:ac76
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <328f1496-cff7-4928-8c7f-375416eafa86n@googlegroups.com>
Subject: Re: Architectural extensions
From: jsav...@ecn.ab.ca (Quadibloc)
Injection-Date: Mon, 01 Nov 2021 03:49:15 +0000
Content-Type: text/plain; charset="UTF-8"
 by: Quadibloc - Mon, 1 Nov 2021 03:49 UTC

On Sunday, October 31, 2021 at 1:59:06 PM UTC-6, John Dallman wrote:

> The scheduler approach sets a very high bar for the scheduler in never,
> ever, letting a process that uses AVX-512 onto a Gracemont core, because
> SIGIL spoils your whole day.

Calling working properly a "very high bar" may well be borne out
by practical experience with the quality of certain operating
systems and their updates, but, really, it is not. It is a minimum
acceptable standard.

Since the efficiency cores avoid wasted electricity and excess heat,
and AVX-512 can almost double the performance of some programs,
they're both features that are too important to do without. The
scheduler approach is the one that allows one to have both of them,
so it is incumbent on operating system writers to define an alternate
format for executables using AVX-512 to give the scheduler the information
it needs.

John Savard

Re: Architectural extensions

<zlLfJ.4920$cW6.3012@fx08.iad>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21803&group=comp.arch#21803

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!ecngs!feeder2.ecngs.de!178.20.174.213.MISMATCH!feeder1.feed.usenet.farm!feed.usenet.farm!newsfeed.xs4all.nl!newsfeed7.news.xs4all.nl!news-out.netnews.com!news.alt.net!fdc2.netnews.com!peer02.ams1!peer.ams1.xlned.com!news.xlned.com!peer02.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx08.iad.POSTED!not-for-mail
From: ThatWoul...@thevillage.com (EricP)
User-Agent: Thunderbird 2.0.0.24 (Windows/20100228)
MIME-Version: 1.0
Newsgroups: comp.arch
Subject: Re: Architectural extensions
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk> <2021Oct31.230648@mips.complang.tuwien.ac.at>
In-Reply-To: <2021Oct31.230648@mips.complang.tuwien.ac.at>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 49
Message-ID: <zlLfJ.4920$cW6.3012@fx08.iad>
X-Complaints-To: abuse@UsenetServer.com
NNTP-Posting-Date: Mon, 01 Nov 2021 05:57:51 UTC
Date: Mon, 01 Nov 2021 01:57:20 -0400
X-Received-Bytes: 2906
 by: EricP - Mon, 1 Nov 2021 05:57 UTC

Anton Ertl wrote:
> jgd@cix.co.uk (John Dallman) writes:
>> In article <2021Oct31.184628@mips.complang.tuwien.ac.at>,
>> anton@mips.complang.tuwien.ac.at (Anton Ertl) wrote:
>>
>>> EricP <ThatWouldBeTelling@thevillage.com> writes:
>>>> It remains for "data centers", presumably Xeon.
>>> Either they disable Gracemont there (or, for those CPUs that don't
>>> use the same die, don't have Gracemont), or the idea is that data
>>> center OSs have schedulers that can deal with heterogeneous
>>> instruction sets.
>> The scheduler approach sets a very high bar for the scheduler in never,
>> ever, letting a process that uses AVX-512 onto a Gracemont core, because
>> SIGIL spoils your whole day.
>
> Does not sound so difficult to me. I can tell Linux with taskset to
> never, ever use core 3 or 4 for a certain process.
>
> The more difficult thing in practice is to know whether a program uses
> AVX-512. Either these programs are marked as requiring a separate
> architecture, or the OS runs it until the CPU traps on an unsupported
> instruction; then the OS sets the CPU affinity of the thread to Golden
> Cove cores only, migrates the thread to an appropriate core, and lets
> it continue.
>
> Another problem is that AVX-512 state has more and longer registers,
> so if you want to support Golden Cove -> Gracemont migrations, you
> would need to know whether any of the additional state is in use (then
> you probably don't want such a migration anyway)
>
> - anton

An OS exception handler could easily diagnose an AVX-512
instruction and re-route the thread to the big core.
Cpus have been doing this kind of stuff for a very long time.

Also this sounds similar to what the Thread Director technology does.
It dynamically measures performance and instruction set usage,
and works with the OS to move threads to the appropriate cores
for the work load.

See time index 36min 50sec

https://www.youtube.com/watch?v=3jU_YhZ1NQA&t=422s

So they appear to have multiple the tools necessary
to support asymmetric instruction sets.

Re: Architectural extensions

<2021Nov1.141753@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21806&group=comp.arch#21806

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Mon, 01 Nov 2021 13:17:53 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 23
Message-ID: <2021Nov1.141753@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <sl9c25$jdu$1@dont-email.me> <69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me> <3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad> <32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad> <MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at> <tlzfJ.7101$lz3.1534@fx34.iad> <b8c39eff-e9cd-44ec-aa05-1ff32ead902dn@googlegroups.com>
Injection-Info: reader02.eternal-september.org; posting-host="0d65796f621b5a49322afdb7c8e7fd32";
logging-data="21477"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18uAmeELWglo9nBCuVzIGhU"
Cancel-Lock: sha1:yhJ5eXXYEQqU9e1wLH7YZhUhWJ0=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Mon, 1 Nov 2021 13:17 UTC

Quadibloc <jsavard@ecn.ab.ca> writes:
>Since the Atom-based cores in the Xeon Phi were where AVX-512
>originated, it should have been possible to include AVX-512 support
>on the E-cores of Aldler Lake to avoid this horrendous compromise.

The most recent Xeon Phi is Silvermont-based and supports a different
AVX-512 than Golden Cove. Silvermont is several generations behind
Gracemont: Silvermont, Goldmont, Goldmont+, Tremont, Gracemont. It's
probably non-trivial to adapt Xeon Phi AVX-512 to Gracemont and extend
it to the full extent of present-day AVX-512. This was also in a
larger process, and was not designed in a way that makes shrinking
easy. And the main idea of Gracemont is to be small and consume
little power, and Xeon Phi AVX-512 units were probably designed more
for performance.

An alternative would be to implement AVX-512 with narrower units (like
AMD implemented AVX-256 before Zen 2, including on its small and
efficient cores). But for some reason Intel did not go there.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<2021Nov1.142922@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21807&group=comp.arch#21807

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Mon, 01 Nov 2021 13:29:22 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 38
Message-ID: <2021Nov1.142922@mips.complang.tuwien.ac.at>
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk> <2021Oct31.230648@mips.complang.tuwien.ac.at> <zlLfJ.4920$cW6.3012@fx08.iad>
Injection-Info: reader02.eternal-september.org; posting-host="0d65796f621b5a49322afdb7c8e7fd32";
logging-data="17648"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/bPZNEHCwAh5sBVawmtzq1"
Cancel-Lock: sha1:RLtI2qJ6Lb217Q5ataxTpMAP9Hg=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Mon, 1 Nov 2021 13:29 UTC

EricP <ThatWouldBeTelling@thevillage.com> writes:
>An OS exception handler could easily diagnose an AVX-512
>instruction and re-route the thread to the big core.

Yes. This is an interesting difference between the Intel way of
adding instruction set extensions for the additional width and
capability, and ARM's SVE approach (as far as I understand it). While
SVE allows to run the program on a CPU with arbitrary width (and make
use of the width without recompilation), it does not allow to migrate
a running thread to a core with different width, because the thread
can have the SVE width in many places in its state; foremost the
vector registers. This will be interesting when people want to
migrate virtual machines between ARM CPUs with different SVE widths.

By contrast, you can at least migrate an AVX program from a core that
has AVX to one that has AVX and AVX-512.

So, much as SVE is an advance in width independence, there is still
something missing. Maybe a special-purpose register that indicates
the currently-used width and lets wider cores work as if they were
narrower. This would allow at least migration to wider cores.

Making use of the full width soon after could be facilitated by
letting the software read the real width at points with empty vector
state, and propagating the possibly new width to all the places where
it matters (including the "currently-used width" register). However,
given that migration to wider cores is a rare event, such an approach
might lead to bugs popping up in the field, and I am skeptical that
the additional utilization is worth the risk.

I am sure that Mitch Alsup will point out that his VVM does not have
these migration problems, because the architectural state does not
include vectors or some kind of width.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<fe487697-83a3-4085-8dac-74b39fa587bdn@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21809&group=comp.arch#21809

  copy link   Newsgroups: comp.arch
X-Received: by 2002:a05:622a:134e:: with SMTP id w14mr11047705qtk.33.1635782371735;
Mon, 01 Nov 2021 08:59:31 -0700 (PDT)
X-Received: by 2002:a9d:6e8:: with SMTP id 95mr15860953otx.112.1635782371433;
Mon, 01 Nov 2021 08:59:31 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Mon, 1 Nov 2021 08:59:31 -0700 (PDT)
In-Reply-To: <2021Nov1.142922@mips.complang.tuwien.ac.at>
Injection-Info: google-groups.googlegroups.com; posting-host=104.59.204.55; posting-account=H_G_JQkAAADS6onOMb-dqvUozKse7mcM
NNTP-Posting-Host: 104.59.204.55
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk>
<2021Oct31.230648@mips.complang.tuwien.ac.at> <zlLfJ.4920$cW6.3012@fx08.iad> <2021Nov1.142922@mips.complang.tuwien.ac.at>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <fe487697-83a3-4085-8dac-74b39fa587bdn@googlegroups.com>
Subject: Re: Architectural extensions
From: MitchAl...@aol.com (MitchAlsup)
Injection-Date: Mon, 01 Nov 2021 15:59:31 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 39
 by: MitchAlsup - Mon, 1 Nov 2021 15:59 UTC

On Monday, November 1, 2021 at 9:01:52 AM UTC-5, Anton Ertl wrote:
> EricP <ThatWould...@thevillage.com> writes:
> >An OS exception handler could easily diagnose an AVX-512
> >instruction and re-route the thread to the big core.
> Yes. This is an interesting difference between the Intel way of
> adding instruction set extensions for the additional width and
> capability, and ARM's SVE approach (as far as I understand it). While
> SVE allows to run the program on a CPU with arbitrary width (and make
> use of the width without recompilation), it does not allow to migrate
> a running thread to a core with different width, because the thread
> can have the SVE width in many places in its state; foremost the
> vector registers. This will be interesting when people want to
> migrate virtual machines between ARM CPUs with different SVE widths.
>
> By contrast, you can at least migrate an AVX program from a core that
> has AVX to one that has AVX and AVX-512.
>
> So, much as SVE is an advance in width independence, there is still
> something missing. Maybe a special-purpose register that indicates
> the currently-used width and lets wider cores work as if they were
> narrower. This would allow at least migration to wider cores.
>
> Making use of the full width soon after could be facilitated by
> letting the software read the real width at points with empty vector
> state, and propagating the possibly new width to all the places where
> it matters (including the "currently-used width" register). However,
> given that migration to wider cores is a rare event, such an approach
> might lead to bugs popping up in the field, and I am skeptical that
> the additional utilization is worth the risk.
>
> I am sure that Mitch Alsup will point out that his VVM does not have
> these migration problems, because the architectural state does not
> include vectors or some kind of width.
<
Yes, but that would be too obvious...........
<
> - anton
> --
> 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
> Mitch Alsup, <c17fcd89-f024-40e7...@googlegroups.com>

Re: Architectural extensions

<lwUfJ.93806$mU7.75573@fx46.iad>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21810&group=comp.arch#21810

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!paganini.bofh.team!news.dns-netz.com!news.freedyn.net!newsreader4.netcologne.de!news.netcologne.de!peer03.ams1!peer.ams1.xlned.com!news.xlned.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx46.iad.POSTED!not-for-mail
From: ThatWoul...@thevillage.com (EricP)
User-Agent: Thunderbird 2.0.0.24 (Windows/20100228)
MIME-Version: 1.0
Newsgroups: comp.arch
Subject: Re: Architectural extensions
References: <2021Oct31.184628@mips.complang.tuwien.ac.at> <memo.20211031195903.12464q@jgd.cix.co.uk> <2021Oct31.230648@mips.complang.tuwien.ac.at> <zlLfJ.4920$cW6.3012@fx08.iad> <2021Nov1.142922@mips.complang.tuwien.ac.at>
In-Reply-To: <2021Nov1.142922@mips.complang.tuwien.ac.at>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Lines: 28
Message-ID: <lwUfJ.93806$mU7.75573@fx46.iad>
X-Complaints-To: abuse@UsenetServer.com
NNTP-Posting-Date: Mon, 01 Nov 2021 16:23:45 UTC
Date: Mon, 01 Nov 2021 12:23:34 -0400
X-Received-Bytes: 2303
 by: EricP - Mon, 1 Nov 2021 16:23 UTC

Anton Ertl wrote:
> EricP <ThatWouldBeTelling@thevillage.com> writes:
>> An OS exception handler could easily diagnose an AVX-512
>> instruction and re-route the thread to the big core.
>
> Yes. This is an interesting difference between the Intel way of
> adding instruction set extensions for the additional width and
> capability, and ARM's SVE approach (as far as I understand it). While
> SVE allows to run the program on a CPU with arbitrary width (and make
> use of the width without recompilation), it does not allow to migrate
> a running thread to a core with different width, because the thread
> can have the SVE width in many places in its state; foremost the
> vector registers. This will be interesting when people want to
> migrate virtual machines between ARM CPUs with different SVE widths.

On ARM SVE couldn't the OS thread management routines patch the
thread context? In which case this issue would mostly be one of
documentation of where in the save state SVE width is stored.

Or is there more to it than that? For example, if the thread was
interrupted part way through a vector or multiple vectors then
I can see that switching the width on the fly could be problematic.

Hardware might provide the OS with a sync mechanism which if the
core was executing any vector ops would finish the current vectors
and then trap to the OS, kinda like a single step.

Re: Architectural extensions

<jwv5ytbu7ql.fsf-monnier+comp.arch@gnu.org>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21811&group=comp.arch#21811

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: monn...@iro.umontreal.ca (Stefan Monnier)
Newsgroups: comp.arch
Subject: Re: Architectural extensions
Date: Mon, 01 Nov 2021 13:09:19 -0400
Organization: A noiseless patient Spider
Lines: 17
Message-ID: <jwv5ytbu7ql.fsf-monnier+comp.arch@gnu.org>
References: <2021Oct31.184628@mips.complang.tuwien.ac.at>
<memo.20211031195903.12464q@jgd.cix.co.uk>
<2021Oct31.230648@mips.complang.tuwien.ac.at>
<zlLfJ.4920$cW6.3012@fx08.iad>
<2021Nov1.142922@mips.complang.tuwien.ac.at>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Injection-Info: reader02.eternal-september.org; posting-host="221e77a1557d86cf1c7c34bd720428f9";
logging-data="22402"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+iKmhEoUboDkNY8OGpNr/o"
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux)
Cancel-Lock: sha1:wUg4SI3qgZYTA/clXZJ+jaLVMUY=
sha1:nx8hRj+gdOlreRCFIl8+H9eQclQ=
 by: Stefan Monnier - Mon, 1 Nov 2021 17:09 UTC

> By contrast, you can at least migrate an AVX program from a core that
> has AVX to one that has AVX and AVX-512.

IIUC, with SVE it is still safe to migrate from a CPU with vector length
N to a CPU with vector length ≥N (but once that process has run there it's
not safe to move it back).

I'm not sure what happens in SVE if you request a vector length higher
than the CPU supports (which is akin to the case of running AVX-512 code
on a CPU that doesn't support it), but I expect it would signal an
exception, which the OS could then catch to move the process to a CPU
with a longer vector length.

So it doesn't look much worse than the situation with AVX.

Stefan

Re: Paper about ISO C

<2021Nov1.163845@mips.complang.tuwien.ac.at>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21812&group=comp.arch#21812

  copy link   Newsgroups: comp.arch
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ant...@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Paper about ISO C
Date: Mon, 01 Nov 2021 15:38:45 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 51
Message-ID: <2021Nov1.163845@mips.complang.tuwien.ac.at>
References: <87fstdumxd.fsf@hotmail.com> <sku3dr$1hb$2@dont-email.me> <5d25afd4-0e2c-4e98-a457-a04be5ae88dbn@googlegroups.com> <sl3c2g$n4b$1@dont-email.me> <2021Oct25.195829@mips.complang.tuwien.ac.at> <sl8m0n$ipt$1@newsreader4.netcologne.de> <2021Oct26.174949@mips.complang.tuwien.ac.at> <slarar$2qd$1@newsreader4.netcologne.de> <slatma$8au$1@dont-email.me> <2021Oct27.193426@mips.complang.tuwien.ac.at> <sldr8s$kvi$1@dont-email.me> <slh023$nqe$1@dont-email.me>
Injection-Info: reader02.eternal-september.org; posting-host="0d65796f621b5a49322afdb7c8e7fd32";
logging-data="15189"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18UOtot8umURTlHjaJNtl8A"
Cancel-Lock: sha1:2ImG87iosMyRAMun972mF4oCo8w=
X-newsreader: xrn 10.00-beta-3
 by: Anton Ertl - Mon, 1 Nov 2021 15:38 UTC

David Brown <david.brown@hesbynett.no> writes:
>On 28/10/2021 11:41, David Brown wrote:
>> On 27/10/2021 19:34, Anton Ertl wrote:
>>> Read all about it in Section 4.2 of
>>> <https://www.complang.tuwien.ac.at/kps2015/proceedings/KPS_2015_submission_29.pdf>.
....
>How about presenting
>some facts and data?
[Later you write:]
>Eventually, I gave up reading the rest of the paper too.

If you really are interested in facts and data, maybe you should not
have skipped them.

>The paper has references taken out of context or designed to give the
>incorrect impression. For example, on page 4 we have:
>
>"""
>The GCC maintainers subsequently disabled this optimization for the case
>occuring in SPEC, demonstrating that, inofficially, GCC supports Cbench,
>not “C”.
>"""
>
>The associated footnote refers to a gcc bug report which you presumably
>hope readers of your paper will not bother checking. If they do, they
>will see that the reporter of the "bug" thanks the gcc developers for
>helping him find the error in his code, and that report is marked as
>"invalid" - not a bug in the compiler.

If the gcc developers mark a bug report as "invalid", that means that
they don't consider it to be a bug. It does not mean that it is not a
bug.

In the case at hand, the bug report is
<https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66875>. Here gcc finds
out that it can prove that there will be an out-of-bounds access if
execution reaches the loop. What does gcc do? Does it warn about the
out-of-bounds access? No. There would not have been a bug report if
it did. Which reminds me of the first specification for FORTRAN,
which states: "no special provisions havebeen included in the FORTRAN
system for locating errors in formulas".

So what does gcc do instead? It quietly deletes the loop.

I checked with gcc-10.2 -Wall -O3 and it still does not warn about the
out-of-bounds access and it still deletes the loop.

- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>

Re: Architectural extensions

<2465598e-7ff5-4a25-97cf-f7619452d9cdn@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=21813&group=comp.arch#21813

  copy link   Newsgroups: comp.arch
X-Received: by 2002:ad4:54ca:: with SMTP id j10mr8446487qvx.2.1635787895105;
Mon, 01 Nov 2021 10:31:35 -0700 (PDT)
X-Received: by 2002:a9d:7482:: with SMTP id t2mr22664081otk.350.1635787894939;
Mon, 01 Nov 2021 10:31:34 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.arch
Date: Mon, 1 Nov 2021 10:31:34 -0700 (PDT)
In-Reply-To: <2021Nov1.141753@mips.complang.tuwien.ac.at>
Injection-Info: google-groups.googlegroups.com; posting-host=199.203.251.52; posting-account=ow8VOgoAAAAfiGNvoH__Y4ADRwQF1hZW
NNTP-Posting-Host: 199.203.251.52
References: <87fstdumxd.fsf@hotmail.com> <sl9c25$jdu$1@dont-email.me>
<69b29aca-621c-4346-8ff6-f7a30f60a59dn@googlegroups.com> <sla5tu$6sh$1@dont-email.me>
<3acef73c-9f89-4c16-bc93-87b37400e56fn@googlegroups.com> <84AeJ.12005$ya3.4258@fx38.iad>
<32eddf0a-2e06-454b-928f-871c8d766b26n@googlegroups.com> <IMEeJ.219$g81.51@fx19.iad>
<MDffJ.4738$6a3.1363@fx41.iad> <2021Oct31.104940@mips.complang.tuwien.ac.at>
<tlzfJ.7101$lz3.1534@fx34.iad> <b8c39eff-e9cd-44ec-aa05-1ff32ead902dn@googlegroups.com>
<2021Nov1.141753@mips.complang.tuwien.ac.at>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <2465598e-7ff5-4a25-97cf-f7619452d9cdn@googlegroups.com>
Subject: Re: Architectural extensions
From: already5...@yahoo.com (Michael S)
Injection-Date: Mon, 01 Nov 2021 17:31:35 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 30
 by: Michael S - Mon, 1 Nov 2021 17:31 UTC

On Monday, November 1, 2021 at 3:29:00 PM UTC+2, Anton Ertl wrote:
> Quadibloc <jsa...@ecn.ab.ca> writes:
> >Since the Atom-based cores in the Xeon Phi were where AVX-512
> >originated, it should have been possible to include AVX-512 support
> >on the E-cores of Aldler Lake to avoid this horrendous compromise.
> The most recent Xeon Phi is Silvermont-based and supports a different
> AVX-512 than Golden Cove. Silvermont is several generations behind
> Gracemont: Silvermont, Goldmont, Goldmont+, Tremont, Gracemont. It's
> probably non-trivial to adapt Xeon Phi AVX-512 to Gracemont and extend
> it to the full extent of present-day AVX-512. This was also in a
> larger process, and was not designed in a way that makes shrinking
> easy. And the main idea of Gracemont is to be small

It would surprise me if Gracemont is smaller than Broadwell, process-adjusted.

> and consume
> little power, and Xeon Phi AVX-512 units were probably designed more
> for performance.

For high issue rate (up to two 512-bit FMA/clock) and low frequency.
I don't know if it could be called "for performance", because in practice high issue rate was unattainable due to other bottlenecks, esp. decode.
IMHO, KNL was stupidly bad design that should have never been allowed to ship.

>
> An alternative would be to implement AVX-512 with narrower units (like
> AMD implemented AVX-256 before Zen 2, including on its small and
> efficient cores). But for some reason Intel did not go there.
> - anton
> --
> 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
> Mitch Alsup, <c17fcd89-f024-40e7...@googlegroups.com>

Pages:123456789101112131415161718192021222324252627282930313233
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor