More philosophy about AMD Ryzen Threadripper PRO and Nvidia V100 PCIe (Volta) and more..

X-Received: by 2002:ad4:4b25:: with SMTP id s5mr6982798qvw.52.1628192253121;
Thu, 05 Aug 2021 12:37:33 -0700 (PDT)
X-Received: by 2002:a25:804d:: with SMTP id a13mr8278510ybn.257.1628192252910;
Thu, 05 Aug 2021 12:37:32 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.math
Date: Thu, 5 Aug 2021 12:37:32 -0700 (PDT)
Injection-Info: google-groups.googlegroups.com; posting-host=173.178.84.155; posting-account=R-6XjwoAAACnHXTO3L-lyPW6wRsSmYW9
NNTP-Posting-Host: 173.178.84.155
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <591cc670-aab5-498a-bf65-f2212d79947cn@googlegroups.com>
Subject: More philosophy about AMD Ryzen Threadripper PRO and Nvidia V100 PCIe
(Volta) and more..
From: amine...@gmail.com (Amine Moulay Ramdane)
Injection-Date: Thu, 05 Aug 2021 19:37:33 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Amine Moulay Ramdane - Thu, 5 Aug 2021 19:37 UTC

Hello,

More philosophy about AMD Ryzen Threadripper PRO and Nvidia V100 PCIe (Volta) and more..

I am a white Arab, and I think I am smart, since I have also
invented many scalable algorithms and other algorithms.

I think that in the very near future CPUs will be able to replace GPUs
in the specialized work that GPUs do. We can estimate this by noting below that the next-gen Sapphire Rapids Xeon with High Bandwidth Memory will come close to both the memory bandwidth and the GFLOPS of the powerful Nvidia V100 PCIe (Volta) GPU, and it will come at a competitive price. Read all my thoughts and do your own comparison to see it.

And I invite you to look at the following spec of AMD Ryzen Threadripper PRO 3975WX 32-Core CPU:

https://www.techpowerup.com/cpu-specs/ryzen-threadripper-pro-3975wx.c2315

And look carefully at the following benchmark:

https://www.xcelerit.com/computing-benchmarks/insights/benchmarks-intel-xeon-scalable-processor-vs-nvidia-v100-gpu/

So, as you can see, the spec of the Nvidia V100 PCIe (Volta) 16 GB is 7,014 GFLOPS (double precision), and the AMD Ryzen Threadripper PRO 3975WX 32-Core CPU is around 6,451.2 GFLOPS, but look carefully at the price of the Nvidia V100 PCIe (Volta) 16 GB, which is 7124 US dollars:

https://www.amazon.ca/PNY-TCSV100MPCIE-PB-Nvidia-Tesla-v100/dp/B076P84525/ref=pd_di_sccai_3?pd_rd_w=AmXIj&pf_rd_p=e92f388e-b766-4f7f-aac1-ee1d0056e8fb&pf_rd_r=77B7DWXEVBM5VSXT4NZG&pd_rd_r=27e26b6a-c0a6-4558-8a68-97e286ba6213&pd_rd_wg=HxaUi&pd_rd_i=B076P84525&psc=1

And look at the price of the AMD Ryzen Threadripper PRO 3975WX 32-Core CPU,
which is 2790 US dollars:

https://www.newegg.ca/amd-ryzen-threadripper-pro-3975wx/p/N82E16819113677

So I think that the AMD Ryzen Threadripper PRO 3975WX 32-Core CPU is competitive with the Nvidia V100 PCIe (Volta) 16 GB in both performance and price per GFLOPS.
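
To make the price comparison above concrete, here is a minimal Python sketch that divides each quoted price by the quoted peak GFLOPS. The function name is only illustrative, and the numbers are the ones quoted in this post, not measurements.

```python
# Price per GFLOPS, using only the figures quoted in this post.
# The helper name dollars_per_gflops is illustrative, not from any library.

def dollars_per_gflops(price_usd: float, peak_gflops: float) -> float:
    """Return the price paid per GFLOPS of quoted peak throughput."""
    return price_usd / peak_gflops

# Nvidia V100 PCIe (Volta) 16 GB: 7124 USD, 7,014 GFLOPS (double)
v100 = dollars_per_gflops(7124, 7014.0)

# AMD Ryzen Threadripper PRO 3975WX: 2790 USD, around 6,451.2 GFLOPS
threadripper = dollars_per_gflops(2790, 6451.2)

print(f"V100 PCIe 16 GB:         {v100:.3f} USD per GFLOPS")
print(f"Threadripper PRO 3975WX: {threadripper:.3f} USD per GFLOPS")
print(f"Price-per-GFLOPS ratio:  {v100 / threadripper:.2f}x")
```

With these inputs the GPU costs a little over 1 USD per GFLOPS and the CPU well under half of that. Only the V100 figure is labeled as double precision above, so the comparison holds only if the CPU figure is counted the same way.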

I have just read the following interesting article about AVX512:

On the dangers of Intel's frequency scaling

https://blog.cloudflare.com/on-the-dangers-of-intels-frequency-scaling/

As you will have noticed from reading the above article, you should
not use AVX512, because it heats the CPU cores a lot, so Intel responds
by greatly reducing the clock speed of those cores, which is not good for performance. So what I advise is to avoid AVX2 and AVX512
and to use AVX, which does not have this problem.
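
The trade-off described above can be sketched with a toy model in Python. Everything here is an assumption for illustration: the function name, the idea that the lower clock applies to the whole program, and the example numbers (a 2x vector speedup and a clock reduced to 80%) are hypothetical and are not taken from the article.

```python
# Toy model: does wide-vector code pay off once the clock is reduced?
# All names and numbers are hypothetical, chosen only for illustration.

def relative_runtime(vector_fraction: float,
                     vector_speedup: float,
                     clock_ratio: float) -> float:
    """Runtime relative to a baseline of 1.0 at the nominal clock.

    vector_fraction: share of baseline runtime that can use wide vectors
    vector_speedup:  how much faster that share runs with wide vectors
    clock_ratio:     reduced clock / nominal clock, assumed here to
                     apply to the whole program (a simplification)
    """
    work = (1.0 - vector_fraction) + vector_fraction / vector_speedup
    return work / clock_ratio

# Mostly scalar program with a small vectorized part: ends up slower.
light = relative_runtime(vector_fraction=0.05, vector_speedup=2.0, clock_ratio=0.8)

# Heavily vectorized program: still faster despite the lower clock.
heavy = relative_runtime(vector_fraction=0.90, vector_speedup=2.0, clock_ratio=0.8)

print(f"5% vector work:  {light:.3f}x baseline runtime")
print(f"90% vector work: {heavy:.3f}x baseline runtime")
```

Under these assumed numbers, a program that spends only a little time in wide-vector code runs slower overall, while a heavily vectorized one still gains, so the outcome depends on the mix of work.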

More of my philosophy about Bus Hierarchies with Distributed Memory...

I have to explain more, since I am specialized in parallel programming and concurrency. You have to know that AMD Threadripper Pro and similar CPUs also use bus hierarchies of many buses with distributed memory and many memory controllers in order to scale, and they use snooping on each of the many buses. So even if they don't use the scalable
Directory-Based Cache Coherence, they scale well through good placement that reduces global bus traffic and latency.

More of my philosophy about AMD Threadripper Pro multicore CPU..

You can read more about my education and my way of doing here:

Here is more proof that I have invented many scalable algorithms and other algorithms:

https://groups.google.com/g/comp.programming.threads/c/V9Go8fbF10k

I am specialized in parallel programming and concurrency, and I think I know more about multicore CPUs. The AMD Threadripper Pro multicore CPU has many controllers that send on the many buses of the NUMA nodes by doing snooping, so it is not Directory-Based Cache Coherence, but it is an efficient approach that reduces snooping traffic because it is parallelized. Here are more of my thoughts about this subject:

More about data centers and about NUMA multicore servers..

I have just written the following:

--------------------------

---
About Snooping vs. Directory-based coherency..

Performance Scalability of a Multi-core Web Server

Read more here:

https://www.cse.wustl.edu/ANCS/slides/Bryan%20Veal%20ANCS%20Presentation.pdf

As you can see from the slides above, address bus saturation causes
poor scaling! The address bus carries requests and
responses for data, called snoops, and more caches mean more sources
and more destinations for snoops, which is what causes the poor scaling.

So to solve the problem of poor scalability above, you have to use
directory-based coherence, which is a mechanism for handling the cache coherence problem in Distributed Shared Memory (DSM), a.k.a. Non-Uniform Memory Access (NUMA).

---
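
The scaling argument above can be illustrated with a rough message-count model in Python. This is only a sketch under simplifying assumptions: both function names are hypothetical, each miss under snooping is assumed to probe every other cache, each miss under a directory is assumed to cost one lookup at the home node plus one message per sharer, and replies, data transfers and directory storage are ignored.

```python
# Rough count of coherence messages: broadcast snooping vs. a directory.
# Function names and the cost model are assumptions for illustration only.

def snoop_messages(num_caches: int, misses: int) -> int:
    """Broadcast snooping: each miss probes every other cache."""
    return misses * (num_caches - 1)

def directory_messages(sharers_per_line: int, misses: int) -> int:
    """Directory: one lookup at the home node plus one message per sharer."""
    return misses * (1 + sharers_per_line)

MISSES = 1000   # hypothetical number of cache misses
SHARERS = 2     # hypothetical average number of caches sharing a line

for caches in (2, 4, 8, 16, 32, 64):
    snoop = snoop_messages(caches, MISSES)
    directory = directory_messages(SHARERS, MISSES)
    print(f"{caches:3d} caches: snoop={snoop:6d}  directory={directory:6d}")
```

In this model the snooping traffic grows with the number of caches, while the directory traffic depends only on how many caches actually share a line, which is the sense in which the directory approach is called scalable here.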

And you have to know that Directory-Based Cache Coherence is scalable.

Read more here about it:

http://www.cs.cmu.edu/afs/cs/academic/class/15418-s19/www/lectures/13_directory.pdf

So you have to choose Directory-Based Cache Coherence, which is scalable, by using NUMA systems.

-------------------------------

You have to know that data centers now typically use NUMA multicore servers, which provide "scalable" system performance, are
"cost-effective", and provide Directory-Based Cache Coherence that is scalable.

Read more here:

https://books.google.ca/books?id=3iy6BQAAQBAJ&pg=PA32&lpg=PA32&dq=NUMA+systems+are+cost+effective&source=bl&ots=zXHJZ7oqqW&sig=ACfU3U24MbzxiuPXJB6W6p0JCtkl9hxCHQ&hl=en&sa=X&ved=2ahUKEwjX9o_gubTqAhVBc98KHU0WDJ8Q6AEwCnoECAgQAQ#v=onepage&q=NUMA%20systems%20are%20cost%20effective&f=false

I invite you to read the following interesting article about AMD Threadripper Pro CPU and you will notice that it is a very good CPU
that supports ECC memory and is effectively a faster version of AMD’s EPYC, limited for single CPU workstation use:

AMD Threadripper Pro Review: An Upgrade Over Regular Threadripper?

https://www.anandtech.com/show/16805/amd-threadripper-pro-review-an-upgrade-over-regular-threadripper

And read the following:

AMD's Zen 4 EPYC could get HBM upgrade to fight Intel Sapphire Rapids

Read more here:

https://www.tweaktown.com/news/80637/amds-zen-4-epyc-could-get-hbm-upgrade-to-fight-intel-sapphire-rapids/index.html

And read the following about Intel's Next-Gen Sapphire Rapids Xeon with High Bandwidth Memory:

Intel to Launch Next-Gen Sapphire Rapids Xeon with High Bandwidth Memory

"Current eight-channel DDR4-3200 memory designs, for example, have a theoretical maximum of 204.8 gigabytes per second, which pales in comparison to GPUs which quote 1000 gigabytes per second or more. GPUs are able to achieve higher bandwidths because they use GDDR.

At the very least, we expect the equivalent of up to 8-Hi stacks of HBM2e, up to 16GB each, with 1-4 stacks onboard leading to 64 GB of HBM. At a theoretical top speed of 460 GB/s per stack, this would mean 1840 GB/s of bandwidth, although we can imagine something more akin to 1 TB/s for yield and power which would still give a sizeable uplift."

Read more here:

https://www.anandtech.com/show/16795/intel-to-launch-next-gen-sapphire-rapids-xeon-with-high-bandwidth-memory
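
The bandwidth figures in the AnandTech quote above can be reproduced with simple arithmetic. The Python sketch below assumes a 64-bit (8-byte) data bus per DDR4 channel; the function names are only illustrative, and the HBM numbers are the theoretical ones from the quote.

```python
# Theoretical peak memory bandwidth, reproducing the figures quoted above.
# Function names are illustrative; 8 bytes per transfer assumes a 64-bit
# data bus per DDR4 channel.

def ddr_bandwidth_gb_s(channels: int, mega_transfers_per_s: int,
                       bytes_per_transfer: int = 8) -> float:
    """Peak DDR bandwidth in GB/s: channels x MT/s x bytes per transfer."""
    return channels * mega_transfers_per_s * bytes_per_transfer / 1000.0

def hbm_bandwidth_gb_s(stacks: int, gb_s_per_stack: float) -> float:
    """Peak HBM bandwidth in GB/s: stacks x per-stack bandwidth."""
    return stacks * gb_s_per_stack

ddr4 = ddr_bandwidth_gb_s(channels=8, mega_transfers_per_s=3200)
hbm2e = hbm_bandwidth_gb_s(stacks=4, gb_s_per_stack=460.0)

print(f"8-channel DDR4-3200: {ddr4:.1f} GB/s")
print(f"4 stacks of HBM2e:   {hbm2e:.1f} GB/s")
print(f"HBM2e / DDR4 ratio:  {hbm2e / ddr4:.1f}x")
```

This gives 204.8 GB/s for eight-channel DDR4-3200 and 1840 GB/s for four HBM2e stacks, matching the quote, so the theoretical uplift is roughly ninefold before the yield and power limits that the article mentions.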

Russia To Build RISC-V Processors for Laptops: 8-core, 2 GHz, 12nm, 2025

Read more here:

https://www.anandtech.com/show/16827/russia-to-build-riscv-processors-for-laptops-8core-2-ghz-12nm-2025

Thank you,
Amine Moulay Ramdane.
