Re: None

Tuesday, 09/12/2023 11:09:41 PM

Post# of 204336
Something to think about:

https://www.edn.com/generative-ai-and-memory-wall-a-wakeup-call-for-ic-industry/

"The impact on Generative AI: Out of control cost

Today, the impact of the memory wall on Generative AI processing is out of control.

In less than one year, GPT, the foundation model powering ChatGPT, evolved from GPT-2 to GPT-3/GPT-3.5 to the current GPT-4. Each generation inflated the model size and the number of parameters (weights, tokens, and states) by an order of magnitude. GPT-3 models incorporated 175 billion parameters. The most recent GPT-4 models pushed the size to 1.7 trillion parameters.

Since these parameters must be stored in memory, the memory size requirement exploded into terabytes territory. To make things worse, all these parameters must be accessed simultaneously at high speed during training/inference, pushing memory bandwidth to hundreds of gigabytes/sec, if not terabytes/sec.

The daunting data transfer bandwidth between memory and processor brings the processor efficiency to its knees. "
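
For anyone who wants to sanity-check the "terabytes territory" claim in the quote, here is a rough back-of-envelope sketch. The parameter counts are the ones the article quotes; the bytes per parameter, the tokens-per-second rate, and the idea that every weight is read once per generated token are my own simplifying assumptions, not figures from the article.

```python
# Back-of-envelope sketch. Parameter counts come from the quoted article;
# everything else (precision widths, token rate, one full weight read per
# token) is an assumption made for illustration only.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}

GPT3_PARAMS = 175_000_000_000      # 175 billion, as quoted
GPT4_PARAMS = 1_700_000_000_000    # 1.7 trillion, as quoted


def weight_footprint_bytes(num_params: int, precision: str = "fp16") -> int:
    """Memory needed just to hold the weights at the given precision."""
    return num_params * BYTES_PER_PARAM[precision]


def bandwidth_bytes_per_sec(num_params: int, precision: str,
                            tokens_per_sec: float) -> float:
    """Memory traffic if every weight is read once per generated token.

    Assumes a dense model, batch size 1, and no on-chip reuse of weights.
    """
    return weight_footprint_bytes(num_params, precision) * tokens_per_sec


if __name__ == "__main__":
    for name, params in (("GPT-3", GPT3_PARAMS), ("GPT-4", GPT4_PARAMS)):
        size_tb = weight_footprint_bytes(params, "fp16") / 1e12
        bw_tb = bandwidth_bytes_per_sec(params, "fp16", 10) / 1e12
        print(f"{name}: {size_tb:.2f} TB of fp16 weights, "
              f"{bw_tb:.1f} TB/s at an assumed 10 tokens/s")
```

Under those assumptions, 16-bit weights alone come to about 0.35 TB for the 175-billion-parameter figure and about 3.4 TB for the 1.7-trillion figure, which is in line with the article's point about both capacity and bandwidth.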

L_R