[UPDATE IV - Last Update] Will increased CPU Memory bandwidth make a difference (existing: 4090/13900K/DDR5 6000)?

long story short: unclear outcome and mobo vendor pulled a fast one on me :\

  • benchmark: no material change
  • rendering (quick tests, for my workload): range went from 6.1-9.3 to 6.6-9.X. Some tests show as high as 9.7-10 but only for 2-3 seconds

Next steps: kick off some long jobs and see how they do.

Topaz Video AI  v4.0.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K  47.715 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: -2 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	40.46 fps 	2X: 	18.95 fps 	4X: 	05.22 fps 	
Iris		1X: 	39.38 fps 	2X: 	21.32 fps 	4X: 	05.63 fps 	
Proteus		1X: 	33.47 fps 	2X: 	17.52 fps 	4X: 	05.10 fps 	
Gaia		1X: 	14.11 fps 	2X: 	09.56 fps 	4X: 	05.63 fps 	
Nyx		1X: 	16.80 fps 	
4X Slowmo		Apollo: 	39.47 fps 	APFast: 	85.41 fps 	Chronos: 	31.12 fps 	CHFast: 	34.29 fps 	

Notes:

  1. my mobo vendor (asus pro art) decided to change their QVL in the last few days (SINCE I CREATED THE THREAD…sigh) to remove the 8000mhz ram stick :expressionless:

  • 8000 xmp I / xmp II both cause my mobo to go in safe mode.
  • Given that this is all SK Hynix, I tried 7800 but it was unstable.
  • I then tried the 7600 setting (with the timings on the site, given that mine is probably more than capable of running them) mirroring the 24GB ram chip listing and it has been pretty stable now.
  • Current setting: 7600mhz / 38-48-48-121 / 1.35v

This has thrown off my custom overlock settings so I am just using the ā€œAI overclockingā€ settings for now. I am only seeing about 5-5.6ghz when rendering.

  1. I did update the bios which might have hurt performance. folks in the asus rog forums are complaining of stability and overlocking issues with high-speed memory (different boards but some of it mirrors my experiences)

  2. I had also updated my nvidia drivers which tanked performance. reverted.

  3. throughout my overlocking / stability adventure I was seeing as high as 10fps but alas, my machine was crashing due to all sort of memory errors. Additionally I had VBS / memory isolation etc turned off to use hwinfo, etc. Now that things have stabilized and security is back on I am still getting a stable high of 9.3fps.

  4. At least in some initial testing, the low-end of the fps has definitely improved.

with memory integrity off, high of 9.8fps

Topaz Video AI  v4.0.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K  47.715 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: -2 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	41.03 fps 	2X: 	21.35 fps 	4X: 	05.29 fps 	
Iris		1X: 	41.76 fps 	2X: 	22.61 fps 	4X: 	05.76 fps 	
Proteus		1X: 	34.74 fps 	2X: 	18.29 fps 	4X: 	06.05 fps 	
Gaia		1X: 	14.70 fps 	2X: 	09.79 fps 	4X: 	05.55 fps 	
Nyx		1X: 	17.24 fps 	
4X Slowmo		Apollo: 	41.21 fps 	APFast: 	93.79 fps 	Chronos: 	32.50 fps 	CHFast: 	35.92 fps 	

Since this overall seems within the fluctuatios of the benchmark accuracy (at least on the Mac there are at least such differences from one run to another without changing anything) I guess doing a longer real encode and comparing times of that would be better.

Also, it might be that the higher RAM speed could help with things that are not measured by the benchmark (e.g. encoding process of the resulting video and the like)

And after all, this is what counts: real-life performance and not a Benchmark result.

2 Likes

That’s where I expect to see any real benefits as well.

I have a few test clips that take about 8hrs each - we’ll see how it goes :slight_smile:

I also have a lot of tuning to do due to the potential memory incompatibility so my results are subject to change - I just had to adjust my timings again due to crashes :frowning:

1 Like

What a roller coaster! I’m also thinking of upgrading the RAM in both of my computers… but I’m not wanting to deal with the same things you are!

1 Like

I tested different memory operating modes on my server. Increase of a fraction of a percent.
An increase in gaia of more than 50% can be obtained by adding a second video card.

1 Like

UPDATE II:

Summary
I was finally able to achieve at least basic stability with 7600mhz (XMPII) / custom timing / increased voltages using 8000mhz sticks.

  • [NEW] Cinebench short-run results are stable at 39K even after consecutive runs. 10min tests crash.
  • [NEW] Highest TVAI FPS during a short custom workload test is 9.7fps (vs 9.3fps - correction)
  • [NEW] Lowest TVAI FPS during a short custom workload test is 7.1fps (vs 6.1fps - correction)
  • [NEW] Render jobs seem to be working well. Never goes below 7.0fps.
  • [NEW] TVAI Benchmarks aren’t that much faster but are much MORE CONSISTENT, even after back to back runs.

Drama (in case you’re interested) :face_vomiting:
So after both the QVL fiasco and some research, I purchased some DDR5 7600mhz memory. To my dismay I still had quite a bit of instability. Running Cinebench R23 twice or TVAI once (?!) would force a reboot. I thought the QVL took care of both timings and voltages but it isn’t enough. You need multiple voltage increases to achieve stability.

The following link proved very helpful. Using these settings I was able to get the 7600mhz ram stable. I then figured I’d try it with the 8000mhz sticks. Same result - so I stuck with the 8000mhz sticks for extra potential stability / headroom if that’s how it works. Haven’t tried overclocking the CPU or increasing the GPU overlock to see if it stays stable / makes a difference.
How do I max Hero z790 + 8000mhz Memory + 14900k i… - Republic of Gamers Forum - 965774 (asus.com)

stats when I was momentarily back@ 6000mhz on my 6000mhz sticks

Topaz Video AI  v4.0.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K  63.714 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: -2 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	42.17 fps 	2X: 	20.69 fps 	4X: 	05.71 fps 	
Iris		1X: 	40.33 fps 	2X: 	21.56 fps 	4X: 	06.48 fps 	
Proteus		1X: 	35.83 fps 	2X: 	19.40 fps 	4X: 	05.73 fps 	
Gaia		1X: 	14.62 fps 	2X: 	09.78 fps 	4X: 	05.82 fps 	
Nyx		1X: 	17.37 fps 	
4X Slowmo		Apollo: 	43.96 fps 	APFast: 	98.14 fps 	Chronos: 	32.74 fps 	CHFast: 	  ...    fps 	

stats @ 7600mhz 38-48-48-121 on my 8000mhz sticks (current)

Topaz Video AI  v4.0.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K  47.714 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: -2 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	42.91 fps 	2X: 	20.07 fps 	4X: 	05.42 fps 	
Iris		1X: 	40.84 fps 	2X: 	21.74 fps 	4X: 	06.14 fps 	
Proteus		1X: 	36.74 fps 	2X: 	18.77 fps 	4X: 	05.42 fps 	
Gaia		1X: 	14.59 fps 	2X: 	09.73 fps 	4X: 	05.87 fps 	
Nyx		1X: 	17.10 fps 	
4X Slowmo		Apollo: 	42.62 fps 	APFast: 	95.43 fps 	Chronos: 	32.60 fps 	CHFast: 	36.70 fps 	

Some low level memory benchmarks would be interesting (e.g. AIDA64). If your (sub)-timings go up too much, there might not be a lot of difference between 7600 and 6000 mhz frequency. When overclocking RAM for max performance, timings are often as-important as frequency, which is why it’s sometimes worth running lower frequency but tighter timings.

Very good OC threads can be found on overclock.net:
https://www.overclock.net/threads/z790-hero-gskill-qvl-a-lie-now-i-need-to-manually-oc.1807865/

https://www.overclock.net/threads/asus-maximus-z790-and-intel-i9-13900k-14900k-an-overclocking-and-tuning-guide.1801569/

https://www.overclock.net/threads/official-asus-strix-maximus-z790-owners-thread.1800191/

1 Like

thanks for the links! I’m rendering right now but will try to get some AIDA64 MT stats for you all at one point.

I too was concerned about timings but read a lot about how DDR5 timings were difficult to decipher.
What’s good is that my performance is up (currently running at 9.8fps) but I will definitely review those links - and maybe explore some tighter timings :wink:

so I read these and they were alarmingly informative.

looks like I am either pushing or exceeding the capabilities of my asus z790-based board and that’s that.

I’ll have to focus on lower clocks and faster timings.

At least I got a speed boost out of it. My jobs are finishing at least 30min-1hr faster (oh 8 hrs that’s a LOT!)

As promised…AIDA64 Results. I haven’t had time to fully overclock so this is using Intel XT Speed Optimizer settings. Highest temp the memory reached was 44C :stuck_out_tongue:

It’s been awhile since I’ve looked at memory transfer speeds. Is this anything to write home about?

AIDA64 - Read

AIDA64 - Write

AIDA64 - Copy

AIDA64 - Latency

great to hear! high RAM overclocks also depend on CPU IMC (integrated memory controller) lottery. seems like u got a good one!

here is a link to aida64 results on a z790 hero board with 5500mhz 13900kf CPU and 7800 Mhz @ 34-45-42-36 CR 2 memory:
https://www.igorslab.de/en/the-ddr5-hero-we-need-asus-maximus-z790-hero-review-with-teardown-ram-and-cpu-oc/3/

read: 123GB/s
write: 120GB/s
copy: 118GB/s
latency: 52.4ns

These can be considered ā€˜best case’ results which are not really achievable without a lot of OC experience and select hardware components.

The one area which might be relatively easy for u to improve through timings, is latency. For this, u should look at tRFC and tREFI settings. Although my guess would be that it’s not so important for TVAI. Sequential Read is probably most important and in that area your performance is already within 10% of those ā€˜high-end’ results.
And for long workloads, stability is the most important aspect in the end anyway.

1 Like

Thanks for the info and the recommendation to focus on latency.

One apparent source of instability has been heat. Accordingly to some of the forums linked here, you apparently have to keep ddr5 very cool at high frequencies. Folks have mentioned 44/45C.

I have pretty good case airflow but I have noticed that the memory temps have reached 50c and higher on long benchmarks, leading to errors. I am revisiting how I cool my ram before spending too much more time on it.

This is a silly looking contraption but it might hopefully prevent me from having to run my case fans @100% to cool the ram and might even do a better job

JONSBO NF-1 BLACK Memory Cooling, DDR5 Active Cooling,Compatible with ATX MB,Dual 5020 Fans, Addressable RGB Fan, Black - Newegg.com

I use a case fan on top of my RAM but my mobo sits horizontal, so no fastening necessary.

1 Like

UPDATE III:

I installed this silly thing (haven’t messed with timings / voltages again just yet). I have the little fans running at 100%.

To see if it actually improves cooling, I tried OCT Memory test. It used to take 50 seconds to get to 50C. Now it takes 10 minutes. Also, the ram stays cool during both Cinebench and TVAI.

  • TVAI running @ 9.8fps results in max temps of 41.8C/43C after 3-4hrs
  • I tested a longer 7hr job and the pc rebooted so I’ll have to move onto voltage / timing testing since the cause doesn’t seem to be temperature related anymore.
  • note on the different temps: the cooler isn’t centered due to lack of space so one ram stick gets better cooling right now and the temp readouts are different.

Additionally, these mini fans are in pull mode. I wonder if Push (blowing onto the ram) would make a difference. :thinking:

One final observation is that I have my pc connected through a UPS and the wattage consumption is MUCH more stable now. Right now it’s hovering at 830watts (+/- 10 watts) during my TVAI jobs. It jumps to maybe 750/850 maybe once a minute or two. Before, it was jumping all over the place all the time. Anecdotal, but the only thing I’ve changed is improved the cooling of the RAM.

Thanks for your help and the links everyone! :slight_smile:


Nice setup!
I would try push mode because the volume of air sucked from below in pull mode will probably be lower due to the more restricted inlet on the side of the RAM. And since u run water cooling on the GPU, there is also no worry to blow hot GPU exhaust air onto the RAM.

Regarding power consumption, for TVAI, I run my 4090 @ 2115Mhz @ 880mv undervolted, which uses about 250W max. When maxing it out, it will consume twice as much but performance is only maybe 10% better (but that also depends on how limiting CPU + RAM are, my 7950 + DDR 6200 CL30 RAM is about 10% slower than your setup).

1 Like

thanks for the feedback.

  • I’ll look into flipping the fans (if possible)
  • This is also my gaming pc so I was only pulling back a little on the 4090 overlock when running TVAI because higher core clocks reduced TVAI performance. Unfortunately some of these jobs take 7hrs+ so a small performance bump adds up. I haven’t revisited my overlocks in awhile for TVAI so maybe it’s time to take a look again.

This is my overlock when running TVAI. You gotta love water cooling - even with an AIO. Running the gpu at nearly 3ghz (~up to 3100 when gaming), pulling down 480+ watts, and my hotspot temp is only as high as 72C. :sunglasses:

Once again the radiator exhaust at the top of the case is an absolute furnace though! :sweat_smile:

Update IV - Last update - This will be the last time I tinker with it for a while :wink:

Conclusion
Increased memory bandwidth beyond 6000mhz for this workload helps even out performance for this demanding chronos/gaia workload but doesn’t improve overall speed by much.

You have to have a special motherboard and get lucky on the RAM vendor / binning and CPU IMC lottery to achieve stability in the high 7x000mhz range or beyond.

Was it worth it now that I already had 6000mhz RAM - not really. :confused:
My FOMO is your gain. :sunglasses:

Final Configuration
G.SKILL Trident Z5 RGB Series (Intel XMP 3.0) DDR5 RAM 48GB (2x24GB) 7600MT/s CL38-48-48-121 with max voltage set at about 1.4xv (forgot the exact value) and using 1.41V via hwinfo

Deets

  • I couldn’t get the 8000mhz memory (removed from the QVL) to be stable above 7200mhz and even then it wasn’t perfectly stable after a few hours.
  • The 7600mhz memory, while on the QVL, was not stable using XMP II. I had to update the voltage past 1.4v to achieve stability.
  • The above is contingent on keeping RAM temperature low. I removed the shroud from the NF-1 fan and raised it a little higher from the motherboard. This, paired with running my case fans a tad higher, helped me achieve stable RAM temps of between 40C-41C.
  • My latency is about 72ns at the defaults and I’m fine with that for now
  • It’s been running jobs for about 24hrs and hasn’t blinked. :relieved:



4 Likes

Regarding those XMP 8000 promises:

That guy is a little bit crazy but know his stuff regarding OC.

Topaz Video AI  v3.5.4
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 7950X 16-Core Processor 127.15 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
GPU: AMD Radeon(TM) Graphics  0.47444 GB
Processing Settings
device: 0 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis     1X:     29.90 fps   2X:     12.16 fps   4X:      03.01 fps   
Iris        1X:     38.49 fps   2X:     13.11 fps   4X:      03.67 fps   
Proteus     1X:     22.43 fps   2X:     10.55 fps   4X:      03.01 fps   
Gaia        1X:     15.91 fps   2X:     11.27 fps   4X:      03.28 fps   
Nyx         1X:     18.62 fps   
4X Slowmo   Apollo: 35.43 fps   APFast: 61.60 fps   Chronos: 32.73 fps   CHFast: 27.60 fps   

CPU clock: 4.8 GHz (locked, under-clocked, no turbo boost)

GPU clock: Stock

Memory

  • DIMMs: 4 x Corsair DDR5-6600 (CMK64GX5M2B6600C32)
  • DRAM Freq: 1.8 GHz (infinity fabric & memory controller clocked the same)
  • Clock Ratio: 18 x
  • Timings: CAS=30, RCD=30, RP=30, RAS=58, RC=88, RFC=530

Hardware utilization graph

In my case it seems the DRAM is the bottleneck.

It would be great if someone else who uses DDR5 with two DIMMs at the full rated memory speed could log their DRAM bandwidth utilization stats as well.