Low Performance on RTX 4080 SUPER

I’ve searched for all RTX 4080 SUPER benchmarks and I’m consistently getting lower than the rest of the community. I’ve had v4.0.9 for a long time and I thought upgrading to the latest version potentially improve the performance but the performance got worse with the latest.

I’m posting v5.5.0 and v4.0.9 from my benchmark. In non-benchmark real life scenarios, the performance is slow in general.

Topaz Video AI  v5.5.0
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.901 GB
GPU: NVIDIA GeForce RTX 4080 SUPER  15.671 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	16.49 fps 	2X: 	07.62 fps 	4X: 	02.17 fps 	
Iris		1X: 	25.81 fps 	2X: 	09.23 fps 	4X: 	02.57 fps 	
Proteus		1X: 	22.37 fps 	2X: 	09.79 fps 	4X: 	02.62 fps 	
Gaia		1X: 	10.53 fps 	2X: 	07.01 fps 	4X: 	02.18 fps 	
Nyx		1X: 	13.11 fps 	2X: 	09.56 fps 	
Nyx Fast		1X: 	23.15 fps 	
Rhea		4X: 	02.08 fps 	
4X Slowmo		Apollo: 	21.59 fps 	APFast: 	34.75 fps 	Chronos: 	21.68 fps 	CHFast: 	18.52 fps 	
16X Slowmo		Aion: 	27.65 fps
Topaz Video AI  v4.0.9
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.901 GB
GPU: NVIDIA GeForce RTX 4080 SUPER  15.671 GB
Processing Settings
device: 0 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	22.76 fps 	2X: 	09.62 fps 	4X: 	02.41 fps 	
Iris		1X: 	28.24 fps 	2X: 	09.99 fps 	4X: 	02.73 fps 	
Proteus		1X: 	15.61 fps 	2X: 	08.30 fps 	4X: 	02.47 fps 	
Gaia		1X: 	10.78 fps 	2X: 	07.11 fps 	4X: 	02.28 fps 	
Nyx		1X: 	13.36 fps 	2X: 	10.21 fps 	
4X Slowmo		Apollo: 	26.03 fps 	APFast: 	36.29 fps 	Chronos: 	22.92 fps 	CHFast: 	20.27 fps 	

I looked at another Ryzen 9 5900X user with 3080Ti from a year ago and he got better performance than I do.

What are some ways for me to debug my system to improve my benchmark? Thanks.

1 Like

Sounds like my computer. I just ran it now. Not going to update anytime soon.

Topaz Video AI  v4.2.2
System Information
OS: Windows v11.23
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.922 GB
GPU: NVIDIA GeForce RTX 3080 Ti  11.816 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	22.61 fps 	2X: 	13.14 fps 	4X: 	03.19 fps 	
Iris		1X: 	21.44 fps 	2X: 	13.47 fps 	4X: 	04.00 fps 	
Proteus		1X: 	21.92 fps 	2X: 	14.43 fps 	4X: 	04.19 fps 	
Gaia		1X: 	08.00 fps 	2X: 	05.47 fps 	4X: 	03.52 fps 	
Nyx		1X: 	09.67 fps 	2X: 	07.93 fps 	
Nyx Fast		1X: 	18.75 fps 	
4X Slowmo		Apollo: 	31.42 fps 	APFast: 	54.01 fps 	Chronos: 	17.95 fps 	CHFast: 	27.01 fps 	
16X Slowmo		Aion: 	35.44 fps 	

The only thing I can think of is RAM. I got some that runs at 3600Mhz. That’s supposed to be the best for this CPU. If you have that speed of RAM, check if it’s enabled with the Task Manager.
image

I suppose I am using a year-old driver: 537.58. It’s the last driver they made that actually turns off the monitor after inactivity if you have the refresh rate set higher than 60Hz.

1 Like

My memory speed is currently at 2133. I switched out my RAM modules with DDR4-4000 few weeks ago but it made no difference so I returned them. I’m currently on 560.94 Nvidia driver.

I currently have a super ultrawide monitor at 120Hz. Maybe that could be it?

Did you enable the RAM speed in your BIOS? It’s always at the default low speed if you don’t.
The monitor makes no difference. Mine’s at 244Hz and I have it set to that for all of Windows, not just full screen applications.

Oh wow, that already improved the numbers from changing the RAM speed in my BIOS. It was set as Auto. 20-30% improvement. The RAM speed was set as Auto so I changed to 3000. Here are the new numbers:

Topaz Video AI  v5.5.0
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.901 GB
GPU: NVIDIA GeForce RTX 4080 SUPER  15.671 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	19.91 fps 	2X: 	10.66 fps 	4X: 	02.50 fps 	
Iris		1X: 	26.46 fps 	2X: 	14.18 fps 	4X: 	03.12 fps 	
Proteus		1X: 	31.55 fps 	2X: 	12.34 fps 	4X: 	03.24 fps 	
Gaia		1X: 	10.95 fps 	2X: 	07.63 fps 	4X: 	02.89 fps 	
Nyx		1X: 	13.00 fps 	2X: 	10.68 fps 	
Nyx Fast		1X: 	24.14 fps 	
Rhea		4X: 	02.79 fps 	
4X Slowmo		Apollo: 	30.63 fps 	APFast: 	48.10 fps 	Chronos: 	23.83 fps 	CHFast: 	25.34 fps 	
16X Slowmo		Aion: 	36.27 fps 	

I think I’ll get the DDR4-4000 RAM again and try again. Thanks!

Nice. I’m wondering if you’d get even more speed by reverting back to TVAI 4.
Looks like you’re on the right track now.

Hello.

Ugh!

You are just hurting me… ha-ha… by starving that RTX 4080 SUPER!

Before you pull-the-trigger on just adding more RAM… check out all the great deals this Black Friday sales (that are active now at many outlets ) on a new MB chipset that could easily give you an additional ~80%+ performance jump easy.

1 Like

I just upgraded the RAMs to DDR4-3600 and set the memory speed to be correct on BIOS. The performance didn’t improve but rather gotten worse.

Topaz Video AI  v5.5.0
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.901 GB
GPU: NVIDIA GeForce RTX 4080 SUPER  15.671 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	13.68 fps 	2X: 	06.59 fps 	4X: 	01.82 fps 	
Iris		1X: 	22.39 fps 	2X: 	08.48 fps 	4X: 	02.12 fps 	
Proteus		1X: 	21.24 fps 	2X: 	08.25 fps 	4X: 	02.19 fps 	
Gaia		1X: 	10.42 fps 	2X: 	05.58 fps 	4X: 	01.90 fps 	
Nyx		1X: 	13.04 fps 	2X: 	08.89 fps 	
Nyx Fast		1X: 	23.67 fps 	
Rhea		4X: 	02.03 fps 	
4X Slowmo		Apollo: 	21.94 fps 	APFast: 	30.59 fps 	Chronos: 	19.13 fps 	CHFast: 	19.35 fps 	
16X Slowmo		Aion: 	25.05 fps 	

image

I don’t understand how it could get worse. One difference is that I have 2 RAM (2x16GB) modules instead of 4 RAM (4x8GB) modules as before, but I don’t think that should make a difference.

Any ideas?

Did you populate the slots correctly for dual channel operation?

1 Like

Noob mistake. I set the RAMs to be slots 1 and 2. I changed to 2 and 4 now. I’ve restored my old performance but the difference between 3000HZ and 3600HZ isn’t that significant.

Topaz Video AI  v5.5.0
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 5900X 12-Core Processor              31.901 GB
GPU: NVIDIA GeForce RTX 4080 SUPER  15.671 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	23.13 fps 	2X: 	09.32 fps 	4X: 	02.56 fps 	
Iris		1X: 	28.05 fps 	2X: 	12.82 fps 	4X: 	03.23 fps 	
Proteus		1X: 	30.96 fps 	2X: 	08.19 fps 	4X: 	03.38 fps 	
Gaia		1X: 	11.09 fps 	2X: 	07.54 fps 	4X: 	03.17 fps 	
Nyx		1X: 	13.18 fps 	2X: 	10.90 fps 	
Nyx Fast		1X: 	23.66 fps 	
Rhea		4X: 	02.85 fps 	
4X Slowmo		Apollo: 	30.33 fps 	APFast: 	49.16 fps 	Chronos: 	23.44 fps 	CHFast: 	26.08 fps 	
16X Slowmo		Aion: 	36.93 fps 	

1 Like

It’s too late now and not worth buying other RAM sticks, but the timings on the RAM can make an impact. Generally the smaller the timing numbers, the faster the RAM.
So for example, I have some DDR3 sitting on my desk. It’s at 1333Mhz with timings CL9-9-9-24. If I were to swap it with 2666Mhz RAM but with timings like CL20-20-20-60, it would be slower.

And there’s also the matter of if your CPU memory controller can run at the speeds and timings the RAM is advertised at. There are CPUs that cannot.

So for example, I have some DDR3 sitting on my desk. It’s at 1333Mhz with timings CL9-9-9-24. If I were to swap it with 2666Mhz RAM but with timings like CL20-20-20-60, it would be slower.

While yes your example does show higher latency on the faster memory, that does not necessarily mean the PC will perform slower.

As a general rule every time you double the speed you will double the timings too, as they are a relative measure against the RAM speed. So 1333Mhz CL9-9-9-24 would be identical to 2666Mhz CL18-18-18-48, except the latter has double the bandwidth. But even if the timings or more than double, if its faster or slower depends on what your PC is doing, its never as simple as “faster RAM with worse timings are bad”, although I would expect Topaz to be latency sensitive.

Everyone is surprised about slower render times.

Before i purchased a TVAi licence i ran some ai rendering using real-ESRGAN, an open source image and video upscale/enhance model. It’s not been updated for some 3 years now, but the different AI models, using different complex algorithms scale with the file size of the model at hand. real-ESRGAN has 3 pre-trained AI-models. 1 for “real-life” enhancements focused on images, one for anime upscaling for images, and one for anime upscaling used for image frames extracted from a video. The two models focused on enhancing a single image are 15x as big in file size compared to the model made for videos. As frames extracted from a video can easily be more than 100 000 images depending on length and framerate, running either of the more refined models for single image upscaling/enhancement would take days to render all the images extracted from a video.

Render times between the image focused models and the video focused model would increase the time to enhance all frames by x5-10. The smaller video ai model took about 8h to enhance a 1h long 1080p video at 30fps. Running the image focused models would take well over several days to render all images extracted.

The more trained an AI model gets, the larger it becomes both in file size and knowledge.

I suspect that is the reason to slower render rates. You are essentially deciding to stay with an older version because it’s faster at rendering, but it will yield worse results.

That’s not what they sound like they are reporting though. Most people sound like it’s the same version of the AI model that needlessly runs slower in the newer version of TVAI. If the reports are true, the only reason for this I can think of is the new TVAI UI is now forcing 4X upscaling where the older versions use 2X.

Could easily be found out. The AI models in the folder for the 5.5 version range from 4.5Mb to 322Mb.

What are the sizes of the models in the older versions?

Edit* I’ll attach a screenshot of all the models which are used in the 5.5.0 version.

image