Video AI 5.1.X - User Benchmarking Results

Topaz Video AI  v5.1.4
System Information
OS: Windows v11.23
CPU: 12th Gen Intel(R) Core(TM) i5-12500  63.745 GB
GPU: NVIDIA GeForce RTX 4060  7.7705 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	10.86 fps 	2X: 	07.26 fps 	4X: 	02.30 fps 	
Iris		1X: 	10.71 fps 	2X: 	06.35 fps 	4X: 	01.77 fps 	
Proteus		1X: 	10.25 fps 	2X: 	07.22 fps 	4X: 	02.43 fps 	
Gaia		1X: 	03.41 fps 	2X: 	02.27 fps 	4X: 	01.54 fps 	
Nyx		1X: 	04.12 fps 	2X: 	03.57 fps 	
Nyx Fast		1X: 	07.54 fps 	
4X Slowmo		Apollo: 	15.24 fps 	APFast: 	40.58 fps 	Chronos: 	08.18 fps 	CHFast: 	12.55 fps 	
16X Slowmo		Aion: 	05.72 fps 	

Topaz Video AI  v5.1.4
System Information
OS: Windows v11.23
CPU: Intel(R) Xeon(R) w5-2465X  127.25 GB
GPU: NVIDIA GeForce RTX 4070  11.73 GB
GPU: NVIDIA RTX A4000  15.79 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	18.15 fps 	2X: 	12.64 fps 	4X: 	03.84 fps 	
Iris		1X: 	18.51 fps 	2X: 	11.33 fps 	4X: 	03.25 fps 	
Proteus		1X: 	17.80 fps 	2X: 	12.31 fps 	4X: 	04.23 fps 	
Gaia		1X: 	06.14 fps 	2X: 	04.24 fps 	4X: 	02.88 fps 	
Nyx		1X: 	07.22 fps 	2X: 	06.10 fps 	
Nyx Fast		1X: 	13.47 fps 	
4X Slowmo		Apollo: 	23.52 fps 	APFast: 	51.67 fps 	Chronos: 	13.10 fps 	CHFast: 	19.57 fps 	
16X Slowmo		Aion: 	ERR fps 	

Please open a new thread for the upcoming 5.2.0 version of Video AI. :slight_smile:

The difference between Windows, where the models have TensorRT, and Linux, where they do not, is notable. I hope TensorRT is a priority on the Linux development side. It is a necessary component for processing video in a reasonable amount of time.

Topaz Video AI Alpha  v5.0.3.2.a
System Information
OS: Linux v6.9
CPU: AMD Ryzen 9 7950X 16-Core Processor  61.933 GB
GPU: NVIDIA GeForce RTX 4090  23.988 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	54.07 fps 	2X: 	19.55 fps 	4X: 	04.93 fps 	
Iris		1X: 	20.46 fps 	2X: 	06.65 fps 	4X: 	01.64 fps 	
Proteus		1X: 	14.28 fps 	2X: 	06.59 fps 	4X: 	01.42 fps 	
Gaia		1X: 	08.91 fps 	2X: 	05.42 fps 	4X: 	03.41 fps 	
Nyx		1X: 	ERR fps 	2X: 	ERR fps 	
Nyx Fast		1X: 	22.26 fps 	
4X Slowmo		Apollo: 	27.04 fps 	APFast: 	75.49 fps 	Chronos: 	38.13 fps 	CHFast: 	21.29 fps 	
16X Slowmo		Aion: 	48.99 fps 	

Pretty disappointed with the speed on my Mac Studio 2022
Topaz Video AI v5.1.4
System Information
OS: Mac v14.05
CPU: Apple M1 Max 32 GB
GPU: Apple M1 Max 21.333 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 3840x2160
Benchmark Results
Artemis 1X: 01.69 fps 2X: 01.16 fps 4X: 00.49 fps
Iris 1X: 01.53 fps 2X: 00.81 fps 4X: 00.21 fps
Proteus 1X: 01.46 fps 2X: 01.08 fps 4X: 00.39 fps
Gaia 1X: 00.53 fps 2X: 00.38 fps 4X: 00.29 fps
Nyx 1X: 00.44 fps 2X: 00.43 fps
Nyx Fast 1X: 01.10 fps
4X Slowmo Apollo: 01.87 fps APFast: 07.54 fps Chronos: 00.65 fps CHFast: 00.96 fps
16X Slowmo Aion: 02.45 fps

Topaz Video AI  v5.1.4
System Information
OS: Windows v10.22
CPU: AMD Ryzen Threadripper 3970X 32-Core Processor   127.87 GB
GPU: NVIDIA GeForce RTX 3090  23.756 GB
Processing Settings
device: 0 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	25.83 fps 	2X: 	13.56 fps 	4X: 	03.81 fps 	
Iris		1X: 	23.24 fps 	2X: 	14.33 fps 	4X: 	03.92 fps 	
Proteus		1X: 	24.91 fps 	2X: 	15.08 fps 	4X: 	04.05 fps 	
Gaia		1X: 	08.67 fps 	2X: 	05.86 fps 	4X: 	03.35 fps 	
Nyx		1X: 	10.46 fps 	2X: 	08.65 fps 	
Nyx Fast		1X: 	20.31 fps 	
4X Slowmo		Apollo: 	33.31 fps 	APFast: 	69.17 fps 	Chronos: 	19.35 fps 	CHFast: 	27.34 fps 	
16X Slowmo		Aion: 	28.55 fps 	

So slow on the Mac mini :frowning:

Topaz Video AI  v5.1.2
System Information
OS: Mac v14.05
CPU: Apple M2 Pro  16 GB
GPU: Apple M2 Pro  10.667 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	07.96 fps 	2X: 	04.88 fps 	4X: 	01.53 fps 	
Iris		1X: 	10.20 fps 	2X: 	02.97 fps 	4X: 	00.89 fps 	
Proteus		1X: 	07.46 fps 	2X: 	05.35 fps 	4X: 	01.35 fps 	
Gaia		1X: 	02.51 fps 	2X: 	01.80 fps 	4X: 	01.32 fps 	
Nyx		1X: 	02.23 fps 	2X: 	02.26 fps 	
Nyx Fast		1X: 	05.90 fps 	
4X Slowmo		Apollo: 	08.50 fps 	APFast: 	33.20 fps 	Chronos: 	02.42 fps 	CHFast: 	03.94 fps 	
16X Slowmo		Aion: 	06.79 fps 	

Try the 1080 benchmark?

Topaz Video AI  v5.2.0
System Information
OS: Windows v11.23
CPU: 12th Gen Intel(R) Core(TM) i5-12500  63.745 GB
GPU: NVIDIA GeForce RTX 4060  7.7705 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	10.58 fps 	2X: 	07.36 fps 	4X: 	02.36 fps 	
Iris		1X: 	10.69 fps 	2X: 	06.36 fps 	4X: 	01.78 fps 	
Proteus		1X: 	10.41 fps 	2X: 	07.50 fps 	4X: 	02.42 fps 	
Gaia		1X: 	03.38 fps 	2X: 	02.28 fps 	4X: 	01.55 fps 	
Nyx		1X: 	04.13 fps 	2X: 	03.55 fps 	
Nyx Fast		1X: 	07.55 fps 	
Rhea		4X: 	01.36 fps 	
4X Slowmo		Apollo: 	15.44 fps 	APFast: 	39.98 fps 	Chronos: 	08.21 fps 	CHFast: 	12.57 fps 	
16X Slowmo		Aion: 	06.04 fps 	
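
For anyone who wants to compare two pasted results, such as the 5.1.4 and 5.2.0 runs on the same i5-12500 / RTX 4060 machine in this thread, a small script along these lines could parse the benchmark text and print the percent change per entry. This is only a rough sketch: `parse_results` and `percent_change` are made-up helper names, and the line format is assumed from the posts above, not from any official Topaz specification.

```python
import re

# Matches pairs like "1X: 10.86 fps" or "Apollo: 15.24 fps".
# Entries reported as "ERR fps" do not match and are skipped.
PAIR = re.compile(r"(\w+):\s+([\d.]+)\s*fps")
# The model name is whatever precedes the first "label:" token on the line.
MODEL = re.compile(r"\s*(.+?)\s+\w+:\s")


def parse_results(text):
    """Return {(model, label): fps} from pasted 'Benchmark Results' lines."""
    results = {}
    for line in text.splitlines():
        pairs = PAIR.findall(line)
        name = MODEL.match(line)
        if not pairs or not name:
            continue  # header lines, system info, or all-ERR rows
        for label, fps in pairs:
            results[(name.group(1), label)] = float(fps)
    return results


def percent_change(old, new):
    """Percent change in fps for every entry present in both result sets."""
    return {
        key: (new[key] - old[key]) / old[key] * 100.0
        for key in old
        if key in new and old[key] > 0
    }


# Two rows copied from the v5.1.4 and v5.2.0 posts for the same machine.
RUN_514 = """Artemis 1X: 10.86 fps 2X: 07.26 fps 4X: 02.30 fps
Proteus 1X: 10.25 fps 2X: 07.22 fps 4X: 02.43 fps"""
RUN_520 = """Artemis 1X: 10.58 fps 2X: 07.36 fps 4X: 02.36 fps
Proteus 1X: 10.41 fps 2X: 07.50 fps 4X: 02.42 fps"""

delta = percent_change(parse_results(RUN_514), parse_results(RUN_520))
for (model, label), change in delta.items():
    print(f"{model} {label}: {change:+.1f}%")
```

On these rows every change stays within a few percent (Artemis 1X comes out at about -2.6%), so for the models both versions share the two versions are hard to tell apart from a single run each.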

Before this thread gets closed, and with the latest mGPU release, I finally noticed these things:

  • The integrated benchmark does not help us a lot … Results can vary by up to 30% from one run to the next. The more GPUs you add, the more it varies. → Unreliable
  • The benchmark is too short; the GPUs don’t even ramp up. In real-world use we get about half of these results because of thermal management.
  • RTX 3070 PCIe bandwidth used is 1.35 to 1.8 GB/s, with an average of 1.45 GB/s. It fits in PCIe Gen3/4 x4 (edited)
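
To put the PCIe point in numbers, here is a quick back-of-the-envelope check. It is a sketch using the published per-lane transfer rates (8 GT/s for Gen3, 16 GT/s for Gen4, both with 128b/130b encoding); the function name is made up, and real-world throughput will be somewhat lower than these theoretical limits because of protocol overhead.

```python
# Per-lane transfer rate in GT/s; Gen3 and Gen4 both use 128b/130b encoding.
GT_PER_S = {3: 8.0, 4: 16.0}


def pcie_bandwidth_gbps(gen, lanes):
    """Theoretical one-direction bandwidth in GB/s, before protocol overhead."""
    bits_per_s = GT_PER_S[gen] * 1e9 * (128 / 130)
    return bits_per_s / 8 / 1e9 * lanes


peak_observed = 1.8  # GB/s, the peak figure reported above for one RTX 3070

for gen in (3, 4):
    limit = pcie_bandwidth_gbps(gen, 4)
    print(f"Gen{gen} x4: {limit:.2f} GB/s, peak uses {peak_observed / limit:.0%}")
# Gen3 x4: 3.94 GB/s, peak uses 46%
# Gen4 x4: 7.88 GB/s, peak uses 23%
```

So even the observed peak stays under half of a Gen3 x4 link, which matches the conclusion that x4 is enough for one of these cards.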

The benchmark curves are similar, but the results aren’t.

Average taken from the 4 best runs among dozens, with a light overclock on VRAM (+250)

Topaz Video AI  v5.1.3
System Information
OS: Windows v11.23
CPU: AMD Ryzen 9 7950X 16-Core Processor              47.712 GB
GPU: NVIDIA GeForce RTX 3070  7.8301 GB
GPU: NVIDIA GeForce RTX 3070  7.8301 GB
GPU: NVIDIA GeForce RTX 3070  7.8301 GB
Processing Settings
device: 3 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	29.39 fps 	2X: 	16.02 fps 	4X: 	04.53 fps 	
Iris		1X: 	36.40 fps 	2X: 	18.02 fps 	4X: 	05.70 fps 	
Proteus		1X: 	28.82 fps 	2X: 	20.00 fps 	4X: 	05.98 fps 	
Gaia		1X: 	13.74 fps 	2X: 	09.27 fps 	4X: 	05.49 fps 	
Nyx		1X: 	10.55 fps 	2X: 	13.24 fps 	
Nyx Fast		1X: 	14.98 fps 	
4X Slowmo		Apollo: 	25.19 fps 	APFast: 	58.26 fps 	Chronos: 	25.35 fps 	CHFast: 	20.52 fps 	
16X Slowmo		Aion: 	30.07 fps 	

What I’ve learnt

  • For some unknown reason, as soon as you have 2+ GPUs, the first one never loads to 100%
  • For some unknown reason, it scales well only up to 2 GPUs
  • For some unknown reason, it starts to deliver erratic results with 3+ GPUs: some models go up while others go down …

I have benchmarked and tortured every parameter (CPU, DDR5, PCIe lanes, cooling, and so on) … this led me to almost the same results as my 2990WX workstation, so …

This seems to be a VERY particular case. I'm going to reproduce it and verify whether it's true. I have just finished a 5-day 8K master with tons of noise.

Will go for a mixed 3000- and 4000-series setup as a last try, even if it is not recommended.

NOTE: The ASUS Prime X670-P is pure sh**: tons of useless parameters to tweak CPU and VDDR, but the MOSFETs are crap. There isn't more than one line for tweaking PCIe lanes … unbelievable.
(edited: X670/E boards all have very poor support for PCIe splitting; only slot 1 is tweakable, whatever the brand :frowning: I have to turn M.2 slots into PCIe4 x4 slots, so more money burned …)

Looks like you become more CPU-limited the more processes run simultaneously on your GPU. Even on my 13900K I see a noticeable loss in processing speed when the Intel Turbo Boost period ends.

Mhh, interesting. Looking at the cores, the CPU isn’t doing much … but I am going to push it.

To me there’s a bandwidth limitation somewhere, but it's hard to find … nothing I monitored was at its maximum.

Typical GPU/CPU load with Proteus

Raphael RPL-B2
Ampere GA104-300-A1

I added graphs with some information about PCIe load for those who want to split their PCIe lanes.

Screenshot 2024-08-20 064135

I would like to see some AMD 9xxx CPU benchmarks. Their gaming performance improvement is unimpressive, but the productivity benchmarks have looked promising. I want to know if TVAI is one of the programs that sees a meaningful improvement with the new chips.

I would have bought one, if they didn’t randomly decide they needed to implement core parking on the two options I might buy.

Going to do it as soon as my water chiller is ready; I have to design an AM5 mount for it.
That 7950X already generates a lot of heat if pushed over 180W.

The X670E ROG Crosshair has arrived, with 6000 MT/s 30-36-36-36 memory.
Will swap the 7950X for a 9950X without touching anything else, to see the CPU gain.

EDIT: Does anybody have accurate drawings of an AM5 socket? Screw spacing and so on.

I know it’s NOT 5.1.x, but the topic is closed and I wanted to give feedback on scaling. Waiting for Kyle’s reply and PCIe splitters.
Everything is overclocked, not finely tuned, and depends on the outside temperature :confused:
Best of 3 runs; still crap with x2 or x4, need CPU clocks to feed …

Topaz Video AI  v5.0.4
System Information
OS: Windows v11.23
CPU: AMD Ryzen 9 7950X 16-Core Processor              47.712 GB
GPU: NVIDIA GeForce RTX 4090  23.576 GB
GPU: NVIDIA GeForce RTX 3070  7.8301 GB
GPU: NVIDIA GeForce RTX 3070  7.8301 GB
Processing Settings
device: 3 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	43.91 fps 	2X: 	16.77 fps 	4X: 	04.53 fps 	
Iris		1X: 	53.12 fps 	2X: 	19.91 fps 	4X: 	05.43 fps 	
Proteus		1X: 	53.64 fps 	2X: 	20.22 fps 	4X: 	05.76 fps 	
Gaia		1X: 	28.82 fps 	2X: 	16.43 fps 	4X: 	05.74 fps 	
Nyx		1X: 	11.10 fps 	2X: 	16.59 fps 	
Nyx Fast		1X: 	44.33 fps 	
4X Slowmo		Apollo: 	29.39 fps 	APFast: 	73.99 fps 	Chronos: 	30.03 fps 	CHFast: 	32.86 fps 	
16X Slowmo		Aion: 	33.08 fps