Video AI v3.5.X - User Benchmarking Results

There you are:

Topaz Video AI  v3.5.0
System Information
OS: Mac v14
CPU: Apple M2 Ultra  64 GB
GPU: Apple M2 Ultra  48 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 3840x2160
Benchmark Results
Artemis		1X: 	03.50 fps 	2X: 	02.53 fps 	4X: 	00.27 fps 	
Iris		1X: 	02.65 fps 	2X: 	01.34 fps 	4X: 	00.31 fps 	
Proteus		1X: 	03.50 fps 	2X: 	02.36 fps 	4X: 	00.29 fps 	
Gaia		1X: 	01.08 fps 	2X: 	00.77 fps 	4X: 	00.35 fps 	
Nyx		1X: 	00.93 fps 	
4X Slowmo		Apollo: 	03.30 fps 	APFast: 	14.23 fps 	Chronos: 	01.28 fps 	CHFast: 	01.91 fps 	

Topaz Video AI v3.5.0
System Information

OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K 
MEM: 31.775 GB DDR5 6000 MHZ CL32-38-38 @ 1.35V XMP 3
GPU: NVIDIA GeForce RTX 4090 22.096 GB

MAX CPU WATT: 188.2
MAX GPU WATT: 568.1
MAX CPU TEMP: 100 °C
MAX GPU TEMP: 072 °C

Processing Settings
device: 0 vram: 1 instances: 0
Input Resolution: 3840x2160

Benchmark Results
Artemis		1X: 	09.70 fps 	2X: 	04.57 fps 	4X: 	ERR 	
Iris		1X: 	08.58 fps 	2X: 	04.30 fps 	4X: 	ERR 	
Proteus		1X: 	08.72 fps 	2X: 	04.09 fps 	4X: 	ERR 	
Gaia		1X: 	03.10 fps 	2X: 	02.29 fps 	4X: 	01.27 fps 	
Nyx		    1X: 	02.92 fps
 	
4X Slowmo		
Apollo: 	17.19 fps 	
APFast: 	27.98 fps 	
Chronos:    07.13 fps 	
CHFast: 	14.12 fps 	


That can't be repeated often enough, especially for upscaling (not so much slomo).

I will post benchmarks later, once some processing in 3.5.0 has finished, but I did just try running 2 instances of TVAI on a 1080p 2x upscale (Artemis High, 50p video). System: RTX 4080 with a 5800X CPU and 32 GB DDR4.

Result:
One instance: GPU avg 28%, CPU 100%, 5.2 fps.
Two instances: GPU avg 28%, CPU 100%, 2.5 to 2.6 fps.
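
To make the comparison concrete, here is a minimal sketch of the arithmetic. It assumes (my reading, not stated explicitly in the post) that 2.5 to 2.6 fps is the speed of each of the two instances while both run at the same time; the function name is just for illustration.

```python
# Speeds reported in the post above (fps).
single_instance_fps = 5.2
two_instance_fps_range = (2.5, 2.6)  # assumed to be per instance, both running at once


def aggregate_fps(per_instance_fps: float, instances: int) -> float:
    """Total frames per second across all parallel instances."""
    return per_instance_fps * instances


low = aggregate_fps(two_instance_fps_range[0], 2)
high = aggregate_fps(two_instance_fps_range[1], 2)

print(f"one instance : {single_instance_fps:.1f} fps total")
print(f"two instances: {low:.1f} to {high:.1f} fps total")
```

If that reading is right, the combined throughput of two instances is no higher than that of a single instance, which is what you would expect when the CPU is already at 100%.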

Now THAT's a CPU block for ya! There is so much talk about powerful GPUs, but really, for upscaling, people should think more about their CPU and matching memory, even with CPUs that were known to be fast in their day.

This seems to be true mostly for Windows systems.
On the Mac you don't get that high a CPU load, which is fine, because that way you can fully use the computer for "normal tasks" without noticing that there's an encode running in the background.

@jo.vo
@Imo

Thank you so much.

Topaz Video AI  v3.5.0
System Information
OS: Windows v11.22
CPU: Intel(R) Core(TM) i9-10900K CPU @ 3.70GHz  31.847 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	31.62 fps 	2X: 	14.79 fps 	4X: 	03.69 fps 	
Iris		1X: 	29.78 fps 	2X: 	13.56 fps 	4X: 	03.80 fps 	
Proteus		1X: 	25.58 fps 	2X: 	12.58 fps 	4X: 	03.60 fps 	
Gaia		1X: 	14.37 fps 	2X: 	10.25 fps 	4X: 	03.58 fps 	
Nyx		1X: 	16.50 fps 	
4X Slowmo		Apollo: 	34.78 fps 	APFast: 	63.38 fps 	Chronos: 	29.91 fps 	CHFast: 	30.20 fps 	

I just discovered this benchmark comparison and don't have the numbers for my 3080 Ti, but I know the wait times have stayed the same after upgrading to a 4090.

I don't understand what the problem is.
A CPU bottleneck, really? My numbers are slightly worse than those of a 7950X, and only the 13900K looks to be an improvement, based on the limited examples here.

Regardless, it feels like there should've been quite a difference in wait times going from a 3080 Ti to a 4090, not more or less the same… and those wait times got extended by 1 hour with the April or May update for some reason, by the way.

It's starting to look like bad optimization for Nvidia cards, or at least for the 30 & 40 series, especially since that update. Maybe others can confirm that their wait times with Artemis increased by 1 hour.

So that means the 4090 is hardly faster than the 3080 Ti?

Benchmarks:

Topaz Video AI v3.5.0
System Information
OS: Windows v10.22
CPU: AMD Ryzen 7 5800X 8-Core Processor 31.906 GB
GPU: NVIDIA GeForce RTX 4080 15.688 GB
Processing Settings
device: -2 vram: 0.95 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis 1X: 23.83 fps 2X: 12.31 fps 4X: 02.39 fps
Iris 1X: 31.97 fps 2X: 13.33 fps 4X: 03.35 fps
Proteus 1X: 18.93 fps 2X: 07.88 fps 4X: 02.27 fps
Gaia 1X: 10.88 fps 2X: 07.44 fps 4X: 03.10 fps
Nyx 1X: 12.93 fps
4X Slowmo Apollo: 32.25 fps APFast: 53.98 fps Chronos: 24.53 fps CHFast: 26.18 fps

Bottlenecking aside, note the fps difference between my real-world 2x Artemis upscale and the benchmark: 5.2 fps real world, 12.31 fps benchmark.

When I upscale just a few frames, however, the "Outputs" report 5.2 fps for 10 minutes of 1080p and also for 25 frames, 7 fps for 12 frames, 8.6 fps for 8 frames and… 15 fps for 5 frames.

Benchmark issue - right there?
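
One way to look at those short-run numbers is to turn them back into elapsed time. The sketch below assumes the reported fps is simply frames divided by wall-clock time, which the thread does not confirm; the helper name is illustrative only.

```python
# (frames processed, fps reported) pairs quoted in the post above.
samples = [(5, 15.0), (8, 8.6), (12, 7.0), (25, 5.2)]


def elapsed_seconds(frames, fps):
    """Wall-clock time implied by a frame count and an average fps."""
    return frames / fps


times = [(frames, elapsed_seconds(frames, fps)) for frames, fps in samples]

# Extra seconds spent per extra frame between consecutive samples.
marginal = [
    (t2 - t1) / (n2 - n1)
    for (n1, t1), (n2, t2) in zip(times, times[1:])
]

for frames, t in times:
    print(f"{frames:>2} frames -> {t:.2f} s")
for step in marginal:
    print(f"marginal cost: {step:.3f} s/frame ({1 / step:.1f} fps)")
```

Under that assumption, each additional frame costs roughly 0.2 s (about 4 to 5 fps), which is close to the 5.2 fps long-run figure. So the very short runs may just be too short to show steady-state speed.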

If that's true, it means TVAI isn't using all available Tensor & RT cores and it is indeed an optimization problem.
The fact that the CPU is at 100% utilization while GPUs are barely tapped for their potential should've been the first sign, but I saw a lot of people say it's normal, so I didn't question it.

3080 Ti vs 4090
Tensor Cores 320 vs Tensor Cores 512
RT Cores 80 vs RT Cores 128

The above is about relatively small CPUs; when using TVAI or PhotoAI I see spikes of up to 20 cores in use on my Threadripper.

CPUs certainly become the limit, depending on the resolution of the videos.

Your numbers seem quite low compared to others with a latest-gen Intel CPU and the same GPU, so I'd say this is due to a CPU limitation.

Hi everyone. I'm asking as a TVAI user on Mac. I would like to switch to Apple Silicon M2. Do you recommend the MacBook Pro M2 Max (12/38 cores) or the Mac Studio M2 Ultra (24/76 cores)?

Looking at your tests, I didn't find much difference in performance.

Would it be possible to see the processing differences in fps for an extreme encode, from 1080 25p to 8K 50p?

Many thanks.

With very extreme encodes the speed differences get smaller across all hardware.
When doing 4K => 8K upscaling, even an Nvidia 4090 is only somewhat faster than other GPUs, because in such scenarios RAM and disk speed as well as the CPU matter more and more.

So you'll benefit the most from a really fast GPU with SD/HD sources and 2x upscales.
See the benchmarks.

Topaz Video AI  v3.5.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i7-13700K  31.808 GB
GPU: NVIDIA GeForce GTX 1080  7.8838 GB
GPU: Intel(R) UHD Graphics 770  0.125 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	02.73 fps 	2X: 	01.99 fps 	4X: 	00.89 fps 	
Iris		1X: 	03.13 fps 	2X: 	01.84 fps 	4X: 	00.62 fps 	
Proteus		1X: 	02.74 fps 	2X: 	01.98 fps 	4X: 	00.88 fps 	
Gaia		1X: 	01.10 fps 	2X: 	00.16 fps 	4X: 	00.06 fps 	
Nyx		1X: 	00.48 fps 	
4X Slowmo		Apollo: 	03.64 fps 	APFast: 	16.21 fps 	Chronos: 	01.80 fps 	CHFast: 	02.88 fps 	

Topaz Video AI v3.5.1.0.b
System Information

OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i9-13900K 
MEM: 31.775 GB DDR5 6000 MHZ CL32-38-38 @ 1.35V XMP 3
GPU: NVIDIA GeForce RTX 4090 ASUS ROG STRIX OC 22.096 GB

MAX CPU WATT: 170.4
MAX GPU WATT: 462.6
MAX CPU TEMP: 95 °C
MAX GPU TEMP: 64 °C

Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080

Benchmark Results
Artemis		1X: 	44.81 fps 	2X: 	19.09 fps 	4X: 	05.49 fps 	
Iris		1X: 	42.81 fps 	2X: 	19.82 fps 	4X: 	05.52 fps	
Proteus		1X: 	42.21 fps 	2X: 	17.23 fps 	4X: 	05.38 fps	
Gaia		1X: 	14.54 fps 	2X: 	10.82 fps 	4X: 	05.27 fps 	
Nyx	    	1X: 	17.70 fps
 	
4X Slowmo		
Apollo: 	43.74 fps 	
APFast: 	84.01 fps 	
Chronos: 	33.47 fps 	
CHFast: 	36.38 fps 	

Topaz Video AI  v3.5.0
System Information
OS: Windows v10.21
CPU: AMD Ryzen 9 7950X 16-Core Processor              127.74 GB
GPU: NVIDIA GeForce RTX 4090  23.59 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	39.59 fps 	2X: 	14.28 fps 	4X: 	03.89 fps 	
Iris		1X: 	39.85 fps 	2X: 	17.76 fps 	4X: 	04.99 fps 	
Proteus		1X: 	31.83 fps 	2X: 	13.31 fps 	4X: 	03.75 fps 	
Gaia		1X: 	16.01 fps 	2X: 	10.99 fps 	4X: 	04.56 fps 	
Nyx		1X: 	17.84 fps 	
4X Slowmo		Apollo: 	45.85 fps 	APFast: 	81.76 fps 	Chronos: 	33.83 fps 	CHFast: 	37.26 fps 	
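
For anyone who wants to compare the three RTX 4090 posts at 1920x1080 side by side, here is a small sketch that parses the Artemis lines and prints each system relative to the i9-10900K one. The regular expression and function names are my own assumptions about the pasted text format, not anything TVAI provides. Also note that the 13900K run was on v3.5.1.0.b while the other two were on v3.5.0, so it is not a strict like-for-like comparison.

```python
import re

# Artemis lines copied from the three RTX 4090 posts in this thread
# (all at 1920x1080 input). The 13900K run used v3.5.1.0.b, the others v3.5.0.
ARTEMIS_LINES = {
    "i9-10900K": "Artemis 1X: 31.62 fps 2X: 14.79 fps 4X: 03.69 fps",
    "i9-13900K": "Artemis 1X: 44.81 fps 2X: 19.09 fps 4X: 05.49 fps",
    "Ryzen 9 7950X": "Artemis 1X: 39.59 fps 2X: 14.28 fps 4X: 03.89 fps",
}

# Matches "<label>: <number> fps"; entries reported as ERR are simply skipped.
SCORE = re.compile(r"(\w+):\s*([\d.]+)\s*fps")


def parse_result_line(line):
    """Return (model_name, {scale_label: fps}) for one benchmark line."""
    model = line.split()[0]
    scores = {label: float(value) for label, value in SCORE.findall(line)}
    return model, scores


def relative_to(baseline, other):
    """Ratio other/baseline for every label present in both results."""
    return {k: other[k] / baseline[k] for k in baseline if k in other}


parsed = {cpu: parse_result_line(line)[1] for cpu, line in ARTEMIS_LINES.items()}
baseline = parsed["i9-10900K"]
for cpu, scores in parsed.items():
    ratios = relative_to(baseline, scores)
    summary = ", ".join(f"{k}: {v:.2f}x" for k, v in ratios.items())
    print(f"{cpu:>14}  {summary}")
```

With the numbers posted here, the 13900K system comes out roughly 1.3x to 1.5x ahead of the 10900K system with the same GPU, while the 7950X system lands between about 0.97x and 1.25x. That fits the CPU-limitation explanation, though three posts are a small sample.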

Topaz Video AI  v3.5.0
System Information
OS: Windows v11.22
CPU: 12th Gen Intel(R) Core(TM) i9-12900KF  127.78 GB
GPU: NVIDIA GeForce RTX 4060 Ti  15.745 GB
Processing Settings
device: 0 vram: 0.95 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	14.38 fps 	2X: 	09.20 fps 	4X: 	02.81 fps 	
Iris		1X: 	15.14 fps 	2X: 	07.56 fps 	4X: 	02.16 fps 	
Proteus		1X: 	12.41 fps 	2X: 	08.86 fps 	4X: 	02.98 fps 	
Gaia		1X: 	04.48 fps 	2X: 	03.10 fps 	4X: 	02.06 fps 	
Nyx		1X: 	05.38 fps 	
4X Slowmo		Apollo: 	18.00 fps 	APFast: 	50.83 fps 	Chronos: 	10.55 fps 	CHFast: 	16.22 fps 	

Topaz Video AI  v3.5.0
System Information
OS: Windows v11.22
CPU: 13th Gen Intel(R) Core(TM) i5-13400  31.772 GB
GPU: Intel(R) Arc(TM) A750 Graphics  7.9063 GB
GPU: Intel(R) UHD Graphics 730  0.125 GB
Processing Settings
device: 0 vram: 1 instances: 1
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	10.19 fps 	2X: 	05.62 fps 	4X: 	01.71 fps 	
Iris		1X: 	07.85 fps 	2X: 	04.97 fps 	4X: 	01.63 fps 	
Proteus		1X: 	09.07 fps 	2X: 	04.64 fps 	4X: 	01.51 fps 	
Gaia		1X: 	04.48 fps 	2X: 	03.09 fps 	4X: 	02.24 fps 	
Nyx		1X: 	03.16 fps 	
4X Slowmo		Apollo: 	07.01 fps 	APFast: 	20.85 fps 	Chronos: 	05.77 fps 	CHFast: 	09.50 fps 	

Topaz Video AI  v3.5.1
System Information
OS: Windows v10.22
CPU: AMD Ryzen 9 3900X 12-Core Processor              31.914 GB
GPU: AMD Radeon RX 6800 XT  15.955 GB
Processing Settings
device: -2 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	11.41 fps 	2X: 	07.07 fps 	4X: 	02.15 fps 	
Iris		1X: 	12.87 fps 	2X: 	07.52 fps 	4X: 	02.42 fps 	
Proteus		1X: 	10.88 fps 	2X: 	06.74 fps 	4X: 	02.00 fps 	
Gaia		1X: 	05.89 fps 	2X: 	03.89 fps 	4X: 	01.87 fps 	
Nyx		1X: 	05.19 fps 	
4X Slowmo		Apollo: 	11.98 fps 	APFast: 	38.05 fps 	Chronos: 	08.21 fps 	CHFast: 	12.88 fps 	

Topaz Video AI  v3.5.1
System Information
OS: Windows v11.22
CPU: Intel(R) Xeon(R) CPU E5-2697 v4 @ 2.30GHz  63.909 GB (18 Cores)
GPU: NVIDIA GeForce RTX 2080 Ti  10.782 GB
Processing Settings
device: 0 vram: 1 instances: 0
Input Resolution: 1920x1080
Benchmark Results
Artemis		1X: 	ERR fps 	2X: 	10.41 fps 	4X: 	02.28 fps 	
Iris		1X: 	14.56 fps 	2X: 	08.56 fps 	4X: 	02.88 fps 	
Proteus		1X: 	14.65 fps 	2X: 	09.47 fps 	4X: 	02.70 fps 	
Gaia		1X: 	04.81 fps 	2X: 	03.22 fps 	4X: 	02.26 fps 	
Nyx		1X: 	06.07 fps 	
4X Slowmo		Apollo: 	06.42 fps 	APFast: 	40.28 fps 	Chronos: 	10.56 fps 	CHFast: 	13.50 fps