Our second pre-release for today is an alpha with support for optimized models running on RTX 5000 series GPUs from NVIDIA.
For this first alpha we’d like to point out that RTX 5000 series performance is significantly improved over the current main channel release, but we are still seeing intermittent performance loss with older GPUs.
We appreciate any testing (RTX 5000 or any other GPU), but just know that performance may be reduced compared to main 6.1
nice, i got decent uplift in Artemis, Iris and Proteus 1X performance from 5090 over 4090. Although I see some 10% run to run differences in 1X results (not really in 2X/4X). Maybe something to improve in the benchmark.
2x and 4x seem to be limited by RAM/CPU speed.
Nyx fast, Chronos/Chronos Fast and Aion also see good uplift.
Most other models seem to be about the same.
Gains seem to be in-line with the general performance difference of 20-30% from 4090 to 5090.
How the heck did you (the team) manage this feat tony?
I see you’re still using tensorrt: 10.8.0.99, but apparently the blackwell perf has improved
Color me impressed.
My only theory is that you’re leveraging the massive bandwidth advantage of the 50 cards better with this release, such as using larger tile sizes or packing more tiles, since the cross-link I/O to the card is a performance killer for “chatty” workloads.
Don’t have a 50 card unfortunately, so can’t do empirical studies…
On my 3080 ti, all models are about the same speed except: Nyx, APfast, and Aion. They are slower by a noticeable amount in the benchmark.
Some models look to be a tiny bit faster, but I have not had a chance to try them out on any real videos yet.
The biggest design change from Ada Lovelace to Blackwell is that int32 precision is now as fast as fp32 precision.
And that the GPU is able to handle all in parallel, with AI cores.
B4 Blackwell int precisions where 50% slower.
That means if int precisions where made to perform nice with pre Backwell gpus era (Turing, Ampere & Lovelace) and you change that now the older gpus will be much slower.
Trying this version for fun to see how much faster it is for my 5090. It’s quite a bit faster at the start for a while and then the drops to go back up at the end.
The VFR bug is still there but seems to pop up when there’s decibels in the framerate, I’ve done several 25 fps → 50 fps conversions without a hitch(constant fps reported). 29.97 conversions still report variable fps.
I have been using it for a while now and it works great on my RTX 5090, thank you! The performance is no longer worse than it used to be on the 4090, much better in fact.
Only just found out about this Alpha version and was disappointed that rendering was slower on the 5090 than the 4090 on my existing TVAI install.
So, after I installed this Alpha, the one test I’ve just tried seems MUCH better
720x576 SD RHEA x2 upscale
Movie runtime: 20 mins, 20 secs
4090 render time (STANDARD TVAI VERSION): 24.3fps
5090 render time (STANDARD TVAI VERSION): 22fps
5090 render time (ALPHA 6.1.1.1 TVAI VERSION): (32fps)
No slowdown during render and maxed out GPU to 100%
I built a new PC using the same 5090 and I’ve gone from a Intel 13900KS (on the previous above tests) to an AMD Ryzen 9 9950X3D now and re-ran the same tests.
The first 720x576 SD RHEA x2 upscale test showed a 2 fps drop to 30fps.
The second 1920x720 HD File - PROTEUS test showed a decent increase to 23fps.