Factors influencing the performance of models in VEAI.
Gaia runs entirely on the GPU, so CPU performance has very little impact.
Artemis and Theia use both the GPU (80-90%) and the CPU (10-20%), so their performance depends much more on the CPU.
All decoding/encoding is done on the CPU, so an older CPU will lead to slower performance unless you use older codecs or uncompressed image formats like TIF.
PCIe bandwidth is very important for higher-resolution upscaling. With a PCIe 4 compatible GPU, performance will vary a lot between a 3rd-gen or newer AMD processor that supports PCIe 4 and any AMD/Intel processor or motherboard that does not.
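To get a feel for why the PCIe generation matters at high resolutions, here is a rough back-of-the-envelope sketch. The bandwidth figures are the approximate theoretical x16 rates for PCIe 3.0 and 4.0. The frame format (FP32, 3 channels) and the function names are assumptions for illustration only, not how VEAI actually moves data.

```python
# Approximate theoretical bandwidth of an x16 slot in GB/s, per direction.
# PCIe 3.0 runs at 8 GT/s per lane and PCIe 4.0 at 16 GT/s per lane.
PCIE_X16_GBPS = {3: 15.75, 4: 31.5}


def frame_bytes(width, height, channels=3, bytes_per_value=4):
    """Size of one uncompressed frame in bytes.

    FP32 RGB is an assumption for illustration, not VEAI's documented format.
    """
    return width * height * channels * bytes_per_value


def transfer_ms(num_bytes, pcie_gen):
    """Ideal time in milliseconds to move num_bytes across an x16 link."""
    return num_bytes / (PCIE_X16_GBPS[pcie_gen] * 1e9) * 1000


if __name__ == "__main__":
    size = frame_bytes(7680, 4320)  # one 8K frame
    for gen in (3, 4):
        print(f"PCIe {gen}.0 x16: {transfer_ms(size, gen):.2f} ms per 8K frame")
```

Real transfers will be slower than these ideal numbers, but the ratio between the two generations is the point: the same frame takes about twice as long to cross a PCIe 3 link.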
Always make sure your power supply is adequate to power both the CPU and GPU at peak load.
We try to use as many CPU cores as possible, but most of the code is optimized for 4-8 core processors, so more than 8 cores may not increase performance as expected. This will be addressed in the future.
Similarly, RAM usage is optimized for 16-32GB, and VEAI will not take advantage of more memory than that.
GPU VRAM optimizations currently assume a maximum of around 12GB of VRAM, so if your card has more than 12GB, run multiple instances on the same GPU for better throughput.
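The core, RAM and VRAM ranges above can be turned into a quick sanity check for your own machine. This is only a sketch: the function name and the wording of the notes are made up for illustration, and you have to fill in the numbers for your system yourself.

```python
def tuning_notes(cores, ram_gb, vram_gb):
    """Return notes for hardware that exceeds the ranges VEAI is tuned for.

    The thresholds (8 cores, 32GB RAM, 12GB VRAM) come from the post above.
    The function itself is a hypothetical helper, not part of VEAI.
    """
    notes = []
    if cores > 8:
        notes.append("More than 8 cores: extra cores may not add much speed.")
    if ram_gb > 32:
        notes.append("More than 32GB RAM: the extra memory will not be used.")
    if vram_gb > 12:
        notes.append("More than 12GB VRAM: try multiple instances on this GPU.")
    return notes


if __name__ == "__main__":
    # Example values for a 16-core CPU, 64GB RAM and a 24GB card.
    for note in tuning_notes(cores=16, ram_gb=64, vram_gb=24):
        print(note)
```

An empty list means the machine sits inside the ranges the code is currently optimized for.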
Disk speed also matters, especially when reading or writing large uncompressed image/video files, but in most other cases it will have minimal impact.
For NVIDIA only:
Starting with version 1.7, VEAI no longer uses CUDA for processing and uses DirectML instead, so the actual GPU usage in Task Manager will show up under Graphics_1.
GTX performance is about 10-20% lower than in 1.6.1 and earlier versions due to a driver/library issue, which will be fixed in the near future.
RTX, with its FP16 tensor core support, performs better (2-3x) than CUDA via TensorFlow. While the entire model is able to take advantage of FP16 processing, certain layers in the models can also use the hardware-level tensor core kernels to get up to a 20x speed-up. In VEAI these layers are very few, so the overall performance improvement will still be in the 2-3x range.
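The reason a 20x speed-up on a few layers barely moves the overall number is plain Amdahl's-law arithmetic. Here is a minimal sketch. The 2.5x base figure is just the middle of the 2-3x range above, and the fractions in the loop are hypothetical, since the actual share of runtime spent in those layers is not something measured here.

```python
def overall_speedup(fraction_fast, base_speedup=2.5, fast_speedup=20.0):
    """Combined speed-up when part of the runtime gets a bigger boost.

    fraction_fast is the share of the original runtime spent in layers that
    hit the tensor core kernels (hypothetical values, not measured in VEAI).
    The remaining runtime gets base_speedup from general FP16 processing.
    """
    if not 0.0 <= fraction_fast <= 1.0:
        raise ValueError("fraction_fast must be between 0 and 1")
    new_time = (1.0 - fraction_fast) / base_speedup + fraction_fast / fast_speedup
    return 1.0 / new_time


if __name__ == "__main__":
    for fraction in (0.0, 0.05, 0.10, 0.20):
        print(f"{fraction:.0%} of runtime at 20x -> {overall_speedup(fraction):.2f}x overall")
```

As long as only a small share of the runtime sits in the 20x layers, the total stays in the 2-3x range.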
RTX 3000 series performance is still being analyzed and optimized. It will take some months before superior performance can be achieved on the higher-core-count 3080 and 3090 GPUs.
If you have further questions, post them here.