The post indicates that the community member thinks the answer is "no" but is just checking. The comments discuss whether the current solutions are CPU-bound, with one community member suggesting they are not all CPU-bound and that there is a lot of I/O with large language models or re-ranker models over HTTP. Another community member suggests that adding async to the base class and ensuring async is used when needed could be a simple solution. There is no explicitly marked answer.