Hey all: how do you ensure text fits into an embedding model? You can't know a priori what tokenizer an embedding model uses, or even its input size! Or can you somehow? If I have some arbitrary string 'text' and I need to trim it so it fits into 'embed_model', what's the approach?
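
To make the question concrete, here is a minimal sketch of the kind of trimming being asked about. It is not tied to any real library. `count_tokens` is a hypothetical callable standing in for whatever tokenizer 'embed_model' uses, and `max_tokens` stands in for its input size, so both are assumed to be known somehow. The sketch also assumes the token count never goes down as the prefix gets longer, which is only approximately true for real subword tokenizers.

```python
from typing import Callable


def trim_to_token_limit(
    text: str,
    count_tokens: Callable[[str], int],  # hypothetical: wraps the model's tokenizer
    max_tokens: int,                     # hypothetical: the model's input size
) -> str:
    """Return the longest prefix of `text` with at most `max_tokens` tokens.

    Assumes count_tokens(prefix) does not decrease as the prefix grows
    and that the empty string always fits.
    """
    if count_tokens(text) <= max_tokens:
        return text  # already fits, nothing to trim

    # Binary search over character length.
    # Invariant: text[:lo] fits, text[:hi] does not.
    lo, hi = 0, len(text)
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if count_tokens(text[:mid]) <= max_tokens:
            lo = mid
        else:
            hi = mid
    return text[:lo]


def whitespace_count(s: str) -> int:
    """Toy stand-in tokenizer: one token per whitespace-separated word."""
    return len(s.split())


if __name__ == "__main__":
    trimmed = trim_to_token_limit("one two three four five", whitespace_count, 3)
    print(repr(trimmed))
```

The binary search only needs a way to count tokens, so it works even when the tokenizer cannot map tokens back to text. If the tokenizer can decode, encoding once, slicing the token list, and decoding would be cheaper. Either way, the open part of the question remains: where `count_tokens` and `max_tokens` come from for a given model.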