pzo 7 months ago \| on: OpenAI charges by the minute, so speed up your aud...

But this is still a great trick if you want to reduce latency or inference time, even with local models, e.g. in a realtime chatbot.