Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
pzo
7 months ago
|
parent
|
context
|
favorite
| on:
OpenAI charges by the minute, so speed up your aud...
but this is still great trick if you want to reduce latency or inference speed even with local models e.g. in realtime chatbot
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: