Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
logicchains
11 months ago
|
parent
|
context
|
favorite
| on:
DeepSeek-R1: Incentivizing Reasoning Capability in...
The cost, as expressed in the DeepSeek V3 paper, was expressed in terms of training hours based on the market rate per hour if they'd rented the 2k GPUs they used.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: