Hacker News | schneehertz's comments

This price is high partly because of the current shortage of inference cards available to DeepSeek; they claimed in their press release that once the Ascend 950 compute cards launch in the second half of the year, the price of the Pro version will drop significantly.

In six months DeepSeek won't be SOTA anymore and usage will be way down.

Only comparing on SOTA scores (ignoring price etc.) is like choosing your daily-driver by looking at who makes the fastest sports-car...

The constant improvement of SOTA is the main thing keeping the investment machine running. We can't really separate training costs from inference costs, because much of the funding and loans for the inference hardware only exist because of the promises that continuous training (tries to) deliver.

Not really. SOTA vs non SOTA is "can I get my coding work actually done today" vs. "this can do customer support chat"

It is like car vs. kick scooter.


It really isn't. We get coding work actually done today on Opus 4.5. That's not SOTA any more, and anything proximate to that level, even quite loosely, is genuinely useful.

OK, if we agree that Opus 4.5 is not SOTA, then by that definition... yes, you are right.

I mean, it's almost half a year; I think that counts?

Time wise you are correct.

> "can I get my coding work actually done today" vs. "this can do customer support chat"

I think you need to define "can get coding work done" for this to make sense. I was using GPT-3 back then for basic scripts; does that count? Or only Claude Code?

I also think this is a false dichotomy: if you look at Project Vend or Vending-Bench, customer support etc. is by no means trivial. (Old but great story: https://www.businessinsider.com/car-dealership-chevrolet-cha...)


This. I have been doing my side-hustle code with opencode and the 3.2 reasoner, and it is way better than what I have at my day job with Copilot and whatever models are there.

Copilot is a bad harness that squanders the productivity of models like GPT 5.5.

Tell me more please!

A huge proportion of those scores are gamed anyways. Use whatever works for you at the price and availability you can afford

Or there will be DSv4.1/2/3 ;)

Definitely something in this realm, they call the models "preview" at a bunch of different points in the paper.

What I'm really hoping for is a double punch like V3 -> R1.


Well, if they distilled once…

This is not an official page; deepseek4.hk is not a domain owned by deepseek.


Generating a 4096x4096 image with gemini-3.1-flash-image-preview consumes 2,520 tokens, which is equivalent to $0.151 per image.

Generating a 3840x2160 image with gpt-image-2 consumes 13,342 tokens, which is equivalent to $0.4 per image.

This model is more than twice as expensive as Gemini.
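The arithmetic behind those figures can be sketched as follows (the per-million-token rates are back-solved from the numbers quoted above, not official pricing):

```python
# Back-of-the-envelope check of the per-image costs quoted above.
# The per-token rates here are inferred from the quoted figures,
# not taken from any official price list.

def cost_per_image(tokens_per_image: int, usd_per_million_tokens: float) -> float:
    """Dollar cost of one image, given its token count and a per-token rate."""
    return tokens_per_image * usd_per_million_tokens / 1_000_000

# gemini-3.1-flash-image-preview: 2,520 tokens at ~$60/M tokens
gemini = cost_per_image(2_520, 60.0)   # ~$0.151 per image

# gpt-image-2: 13,342 tokens at ~$30/M tokens
gpt = cost_per_image(13_342, 30.0)     # ~$0.400 per image

print(f"gemini: ${gemini:.3f}, gpt: ${gpt:.3f}, ratio: {gpt / gemini:.1f}x")
```

The ratio comes out to roughly 2.6x, which matches the "more than twice as expensive" claim.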


this is apples to oranges; it compares the flash version against a full version

this thing is like 5x better than flash at fine grain detail


Google's naming might be misleading, currently 3.1 flash image outperforms the available pro version (3.0 pro) on most benchmarks: https://deepmind.google/models/model-cards/gemini-3-1-flash-...

.40 cents for high quality output is insanely cheap

it is only going to get cheaper


> .40 cents

Warning: Verizon math ahead.


In case anyone is unfamiliar with one of the most infuriating phone calls of all time: https://www.youtube.com/watch?v=MShv_74FNWU

lol, noted thanks!

You people keep saying this, and token prices keep doubling. The cope of the gambler is truly something to marvel at.

Misleading conclusion.

This model is 8 times cheaper than Gemini for 1K images. Gemini is extremely overpriced.

A 1K image with Gemini is roughly $0.08 and only $0.01 with GPT Image.


In fact, you need to pay regardless of whether the output includes reasoning tokens or not

Vue provides a computed feature that acts as a buffer layer between the view and the state, so the view and the state are not necessarily strictly bound
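The idea can be sketched in plain JavaScript (this illustrates what `computed` does conceptually; Vue's real `computed` additionally tracks dependencies and caches results until they change):

```javascript
// Plain-JS sketch of "computed as a buffer layer": the view reads derived
// values from a getter, never the raw state directly, so view and state
// stay only loosely coupled.

const state = { firstName: "Ada", lastName: "Lovelace" };

// The "computed" layer the view consumes.
const computedView = {
  get fullName() {
    return `${state.firstName} ${state.lastName}`;
  },
};

console.log(computedView.fullName); // -> "Ada Lovelace"
state.firstName = "Grace";
console.log(computedView.fullName); // -> "Grace Lovelace"
```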


Computed is only a concept because you need a band-aid for that lack of separation. You can read more about my efforts ditching Vue here:

https://blog.nestful.app/s/the-tech-behind-nestful


Wouldn't it be better to use verified medicine? Even if a patient could see a doctor at any time, the doctor would not prescribe such unverified peptides to the patient


Container ships + modular weapons are too crazy


No matter how large the library, it won't include WhatsApp's API


san check, 1d10


Oh, bro, you're practically living in 1689


That's the dream.

