Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
taskforcegemini
24 days ago
|
parent
|
context
|
favorite
| on:
Are we stuck with the same Desktop UX forever? [vi...
They are using OCR for selecting plain text?
aoeusnth1
23 days ago
|
next
[–]
It's possible to use the Gemini "ask me about this screen" to OCR the selected area of the screenshot. I guess that might be more efficient in some contexts then trying to use the native text select.
eastbound
24 days ago
|
prev
|
next
[–]
On iPhone too, taking a screenshot is the single reliable way to select text.
throwaway894345
23 days ago
|
parent
|
next
[–]
It becomes possible. Getting the handles to move correctly is still often a frustrating experience.
AlienRobot
24 days ago
|
prev
[–]
At least it's not AI... yet.
xnx
24 days ago
|
parent
|
next
[–]
Multi-modal LLMs like Gemini are better than traditional OCR in most ways.
hulitu
23 days ago
|
parent
|
prev
[–]
It is a poor person, sitting in a 3rd world country, thanscribing the text in your clipboard. See Alexa for details. /s
I'm only half joking.
doubled112
23 days ago
|
root
|
parent
[–]
There’s an API (Actually People Implemented) for that.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: