Using AI for image transcripts, yay or nay?

Gonzako@lemmy.world · 9 days ago

Using AI for image transcripts, yay or nay?

x74sys@programming.dev · edit-2 8 days ago

In my opinion, no. It has to be heavily curated. You’re not saving yourself a lot of work if you have to read it word by word (and probably correct stuff) anyway.

I think just one very short sentence describing what’s on there (it doesn’t have to be detailed) is a lot better than whatever an LLM will give you.

Baŝto@discuss.tchncs.de · 16 hours ago

It depends a lot on the image. Multi panel comics have pretty long alt texts and AI can make it faster to reproduce the text in tge image.

x74sys@programming.dev · 7 hours ago

But then you’re primarily extracting text, which you don’t need LLMs for. OCR tools will do the job much cheaper and more effective.

lambisio@feddit.cl · 7 hours ago

and AI can make it faster to reproduce the text in tge image

That was solved decades ago without AI. It’s called OCR.