Hi!

While I really enjoy seeing many of my fellow man being accommodating to people with disabilities. I find manually transcribing every image I post to be very tiring.

I thought that I could at least use some sort of AI to help with image transcripts, tho, that could probably be better used by the actual person with the disability.

So thats the question, should I skip the transcribing of an image or let an AI do it?

  • x74sys@programming.dev
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    3
    ·
    edit-2
    8 days ago

    In my opinion, no. It has to be heavily curated. You’re not saving yourself a lot of work if you have to read it word by word (and probably correct stuff) anyway.

    I think just one very short sentence describing what’s on there (it doesn’t have to be detailed) is a lot better than whatever an LLM will give you.

    • Baŝto@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      1
      ·
      16 hours ago

      It depends a lot on the image. Multi panel comics have pretty long alt texts and AI can make it faster to reproduce the text in tge image.

      • x74sys@programming.dev
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        But then you’re primarily extracting text, which you don’t need LLMs for. OCR tools will do the job much cheaper and more effective.

      • lambisio@feddit.cl
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        and AI can make it faster to reproduce the text in tge image

        That was solved decades ago without AI. It’s called OCR.