[question] Alt-text service, self hosted

Is there a service that I could plug into my misskey/firefish/whatev that provides an alt-text for an image I’m posting. I don’t post images that often, so the few times, it would be nice to have an automatic OCR or LLM/Stable Diffusion description of a picture attached to it.

Of course, without depending on Google or similars.

Any ideas? cc @selfhost@lemmy.ml @selfhosted@lemmy.world

  • Deebster@programming.dev
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 months ago

    I feel that auto-generated descriptions are going to generally be terrible, even with the new GPT AIs. There’s too much context needed to do a good job to be able to just feed an image into some code and get something useful.

    On the other hand, transcriptions should be able to be done more accurately, particularly with a bit of extra logic to recognise forms like Twitter posts.

    Some database of alt-texts might be possible by scraping for alt-texts and transcriptions from the fediverse, reddit, etc, but a quick search didn’t come up with anything.