A cartoon of a woman standing next to a coffee machine, holding out a takeaway coffee container. She smiles as she asks “Frankenstein?”
The same woman, now with no visible mouth, in a wider shot, showing two figures raising their hands and looking at each other: a man in a lab coat, glasses, and with grey frizzly hair, and a depiction of “Frankenstein’s monster” as soon in popular culture.
Volunteer singular, maybe. It’s the same person on every post I’ve seen today.
To me it just doesn’t seem to satisfy the purpose of alt text. It reads a lot more like an LLM being asked to visually describe what it sees. It’s too verbose.
Sure, and yes, it’s literally doing what alt text would do, for the same purpose (i.e. describe the image for the visually impaired). The “style” of these that I’ve seen (not just here) is pretty verbose, so I don’t think that’s necessarily an indication of llm use. Obviously I can’t prove it either way, but I’d rather give these the benefit of the doubt, since this is useful work if it helps people follow along.
The visually impaired don’t really get anything from descriptions like “in a wider shot” though, nor is “now with no visible mouth” a relevant detail because the style of the comic does not depict any character with a mouth unless they are speaking. That’s LLM logic.
Yeah, that’s fair. The mouth description probably seals it for me. I think it’d be more useful to describe the overall “nonplussed” expression than the literal description.
Transcription
A cartoon of a woman standing next to a coffee machine, holding out a takeaway coffee container. She smiles as she asks “Frankenstein?”
The same woman, now with no visible mouth, in a wider shot, showing two figures raising their hands and looking at each other: a man in a lab coat, glasses, and with grey frizzly hair, and a depiction of “Frankenstein’s monster” as soon in popular culture.
Is this AI generated?
no? or do you mean the transcript
The transcript, yeah.
Probably not. I think most of these are human volunteers
Volunteer singular, maybe. It’s the same person on every post I’ve seen today.
To me it just doesn’t seem to satisfy the purpose of alt text. It reads a lot more like an LLM being asked to visually describe what it sees. It’s too verbose.
Sure, and yes, it’s literally doing what alt text would do, for the same purpose (i.e. describe the image for the visually impaired). The “style” of these that I’ve seen (not just here) is pretty verbose, so I don’t think that’s necessarily an indication of llm use. Obviously I can’t prove it either way, but I’d rather give these the benefit of the doubt, since this is useful work if it helps people follow along.
The visually impaired don’t really get anything from descriptions like “in a wider shot” though, nor is “now with no visible mouth” a relevant detail because the style of the comic does not depict any character with a mouth unless they are speaking. That’s LLM logic.
Yeah, that’s fair. The mouth description probably seals it for me. I think it’d be more useful to describe the overall “nonplussed” expression than the literal description.