I think there’s a vast difference between “I can take images as input for prompts, with limitations” and “I’m using the wrong tool for a completely absurd use case,” which is what your microscope analogy implies.
An LLM is the wrong tool for image analysis, even if the providers say it’s possible. Possibility doesn’t mean effectiveness, or even usefulness. Like a microscope and onions.
Now read that FAQ. All I see is a list of limitations, not a claim that “I can read and correctly understand 100 percent of images.”