Upvote!
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Pro@programming.devM to Technology@programming.devEnglish ·
edit-2
2 days ago

ClockBench: Even the best AI models can't reliably read the clock

clockbench.ai

external-link
message-square
10
fedilink
  • cross-posted to:
  • technology@lemmy.world
10
external-link

ClockBench: Even the best AI models can't reliably read the clock

clockbench.ai

Pro@programming.devM to Technology@programming.devEnglish ·
edit-2
2 days ago
message-square
10
fedilink
  • cross-posted to:
  • technology@lemmy.world
ClockBench AI Benchmark
clockbench.ai
external-link
ClockBench evaluates whether models can read analog clocks - a task that is trivial for humans, but current frontier models struggle with.
  • criss_cross@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    1
    ·
    2 days ago

    https://help.openai.com/en/articles/8400551-chatgpt-image-inputs-faq

    • Lembot_0004@discuss.online
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      4
      ·
      2 days ago

      Now read that FAQ. I see just a bunch of limitations descriptions, not a “I can read and correctly understand 100 percent of the images”

      • criss_cross@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        2 days ago

        I think there’s a vast difference between “I say I can take in images as input for prompts with limitations “ and “I’m using the wrong tool for a completely absurd use case” like your microscope analogy implies.

        • Lembot_0004@discuss.online
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          3
          ·
          1 day ago

          LLM is the wrong tool for image analysis, even if the providers say that it is possible. Possibility doesn’t mean effectiveness or even usefulness. Like a microscope and onions.

Technology@programming.dev

Technology@programming.dev

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !Technology@programming.dev

Share interesting Technology news and links.

Rules:

  1. No paywalled sites at all.
  2. News articles has to be recent, not older than 2 weeks (14 days).
  3. No external video links, only native(.mp4,…etc) links under 5 mins.
  4. Post only direct links.

To encourage more original sources and keep this space commercial free as much as I could, the following websites are Blacklisted:

  • Al Jazeera;
  • NBC;
  • CNBC;
  • Substack;
  • Tom’s Hardware;
  • ZDNet;
  • TechSpot;
  • Ars Technica;
  • Vox Media outlets, with exception for Axios;
  • Engadget;
  • TechCrunch;
  • Gizmodo;
  • Futurism;
  • PCWorld;
  • ComputerWorld;
  • Mashable;
  • Hackaday;
  • WCCFTECH;
  • Neowin.

More sites will be added to the blacklist as needed.

Encouraged:

  • Archive links in the body of the post.
  • Linking to the direct source, instead of linking to an article talking about the source.

Misc:

Relevant Communities:

  • Beehaw Technology- Technology Related Discussions.
  • lemmy.zip Technology- Hard Tech news.
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 596 users / day
  • 2.04K users / week
  • 4.14K users / month
  • 4.14K users / 6 months
  • 1 local subscriber
  • 567 subscribers
  • 624 Posts
  • 1.38K Comments
  • Modlog
  • mods:
  • Pro@programming.dev
  • BE: 0.19.6
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org