This site is updated Hourly Every Day

Trending Featured Popular Today, Right Now

Colorado's Only Reliable Source for Daily News @ Marijuana, Psychedelics & more...

Post: You’ll Laugh at This Simple Task AI Still Can’t Do

Picture of Anschutz Medical Campus

Anschutz Medical Campus

AnschutzMedicalCampus.com is an independent website not associated or affiliated with CU Anschutz Medical Campus, CU, or Fitzsimons innovation campus.

Recent Posts

Anschutz Medical Campus

You’ll Laugh at This Simple Task AI Still Can’t Do
Facebook
X
LinkedIn
WhatsApp
Telegram
Threads
Email

Image by ANDRZEJ WOJCICKI / Getty Images Most human children learn how to tell time around ages six and seven — but artificial intelligence still, apparently, can’t parse a clock face.

Researchers from Scotland’s University of Edinburgh have found that AI models that can process text and images — otherwise known as multimodal large language models, or MLLMs — could only read analog clock faces a pitiful 25 percent of the time.

In a paper that’s awaiting peer review , the AI informatics researchers explained that Google’s Gemini was the "best" of the crop when they tested out MLLMs from that company, OpenAI, Anthropic, and others to see how well they could read clock faces and yearly calendars. As they soon found, all of the models they testedseemed to be challenged by the "combination of spatial awareness, context, and basic math" required to read time and dates.

"Researchers tested various clock designs, including some with Roman numerals, with and without second hands, and different [colored] dials," the statement expounded. "Their findings show that AI systems, at best, got clock-hand positions right less than a quarter of the time. Mistakes were more common when clocks had Roman numerals or [stylized] clock hands."

When testing out how well the MLLMs handled calendars — specifically, ten years of the large annual kind, which show all 12 months of the year on one page — the researchers found that they were slightly better at reading dates than times, but only slightly.

GPT-o1, the first generation of OpenAI’s reasoning models, ended up scoring the highest on the calendar challenge by getting the date questions right 80 percent of the time. Still, it answered one-fifth of the questions put to it — such as "Which day of the week is New Year’s Day?" or "What is the 153rd day of the year?" — incorrectly.

Rohit Saxena, the study’s lead author, said in the school’s press release that although "most people can tell the time and use calendars from an early age," AI seems, per the new research, to struggle to "carry out what are quite basic skills for people."

"These shortfalls must […]

Leave a Reply

Your email address will not be published. Required fields are marked *

You Might Be Interested...