Project Rebound

FOLLOW US:

Subscribe to our daily newsletter

Your subscription could not be saved. Please try again.

Your subscription has been successful.

By providing an email address. I agree to the Terms of Use and acknowledge that I have read the Privacy Policy.

AI models struggle to identify nonsense, says study

Agence France-Presse / 01:42 AM September 15, 2023

German Minister of Economics and Climate Protection Robert Habeck (L) stands in front of a dancing robot at the Robotics Innovation Center of the German Research Center for Artificial Intelligence (DFKI) in Bremen, Germany, on September 14, 2023. German Minister of Economics and Climate Protection Robert Habeck visits the Robotics Innovation Center of the German Research Center for Artificial Intelligence and the Fraunhofer Institute for Manufacturing Technology and Advanced Materials (IFAM). (Photo by FOCKE STRANGMANN / AFP)

German Minister of Economics and Climate Protection Robert Habeck (L) stands in front of a dancing robot at the Robotics Innovation Center of the German Research Center for Artificial Intelligence (DFKI) in Bremen, Germany, on September 14, 2023. German Minister of Economics and Climate Protection Robert Habeck visits the Robotics Innovation Center of the German Research Center for Artificial Intelligence and the Fraunhofer Institute for Manufacturing Technology and Advanced Materials (IFAM). ( AFP)

PARIS, France – The AI models that power chatbots and other applications still have difficulty distinguishing between nonsense and natural language, according to a study released on Thursday.

The researchers at Columbia University in the United States said their work revealed the limitations of current AI models and suggested it was too early to let them loose in legal or medical settings.

Article continues after this advertisement

They put nine AI models through their paces, firing hundreds of pairs of sentences at them and asking which were likely to be heard in everyday speech.

They asked 100 people to make the same judgement on pairs of sentences like: “A buyer can own a genuine product also / One versed in circumference of highschool I rambled.”

The research, published in the Nature Machine Intelligence journal, then weighed the AI answers against the human answers and found dramatic differences.

Article continues after this advertisement

Sophisticated models like GPT-2, an earlier version of the model that powers viral chatbot ChatGPT, generally matched the human answers.

Article continues after this advertisement

Other simpler models did less well.

Article continues after this advertisement

But the researchers highlighted that all the models made mistakes.

“Every model exhibited blind spots, labelling some sentences as meaningful that human participants thought were gibberish,” said psychology professor Christopher Baldassano, an author of the report.

Article continues after this advertisement

“That should give us pause about the extent to which we want AI systems making important decisions, at least for now.”

Tal Golan, another of the paper’s authors, told AFP that the models were “an exciting technology that can complement human productivity dramatically”.

However, he argued that “letting these models replace human decision-making in domains such as law, medicine, or student evaluation may be premature”.

Among the pitfalls, he said, was the possibility that people might intentionally exploit the blind spots to manipulate the models.

Your subscription could not be saved. Please try again.

Your subscription has been successful.

Subscribe to our daily newsletter

By providing an email address. I agree to the Terms of Use and acknowledge that I have read the Privacy Policy.

AI models burst into public consciousness with the release of ChatGPT last year, which has since been credited with passing various exams and has been touted as a possible aide to doctors, lawyers and other professionals.

gsg

TOPICS: AI, Artificial Intelligence, technology

READ NEXT

Infinix ZERO 30 5G, the ultimate vlogging phone is coming to t...

AI chatbots built software in under 7 minutes for less than $1

EDITORS' PICK

MWC lauds partners, barangay desludging and environmental achievers in ToKasangga 2024

Clemency for Mary Jane Veloso? Marcos says ‘everything is on the table’

10 One Direction hits soaring on Spotify after Liam Payne’s death

Bong Go pushes for Student Loan Moratorium Bill

Cratering peso sinks to record-low 59 to a dollar

Endometriosis linked to slightly higher risk of early death

MOST READ

Comelec to study DQ for Cagayan de Oro bet in registration mess

RESULTS: Gilas Pilipinas vs New Zealand at Fiba Asia Cup Qualifiers

PCG: Chinese cable-laying vessel docks at Subic port to unload cables

Gilas Pilipinas beats New Zealand for first time in Fiba competition

newsinfo

MWC lauds partners, barangay desludging and environmental achievers in ToKasangga 2024

globalnation

Clemency for Mary Jane Veloso? Marcos says ‘everything is on the table’

usa

10 One Direction hits soaring on Spotify after Liam Payne’s death

newsinfo

Bong Go pushes for Student Loan Moratorium Bill

business

Cratering peso sinks to record-low 59 to a dollar

globalnation

Endometriosis linked to slightly higher risk of early death

www

Comelec to study DQ for Cagayan de Oro bet in registration mess

sports

RESULTS: Gilas Pilipinas vs New Zealand at Fiba Asia Cup Qualifiers

globalnation

PCG: Chinese cable-laying vessel docks at Subic port to unload cables

sports

Gilas Pilipinas beats New Zealand for first time in Fiba competition

business

Cratering peso sinks to record-low 59 to a dollar

newsinfo

House probe retraces bulk withdrawals of confidential funds

TAGS: AI, Artificial Intelligence, technology

Your subscription could not be saved. Please try again.

Your subscription has been successful.

Subscribe to our newsletter!

By providing an email address. I agree to the Terms of Use and acknowledge that I have read the Privacy Policy.

Disclaimer: Comments do not represent the views of INQUIRER.net. We reserve the right to exclude comments which are inconsistent with our editorial standards. FULL DISCLAIMER

© Copyright 1997-2024 INQUIRER.net | All Rights Reserved

This is an information message

We use cookies to enhance your experience. By continuing, you agree to our use of cookies. Learn more here.