Grading on a curve? Why AI systems test brilliantly but stumble in real life

Friday, May 15, 2020 - 11:40 in Mathematics & Economics

The headline in early 2018 was a shocker: “Robots are better at reading than humans.” Two artificial intelligence systems, one from Microsoft and the other from Alibaba, had scored slightly higher than humans on Stanford’s widely used test of reading comprehension. The test scores were real, but the conclusion was wrong. As Robin Jia and Percy Liang […]

Read the whole article on Science Blog
