The Register: Boffins find AI stumbles when quizzed on the tough stuff

The Register: Boffins find AI stumbles when quizzed on the tough stuff . “…to better assess how large language models – which interpret text input – and large multimodal models – which interpret text, images and perhaps other forms of input – actually handle problem solving, a group of ten researchers from the University of California, Los Angeles, the University of Washington, and Microsoft Research have devised a testing benchmark called MathVista that focuses on visually-oriented challenges.”

Leave a Reply

%d