Ars Technica: Microsoft unveils AI model that understands image content, solves visual puzzles
Ars Technica: Microsoft unveils AI model that understands image content, solves visual puzzles. “On Monday, researchers from Microsoft introduced Kosmos-1, a multimodal model that can reportedly analyze images for content, solve visual puzzles, perform visual text recognition, pass visual IQ tests, and understand natural language instructions.”
ResearchBuzz
0
Tags :