AI researchers tested language models inside a robot vacuum, revealing chaotic behaviour, comedic breakdowns and poor performance…