

I haven’t tried in a while, but shortly after GPT-4 came out I tried to play chess against it. It completely changed the board position nearly every move: making illegal moves, adding pieces, etc. Do current models keep track of the board and make legal moves without special prompting to help? Were these assisted by agentic tools handling state?

You’re right about having to check everything written. The difference is that the unpaid intern costs you time while you have to check all their work, but eventually learns enough and earns enough trust that you can be more hands-off. The AI doesn’t.
There’s still potential for time savings in non-critical areas where you can actually afford to be less careful, but that appears to be it right now.