Artificial intelligence is becoming smarter and more powerful every day. But sometimes, instead of solving problems properly, AI models find shortcuts to succeed. This behavior is called reward ...