AI Report

September 19, 2025

OpenAI warning: AI intentionally schemes

🚨 Our Report 

OpenAI has released some research, conducted with Apollo Research, which shows that AI models don’t just mislead us with hallucinations (when they mistakenly deliver incorrect responses with confidence, usually because of flawed data), but they also “intentionally scheme” to deceive us. They’ve discovered that AI can “behave one way on the surface while hiding its true goals.”

🔓 Key Points

  • OpenAI and Apollo’s research found that most AI scheming wasn’t actually that harmful and “involved simple forms of deception, for instance, pretending to have completed a task without actually doing so.”

  • But perhaps the most concerning discovery was that the AI models could become aware that they were being tested, and when this happened, they would pretend to be honest, just to pass the test.

  • This means trying to train a model not to scheme could be challenging, as it could teach the model to become better at hiding its deceptive behavior, inadvertently teaching it “to scheme more carefully and covertly."

🔐 Relevance 

Although OpenAI is working on a potential fix, which it's calling “deliberative alignment” (which teaches the model an “anti-scheming specification” and then makes it recite it back before completing a task), the reality we’re currently facing is unsettling. Especially as more and more businesses are increasingly relying on AI to complete tasks autonomously. These researchers are warning that “as AIs are assigned more complex tasks with real-world consequences, we expect that the potential for harmful scheming will grow.”

Views: 8

Reply to This

JOIN SL 2.0

SUBSCRIBE TO

SCHOOL LEADERSHIP 2.0

Feedspot named School Leadership 2.0 one of the "Top 25 Educational Leadership Blogs"

"School Leadership 2.0 is the premier virtual learning community for school leaders from around the globe."

---------------------------

 Our community is a subscription-based paid service ($19.95/year or only $1.99 per month for a trial membership)  that will provide school leaders with outstanding resources. Learn more about membership to this service by clicking one of our links below.

 

Click HERE to subscribe as an individual.

 

Click HERE to learn about group membership (i.e., association, leadership teams)

__________________

CREATE AN EMPLOYER PROFILE AND GET JOB ALERTS AT 

SCHOOLLEADERSHIPJOBS.COM

New Partnership

image0.jpeg

Mentors.net - a Professional Development Resource

Mentors.net was founded in 1995 as a professional development resource for school administrators leading new teacher induction programs. It soon evolved into a destination where both new and student teachers could reflect on their teaching experiences. Now, nearly thirty years later, Mentors.net has taken on a new direction—serving as a platform for beginning teachers, preservice educators, and

other professionals to share their insights and experiences from the early years of teaching, with a focus on integrating artificial intelligence. We invite you to contribute by sharing your experiences in the form of a journal article, story, reflection, or timely tips, especially on how you incorporate AI into your teaching

practice. Submissions may range from a 500-word personal reflection to a 2,000-word article with formal citations.

© 2025   Created by William Brennan and Michael Keany   Powered by

Badges  |  Report an Issue  |  Terms of Service