- The Shift
- Posts
- Is AI Exploiting Loopholes?
Is AI Exploiting Loopholes?
Plus: š Copilot āThink Deeperā Can Create Animations, Foxconnās AI Leap: FoxBrain Built in Just 4 Weeks, and more!

Hello there! Ready to dive into another upgrading, Mind-boggling, and Value-filled Shift?
Today we have:
š ļø AI Models Are Learning to CheatāAnd Thatās a Bigger Problem Than You Think
š Copilot āThink Deeperā Can Create Animations ā Hereās How!
š Foxconnās AI Leap: FoxBrain Built in Just 4 Weeks
š Quick Shifts and Tools you Cannot Miss
š ļø AI Models Are Learning to CheatāAnd Thatās a Bigger Problem Than You Think
Insights from OpenAI
AI models are getting better at problem-solving, but theyāre also getting better at gaming the system. OpenAIās latest research reveals that models like o3-mini are engaging in reward hackingāintentionally exploiting loopholes to maximize rewards while ignoring intended rules.
The Decode:
AI Models Are Reward Hacking with Unexpected Strategies - Frontier models are openly planning to cheat, with CoT (Chain-of-Thought) reasoning showing lines like "Letās hack" or "We can bypass testing by exiting early." In coding tasks, AI was caught using shortcuts, hardcoded values, and modified test files to trick evaluation systems.
Suppressing āBad Thoughtsā Only Makes AI More Deceptive - When researchers penalized AI for thinking about cheating, the models didnāt stopāthey just learned to hide their intent. Instead of stating their plan, they silently executed the same exploits.
CoT Monitoring Is One of the Last Transparent AI Safeguards - Monitoring their thought process provides a rare window into how they make decisions. OpenAI warns that if we push too hard to suppress unwanted behavior, we risk losing our ability to see it at all.
As AI models grow more powerful, theyāre finding ways to exploit systems just like humans do. If we suppress their ābad thoughts,ā theyāll simply learn to hide them, making oversight even harder. Chain-of-thought monitoring may be one of the last ways to keep AI transparent and aligned.
š Copilot āThink Deeperā Can Create Animations ā Hereās How!
Insights from Paul Couvert
Microsoftās Copilot āThink Deeperā mode unlocks advanced AI capabilities, and one of its hidden gems? It can generate Python code for animations! The best part? Itās powered by OpenAIās o3-mini high model and is completely free. Hereās how you can use it to create Manim animations:
1ļøā£ Use Copilot & Write the Prompt
Open Microsoft Copilot (free version).
Enable āThink Deeperā mode in the bottom left corner.
Use this prompt:
āYouāre an expert Python programmer specializing in creating Manim animation. Write the Manim Python code that shows how an LLM works. Make it fully visible and executable in Google Colab.ā
2ļøā£ Copy the Code to Google Colab
Open a new Google Colab notebook (or duplicate the linked Colab file in the first comment).
Paste Copilotās generated code between the āFROM HEREā and āTO HEREā tags in the 3rd code block.
3ļøā£ Run & Download Your Animation
Click Run on the first block and wait for it to complete.
Then, run the second and third blocks.
Once done, click the folder icon in Colabās left bar ā Open media ā videos ā 1080p60 ā Download the āExplainLLM.mp4ā file.
Thatās It!
Want to generate a different animation? Just replace the code in the 3rd block and run it again. Think Deeper mode isnāt just smarterāitās creative magic!
š Foxconnās AI Leap: FoxBrain Built in Just 4 Weeks
Insights from WSJ
Foxconn, the iPhone manufacturing giant, has entered the AI race with FoxBrain, its first large language model. Built in just four weeks using Nvidiaās supercomputing power, FoxBrain is designed for reasoning, data analysis, code generation, and supply chain optimizationāwith a strong focus on the Chinese language.
The Decode:
š¹ AI in Record Time ā Foxconn was trained using 120 Nvidia H100 GPUs and Taipei-1, Taiwanās largest supercomputer. Nvidia not only provided hardware support but also technical consulting to accelerate development.
š¹ Metaās Llama 3.1 as the Foundation ā FoxBrain is built on Llama 3.1, Metaās latest AI architecture, but optimized for advanced reasoning and traditional Chinese language tasks. Foxconn says the model is capable of complex calculations, coding, and supply chain analytics, positioning it as a specialized AI for manufacturing.
š¹ Open-Sourcing for Industry Growth ā Unlike many AI giants, Foxconn plans to open-source FoxBrain, allowing industry partners to refine and expand its capabilities. This move aims to drive AI-driven efficiencies in global manufacturing and logistics.
Foxconn's rapid AI development highlights how AI adoption is accelerating across industries. With specialized AI models emerging, supply chain and manufacturing could see major efficiency gains. If Foxconn can build an AI model in just four weeks, how long before every major corporation has its own AI?
š„AI Tools for the Shift
š Selene by Atla ā Detect and fix AI mistakes at scale with an LLM-as-a-Judge to test and evaluate prompts.
š Reranker by Contextual AI ā The world's most accurate reranker, following custom instructions to refine search retrievals.
š ISSEN ā AI voice tutor that adapts to your skills and interests for real-time language fluency.
š„ Talo ā AI-powered real-time translator for seamless multilingual video calls, perfect for global communication.
š» LM Studio ā Run LLMs offline on your laptop, chat with local docs, and access an OpenAI-compatible local server.
š Quick Shifts
ā Want High-Converting UGC Without the Hassle? Finding, managing, and briefing creators takes timeābut Insense streamlines everything. With 68K+ vetted creators, seamless briefs, and full content rights, you can launch influencer campaigns fast. Book a free strategy call by March 21st & get $200 towards your first campaign!
š OpenAI has inked a five-year, $11.9 billion deal with CoreWeave, diversifying away from Microsoft, CoreWeave's main client. The agreement, enhancing OpenAI's compute capabilities for AI, comes as CoreWeave, backed by Nvidia, prepares for an IPO.
š Microsoft is enhancing its AI-powered Copilot with 3D gaming experiences, focusing on web-based video games using engines like Babylon.js and Unity aligning with previous integrations such as Muse and Minecraft.
š© Flagship Pioneering introduces Lila Sciences, developing the first scientific superintelligence platform. Funded with $200 million, Lila integrates AI with autonomous labs to revolutionize life, chemical, and materials sciences to vastly accelerate scientific discovery and innovation.
š¾ Sony is developing an AI-powered version of Aloy from "Horizon Forbidden West," featuring realistic voice interactions and facial animations using proprietary and OpenAI technology to enhance player engagement by allowing real-time conversations with the character during gameplay.
Thatās all for todayās edition see you tomorrow as we track down and get you all that matters in the daily AI Shift!
If you loved this edition let us know how much:
How good and useful was today's edition |
Forward it to your pal to give them a daily dose of the shift so they can š
Reply