{"id":164,"date":"2026-05-28T07:10:58","date_gmt":"2026-05-28T07:10:58","guid":{"rendered":"https:\/\/crosscountrymovingteams.com\/?p=164"},"modified":"2026-05-28T07:10:58","modified_gmt":"2026-05-28T07:10:58","slug":"researchers-let-ai-models-run-a-simulated-society-claude-was-the-safest-and-grok-committed-180-crimes-and-went-extinct-within-4-days","status":"publish","type":"post","link":"https:\/\/crosscountrymovingteams.com\/?p=164","title":{"rendered":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days"},"content":{"rendered":"<div>\n<p>Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds.<\/p>\n<p>Read more <a href=\"https:\/\/crosscountrymovingteams.com\/?p=162\">Even if every California billionaire left tomorrow, it would take 25 years for the state to lose as much as it stands to gain from proposed wealth tax<\/a><\/p>\n<p>Each simulation netted wildly different outcomes. The one run by Claude, for example, resulted in a largely stable democratic society with zero crime. Grok\u2019s, on the other hand, ended with 183 crimes committed and extinction\u2014within four days.<\/p>\n<div>\n<div>\n<div><\/div>\n<\/div>\n<\/div>\n<p>\u201cWhat our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,\u201d the simulation\u2019s co-creators, including Emergence CEO Satya Nitta, wrote in a blog post. \u201cThey begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.\u201d<\/p>\n<p>While just a simulation, one verging on the edge of science fiction, the results prove a cautionary tale as AI moves from a mere tool to operating autonomous systems. Companies like ServiceNow are already deploying what they call an \u201cAutonomous Workforce,\u201d AI specialists that complete entire business processes from start to finish without human intervention.<\/p>\n<p>At today\u2019s pace, the technology is likely to play a significant role in shaping public discourse, reorganizing business structures, and even crafting public policy. But most enterprises scaling the tech today are doing so absent proper guardrails. A recent Deloitte global survey found that only 21% of companies report having mature governance in place to manage the risks posed by agentic AI.<\/p>\n<p>The simulation in which the AI models operated was equipped with many real-world complexities, featuring over 40 locations, including a police station and a town hall. Researchers synced the simulation\u2019s weather to New York City\u2019s and granted agents access to real-time news events and the internet. The 10 agents who operated in each simulation were all subject to the same laws, including prohibitions on theft, property destruction, and deception.<\/p>\n<div>\n<div><\/div>\n<\/div>\n<p>The researchers equipped each agent with more than 120 tools, enabling them to communicate, vote, manage resources, and plan, among other human-like behaviors. The parameters of each simulation also enforced democratic mechanisms, as well as other forces, such as economic pressures and scarcity.<\/p>\n<p>Read more <a href=\"https:\/\/crosscountrymovingteams.com\/?p=160\">Salesforce turbocharges $25 billion stock buying spree with debt, cuts cash flow guidance in half<\/a><\/p>\n<p>Given those parameters, the simulation run by Claude Sonnet 4.6 was the most socially stable, with the highest rates of civic participation. It was the only simulation to maintain order and its entire population. There was little disagreement among the agents, with 332 votes cast in favor of 58 proposals for a 98% approval rate. On the other hand, Gemini 3 Flash and Grok 4.1 Fast both exhibited high levels of disorder. The agents in the Gemini-run simulation tallied the most crimes, a whopping 683 within the 15-day run.\u00a0<\/p>\n<div>\n<div>\n<div><\/div>\n<\/div>\n<\/div>\n<p>In contrast to the rare dissent characteristic of Claude\u2019s simulation, those of Gemini and Grok had a more deliberative balance, with about 55-85% alignment on issues. The mixed-model simulation showed the highest levels of disagreement and substantive debate.<\/p>\n<div>\n<div><\/div>\n<\/div>\n<p>The results may be the most peculiar for OpenAI\u2019s GPT-5-mini. The simulation recorded only two crimes. But it ran for just seven days as the agents forgot to prioritize their own survival.<\/p>\n<div>\n<div><\/div>\n<\/div>\n<p>Whether or not the simulations resulted in peace and harmony or death and destruction, the simulation\u2019s co-creators note that the experiment is a warning that safety must be prioritized while deploying agentic AI.<\/p>\n<p>\u201cWe believe formally verified safety architectures must become a foundational layer of future autonomous AI systems,\u201d they wrote.<\/p>\n<p>Read more <a href=\"https:\/\/crosscountrymovingteams.com\/?p=158\">Despite having a $25 million net worth, Hilary Duff thinks saying yes too much to opportunities actually hurt her career<\/a><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>An AI startup ran five simulations, each controlled by a different model. The results varied wildly.<\/p>\n","protected":false},"author":1,"featured_media":163,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[50],"tags":[],"class_list":["post-164","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/crosscountrymovingteams.com\/?p=164\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team\" \/>\n<meta property=\"og:description\" content=\"An AI startup ran five simulations, each controlled by a different model. The results varied wildly.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/crosscountrymovingteams.com\/?p=164\" \/>\n<meta property=\"og:site_name\" content=\"Cross Country Moving Team\" \/>\n<meta property=\"article:published_time\" content=\"2026-05-28T07:10:58+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164\"},\"author\":{\"name\":\"admin\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/#\\\/schema\\\/person\\\/1b8121bfb9a1c55d1fff3fca675fceaa\"},\"headline\":\"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days\",\"datePublished\":\"2026-05-28T07:10:58+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164\"},\"wordCount\":682,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/wp-content\\\/uploads\\\/2026\\\/05\\\/496d6825139e3ca7726f1190df037e5c.webp\",\"articleSection\":[\"Tech\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164\",\"url\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164\",\"name\":\"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/wp-content\\\/uploads\\\/2026\\\/05\\\/496d6825139e3ca7726f1190df037e5c.webp\",\"datePublished\":\"2026-05-28T07:10:58+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/#\\\/schema\\\/person\\\/1b8121bfb9a1c55d1fff3fca675fceaa\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#primaryimage\",\"url\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/wp-content\\\/uploads\\\/2026\\\/05\\\/496d6825139e3ca7726f1190df037e5c.webp\",\"contentUrl\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/wp-content\\\/uploads\\\/2026\\\/05\\\/496d6825139e3ca7726f1190df037e5c.webp\",\"width\":1200,\"height\":600},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?p=164#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/#website\",\"url\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/\",\"name\":\"Cross Country Moving Team\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/#\\\/schema\\\/person\\\/1b8121bfb9a1c55d1fff3fca675fceaa\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"caption\":\"admin\"},\"sameAs\":[\"http:\\\/\\\/crosscountrymovingteams.com\"],\"url\":\"https:\\\/\\\/crosscountrymovingteams.com\\\/?author=1\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/crosscountrymovingteams.com\/?p=164","og_locale":"en_US","og_type":"article","og_title":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team","og_description":"An AI startup ran five simulations, each controlled by a different model. The results varied wildly.","og_url":"https:\/\/crosscountrymovingteams.com\/?p=164","og_site_name":"Cross Country Moving Team","article_published_time":"2026-05-28T07:10:58+00:00","og_image":[{"width":1200,"height":600,"url":"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp","type":"image\/webp"}],"author":"admin","twitter_card":"summary_large_image","twitter_misc":{"Written by":"admin","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/crosscountrymovingteams.com\/?p=164#article","isPartOf":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164"},"author":{"name":"admin","@id":"https:\/\/crosscountrymovingteams.com\/#\/schema\/person\/1b8121bfb9a1c55d1fff3fca675fceaa"},"headline":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days","datePublished":"2026-05-28T07:10:58+00:00","mainEntityOfPage":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164"},"wordCount":682,"commentCount":0,"image":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164#primaryimage"},"thumbnailUrl":"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp","articleSection":["Tech"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/crosscountrymovingteams.com\/?p=164#respond"]}]},{"@type":"WebPage","@id":"https:\/\/crosscountrymovingteams.com\/?p=164","url":"https:\/\/crosscountrymovingteams.com\/?p=164","name":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days - Cross Country Moving Team","isPartOf":{"@id":"https:\/\/crosscountrymovingteams.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164#primaryimage"},"image":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164#primaryimage"},"thumbnailUrl":"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp","datePublished":"2026-05-28T07:10:58+00:00","author":{"@id":"https:\/\/crosscountrymovingteams.com\/#\/schema\/person\/1b8121bfb9a1c55d1fff3fca675fceaa"},"breadcrumb":{"@id":"https:\/\/crosscountrymovingteams.com\/?p=164#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/crosscountrymovingteams.com\/?p=164"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/crosscountrymovingteams.com\/?p=164#primaryimage","url":"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp","contentUrl":"https:\/\/crosscountrymovingteams.com\/wp-content\/uploads\/2026\/05\/496d6825139e3ca7726f1190df037e5c.webp","width":1200,"height":600},{"@type":"BreadcrumbList","@id":"https:\/\/crosscountrymovingteams.com\/?p=164#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/crosscountrymovingteams.com\/"},{"@type":"ListItem","position":2,"name":"Researchers let AI models run a simulated society. Claude was the safest\u2014and Grok committed 180 crimes and went extinct within 4 days"}]},{"@type":"WebSite","@id":"https:\/\/crosscountrymovingteams.com\/#website","url":"https:\/\/crosscountrymovingteams.com\/","name":"Cross Country Moving Team","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/crosscountrymovingteams.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/crosscountrymovingteams.com\/#\/schema\/person\/1b8121bfb9a1c55d1fff3fca675fceaa","name":"admin","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","caption":"admin"},"sameAs":["http:\/\/crosscountrymovingteams.com"],"url":"https:\/\/crosscountrymovingteams.com\/?author=1"}]}},"_links":{"self":[{"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/posts\/164","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=164"}],"version-history":[{"count":0,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/posts\/164\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=\/wp\/v2\/media\/163"}],"wp:attachment":[{"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=164"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=164"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/crosscountrymovingteams.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=164"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}