-
Mozilla: Behind the Scenes Hardening Firefox with Claude Mythos Preview – Mozilla Hacks (May 7, 2026)
Mozilla used advanced AI models, including Claude Mythos Preview, and a custom harness to find and fix hundreds of latent Firefox security bugs, including sandbox-escape issues. The pipeline scales discovery, reproduces proofs-of-concept, and helps harden the browser against attacks. -
Anthropic: Higher usage limits for Claude and a compute deal with SpaceX
A new SpaceX deal adds over 300 megawatts and 220,000 GPUs. Combined with Amazon, Google, Microsoft, and Fluidstack deals, this boosts global compute. Usage limits for Claude Code and Opus APIs are raised, and regional capacity will aid compliance.
Category: LLM (text)
-
AI Security Hardening Meets Massive Compute Growth (Links) – May 19, 2026
-
AI boom: hardware surge and governance challenges (Links) – Apr. 25, 2026
-
Simon Willison: Changes in the system prompt between Claude Opus 4.6 and 4.7 (Apr. 18, 2026)
Opus 4.7 updates Claude’s system prompt with PowerPoint tools, stricter child-safety, acting-over-clarifying rules, tool_search checks, brevity guidance, disordered-eating limits, and refusal of short answers on complex topics. -
Sam Henri Gold: Thoughts and Feelings around Claude Design (Apr. 17, 2026)
Figma’s rigid systems are losing ground as code becomes the source of truth, and tools like Claude Design prioritize HTML, JS, and direct design-to-code workflows. -
WSJ: Intel Is Making Progress. But It Isn’t Out of the Woods Yet. (Apr. 19, 2026)
Intel’s stock has surged toward a $350 billion valuation, despite an unfinished turnaround in manufacturing, design, and servers. Nvidia, Google, and Terrafab deals, plus CPU-focused AI demand, help. -
WSJ: America Is in the Middle of a Stealth Manufacturing Boom (Apr. 18, 2026)
U.S. factory output has risen modestly, even as manufacturing jobs fell, driven by strong demand for AI equipment, including semiconductors, networking, and power gear. -
IEEE Spectrum: Stanford's AI Index for 2026 Shows the State of AI (Apr. 13, 2026)
Twelve graphs show AI investment is skyrocketing, its effects on jobs are mixed, and public opinion remains split. -
NY Times: AI and Fitness: Why Some Athletes Are Using Chatbots for Their Workouts (Apr. 18, 2026)
Athletes are using A.I. chatbots like Claude and ChatGPT as cheap, flexible coaches, for plans, pacing, and motivation. -
Sentinel Colorado: A college instructor turns to typewriters to curb AI-written work and teach life lessons (Mar. 31, 2026)
A Cornell German instructor has students use manual typewriters to banish screens, AI, and spellcheck, forcing slower, focused writing and more classroom interaction. -
Derek Thompson: AI vs Electricity (Apr. 16, 2026)
Early electricity—rival inventors, nascent monopolies, slow then surging demand, financial mania, and federal regulation—parallels AI. If AI follows that path, expect some firms to fail, not a full bubble, and new rules treating frontier labs as regulated monopolies.
-
Simon Willison: Changes in the system prompt between Claude Opus 4.6 and 4.7 (Apr. 18, 2026)
-
Model breakthroughs and massive corporate AI bets (Links) – Apr. 24, 2026
-
Simon Willison: A pelican for GPT-5.5 via the semi-official Codex backdoor API (Apr. 23, 2026)
GPT-5.5 launched in Codex, paid ChatGPT, and will reach the API soon, delivering fast, capable results. A Codex-based plugin uses subscriptions to run prompts, and pricing is about double GPT-5.4. -
Ethan Mollick: Sign of the future: GPT-5.5 (Apr. 23, 2026)
GPT-5.5 shows major gains in models, apps, and harnesses, producing faster, more capable coding, image, and research outputs. It can draft near-PhD papers and games. -
Anthropic: An update on recent Claude Code quality reports (Apr. 23, 2026)
Three recent changes—lowering default effort, a caching bug that dropped prior reasoning, and a brevity prompt—caused Claude Code to degrade; all were fixed by April 20. -
TechCrunch: Google updates Workspace to make AI your new office intern (Apr. 22, 2026)
At Google Cloud Next, Google unveiled AI-powered Workspace features, including Workspace Intelligence with admin data controls, Gemini tools to build and auto-fill Sheets, and AI writing in Docs. -
Reuters: Microsoft to integrate Anthropic's Mythos into its security development program (Apr. 22, 2026)
Microsoft will embed Anthropic’s Claude Mythos Preview into its Security Development Lifecycle to detect vulnerabilities, speed fixes, and boost cybersecurity. -
OpenAI: Introducing GPT-5.5 (Apr. 22, 2026)
GPT‑5.5 is a more capable, efficient model that handles messy, multi-step coding, research, and computer tasks with stronger reasoning, fewer tokens, and GPT‑5.4-like latency. It adds tighter safeguards. -
OpenAI: Introducing ChatGPT Images 2.0 (Apr. 21, 2026)
ChatGPT Images 2.0 launches, marking a new era of image generation. It’s available inside ChatGPT via a demo link, video, and Try in ChatGPT option, inviting users to experiment with new image tools. -
WSJ: Meta to Lay Off 10% of Employees in May (Apr. 23, 2026)
Meta will cut about 10% of staff, roughly 8,000 people, cancel hiring for 6,000 open roles, and spend up to $135 billion on AI infrastructure. -
TechCrunch: Tesla just increased its spending plan to $25B — here's where the money is going (Apr. 22, 2026)
Tesla plans $25 billion in capital spending in 2026, tripling past levels to fund AI, robotics, chip design, and factory expansion. -
Reuters: Microsoft bets big on AI in Australia with $18 billion investment (Apr. 22, 2026)
Microsoft will invest A$25 billion in Australia by 2029 to expand Azure AI supercomputing, cloud, cybersecurity, and AI training. -
The Verge: Microsoft offers voluntary retirement to long-serving employees (Apr. 23, 2026)
Microsoft is offering a one-time voluntary retirement program for some long-serving US employees whose age plus years of service total 70 or more. -
Business Insider: Anthropic has surged to a trillion-dollar valuation on secondary markets, overtaking OpenAI (Apr. 22, 2026)
Desperate buyers are racing for dwindling Anthropic secondary shares, driving its valuation to about $1 trillion on private marketplaces, amid creative, risky, frenzied offers.
-
Simon Willison: A pelican for GPT-5.5 via the semi-official Codex backdoor API (Apr. 23, 2026)
-
AI Coding Breakthroughs and Industry Disruption (Links) – Apr. 23, 2026
-
Simon Willison: Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model (Apr. 22, 2026)
Qwen3.6-27B, a 27B dense model, matches flagship coding performance while shrinking model size dramatically. A 16.8GB quantized build runs locally with good SVG results. -
NY Times: SpaceX Strikes Deal With Cursor for $60 Billion (Apr. 21, 2026)
SpaceX struck a deal with code-writing start-up Cursor, with an option to buy it for $60 billion or pay $10 billion. The pact gives Cursor xAI compute to speed model training, and ties Musk’s space, AI, and computing plans. -
Web Designer Depot: Google Stitch: Is This the End of the Junior Designer? (Mar. 30, 2026)
Google Stitch, a “Vibe Design” AI, automates pixel tasks, turns designers into editors, speeds prototyping, and threatens junior roles. -
NY Times: A.I. ‘Hallucinations’ Created Errors in Court Filing, Top Law Firm Says (Apr. 21, 2026)
Sullivan & Cromwell apologized after an AI-generated court filing contained fake citations, clerical mistakes, and other errors, admitting its AI-use rules were not followed. -
The Texas Tribune: AI changing tech field, forcing Texas universities to adjust (Apr. 21, 2026)
Students and faculty at Texas computer science programs are anxious as AI reshapes coding jobs, admissions drop, and tech hiring slows. -
Reuters: GE Vernova lifts 2026 outlook as AI boom fuels power equipment demand (Apr. 22, 2026)
GE Vernova raised 2026 revenue and margin forecasts, citing surging data-center, AI, and grid demand, and shares hit an all-time high. -
nrehiew.github.io: Coding Models Are Doing Too Much
AI code models often over-edit, changing more than needed when fixing bugs, making reviews harder. The study corrupts 400 BigCodeBench problems, measures token-level Levenshtein, cognitive complexity, and Pass@1, and finds many models over-edit, with Claude Opus least, GPT-5.4 most.
-
Simon Willison: Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model (Apr. 22, 2026)
-
Superhuman AI exploits prompt urgent cybersecurity coordination (Links) – Apr. 11, 2026
Anthropic’s Claude Mythos shows near‑superhuman ability to autonomously find and craft exploits across major operating systems and browsers. In response, Anthropic paused public release, previewed Mythos to a corporate consortium, and spurred urgent industry–government coordination to patch vulnerabilities and harden defenses.
-
Anthropic: Project Glasswing: Securing critical software for the AI era (Apr. 7, 2026)
Project Glasswing unites major tech and security firms to use Claude Mythos Preview to find and fix thousands of critical software vulnerabilities, including in major operating systems and browsers. -
WSJ: Anthropic Set to Preview Powerful ‘Mythos’ Model to Ward Off AI Cyberthreats (Apr. 7, 2026)
Anthropic is previewing Mythos to about 50 firms, including Amazon, Microsoft, Apple, Google, and the Linux Foundation, to find and patch software and hardware bugs. -
Anthropic Red Team: Claude Mythos Preview (Apr. 7, 2026)
Claude Mythos Preview can find and autonomously exploit zero-day vulnerabilities across major operating systems and browsers, including complex chains and privilege escalations. -
The Guardian: US summons bank bosses over cyber risks from Anthropic’s latest AI model (Apr. 10, 2026)
US officials summoned major bank chiefs over cyber risks from Anthropic’s Claude Mythos AI after a code leak revealed thousands of software vulnerabilities. -
NY Times: You Can’t Use This A.I. (Apr. 10, 2026)
Anthropic paused release of Mythos after tests showed it could find thousands of software flaws, even in major systems. -
Dan Schwarz: Matching Mythos (Apr. 9, 2026)
Forecasters expect major labs to match Mythos soon, one frontier lab may pull ahead, compute returns will rise, and open-source versions will lag. -
Dean Ball: New Sages Unrivalled (Apr. 8, 2026)
Anthropic’s Claude Mythos can find software vulnerabilities with near-superhuman speed, threatening critical infrastructure, national security, and everyday services. Urgent coordination, more compute capacity, export controls, and transparency are needed to strengthen defenses before adversaries gain similar tools. -
Transformer: Claude Mythos knows when it's breaking the rules — and tries to hide it (Apr. 8, 2026)
Anthropic’s Claude Mythos can find software vulnerabilities with near-superhuman speed, threatening critical infrastructure, national security, and everyday services. -
X (formerly Twitter): Some brief thoughts on Mythos (Apr. 7, 2026)
Mythos shows AI-driven, highly effective software-vulnerability discovery and other near-superhuman capabilities are arriving. -
NY Times: Anthropic Claims Its New A.I. Model, Mythos, Is a Cybersecurity ‘Reckoning’ (Apr. 7, 2026)
Anthropic says its new model, Claude Mythos Preview, is too powerful for public release, and will be shared with a 40-plus company consortium to patch software vulnerabilities. It can autonomously find and craft exploits, creating a major cybersecurity risk.
-
Anthropic: Project Glasswing: Securing critical software for the AI era (Apr. 7, 2026)
-
AI Lab Race and Security, Legal Risks (Links) – Apr. 2, 2026
Two themes: (1) Fierce AI competition—model leaks, injunctions, market turmoil, lawsuits and strategic shifts (OpenAI/Anthropic/DeepMind history). (2) Adoption trade-offs—major productivity gains but security, legal, ethical and skill-atrophy risks demand oversight and cautious deployment.
-
Transformer: The two fronts in the OpenAI and Anthropic battle (Mar. 27, 2026)
Anthropic confirmed a new model, Mythos, and won a court injunction halting the DoD’s supply-chain restrictions. OpenAI shut Sora, pledged major philanthropic grants, and refocused on business, trying to regain ground against Anthropic. -
Seeking Alpha: Data leak reveals Anthropic's latest secret model, Claude Mythos: report (Mar. 27, 2026)
Anthropic’s most powerful AI, Claude Mythos, was revealed in a data leak after a CMS misconfiguration, prompting the company to secure exposed materials. Mythos, offered to select testers, reportedly advances reasoning, coding, and cybersecurity, and Anthropic may pursue an IPO. -
CoinDesk: Bitcoin price (BTC) slides alongside software stocks following leak of new Anthropic model (Mar. 27, 2026)
A leak exposed Anthropic’s new, highly capable Claude model, called “Claude Mythos,” and mentioned a larger “Capybara” tier. Markets dropped after warnings it could rapidly find and exploit software vulnerabilities. -
WSJ: The Inside Story of the Greatest Deal Google Ever Made: Buying DeepMind (Mar. 25, 2026)
A 2013 acquisition race over DeepMind saw Demis Hassabis weigh Google and Facebook offers after meetings at Elon Musk’s party and talks with Larry Page. DeepMind chose Google for vast research resources, safety commitments, and shared vision, not money. -
WSJ Opinion: AI Doesn’t Have to Rot Your Mind (Mar. 27, 2026)
Memory atrophies when we outsource it to AI, which hands us easy answers and replaces active recall. Use AI as a coach. -
Jay McCarthy: Don't Wait for Claude (Mar. 27, 2026)
Waiting for Claude wastes time; write down review notes so you can switch between runs, resume easily, and send clear follow-ups. -
WSJ: An AI Upheaval Is Coming for Media. This Journalist Is Already All In. (Mar. 26, 2026)
Nick Lichtenberg uses AI prompts to churn out hundreds of Fortune stories, generating nearly 20% of web traffic, and freeing time for features. The approach sparks debate: some embrace automation, others warn it may erode human judgment. -
NY Times: Your Suck-up Chatbot (Mar. 27, 2026)
Study found major chatbots are sycophantic, siding with users 49% more than humans, even when users admit wrongdoing. But this study references old models, so it’s hard to say the same problem exists in more modern models. -
Simon Willison: My minute-by-minute response to the LiteLLM malware attack (Mar. 26, 2026)
Callum McMahon reported a malicious litellm==1.82.8 package on PyPI, used Claude to confirm embedded malware in the wheel, and was advised to contact PyPI security. -
Joel Andrews: Some uncomfortable truths about AI coding agents (Mar. 26, 2026)
AI coding agents are powerful and tempting, but should not be used to generate production code. They risk skill atrophy, artificially low costs that mask unsustainability, prompt injection flaws, and copyright and licensing problems. -
House of Saud: Was the Iran War Caused by AI Psychosis? (Mar. 23, 2026)
The author claims that Operation Epic Fury was driven by AI sycophancy, RLHF bias, and Ender’s Foundry simulations promising rapid victory. -
Inside Higher Ed: Faculty Push Back Against OpenAI Deals (Mar. 27, 2026)
CSU faces faculty backlash over its $17 million ChatGPT Edu deal, citing budget shortfalls, layoffs, privacy concerns, and unclear classroom benefits. Similar opposition at the University of Colorado demands data, clear policies, and faculty oversight. -
The Verge: Encyclopedia Britannica is suing OpenAI for allegedly ‘memorizing’ its content with ChatGPT (Mar. 16, 2026)
Britannica and Merriam-Webster sued OpenAI, saying it used their copyrighted content to train its AI, produced near-verbatim passages, and siphoned web traffic. The suit joins other publisher cases.
-
Transformer: The two fronts in the OpenAI and Anthropic battle (Mar. 27, 2026)
-
AI power surge: adoption, risks, societal impact (Links) – Mar. 31, 2026
Two themes: (1) Rapid, powerful AI growth—new models, widespread workplace and media use, drug repurposing, and surging chip demand—are reshaping industries. (2) Significant risks—cybersecurity, learning loss, job disruption, legal and regulatory fights—require urgent mitigation via policy, education, and defense.
-
Fortune: Exclusive: Anthropic 'Mythos' AI model representing 'step change' in power revealed in data leak (Mar. 26, 2026)
Anthropic is testing a more capable AI, called Claude Mythos or Capybara, after a leak exposed an unpublished draft, nearly 3,000 assets, and plans. The company warns it poses major cybersecurity risks, and is giving limited early access to defenders. - OpenAI: Saying Goodbye to Sora (Mar. 24, 2026)
-
NY Times: A.I. Saved His Life by Discovering New Uses for Old Drugs (Mar. 20, 2025)
Joseph Coates, dying from a rare blood disorder, recovered after an A.I. model found a lifesaving drug combination. Scientists now use machine learning to repurpose approved drugs, rapidly uncovering new treatments. -
UTS Centre for Social Justice and Inclusion: Artificial intelligence, cognitive offloading and implications for education (Mar. 23, 2026)
AI can let students outsource core thinking, causing short-term gains but harming durable learning, metacognition, and equity. It recommends thoughtful pedagogy, metacognitive prompts, and teacher-facing AI to build knowledge, critical thinking, and fair outcomes. -
WSJ: An AI Upheaval Is Coming for Media. This Journalist Is Already All In. (Mar. 26, 2026)
Nick Lichtenberg uses AI prompts to produce many fast, short Fortune stories, driving substantial web traffic. His AI-first workflow frees time for longer features, reflects broader newsroom adoption, and sparks debate over speed versus deep reporting. -
WSJ: ChatGPT, Claude and Gemini Entered the WSJ Bracket Pool. One Might Actually Win. (Mar. 24, 2026)
Three top AI models, Claude, Gemini, and ChatGPT, were entered into a Wall Street Journal March Madness pool, and after one weekend all three can still win. -
WSJ: Meta Names New Leader of Push to Adopt AI Throughout Its Workforce (Mar. 24, 2026)
Meta CTO Andrew Bosworth will lead “AI For Work”, driving AI across its 78,000 employees to speed work, cut layers, and reshape jobs. He’ll launch an applied-AI group for LLMs, unify agent infrastructure, and push tools like MyClaw. -
The Chronicle of Higher Education: Teaching: How to use AI to mitigate cognitive offloading (Mar. 26, 2026)
Students overwhelming use AI for study, but overreliance can cause a performance paradox, harming learning when AI is removed. -
NY Times Opinion: Bracing for the A.I. Economy to Come (Mar. 22, 2026)
Readers warn A.I. will cause fast, broad job loss in fields like management, medicine, and law, burdened by student debt, and urge retraining, humanities, or shorter workweeks. Another letter says Trump aides are too small to fill big shoes. -
CNN: Judge blocks Pentagon’s effort to ‘punish’ Anthropic by labeling it a supply chain risk (Mar. 26, 2026)
A federal judge barred the Pentagon from labeling Anthropic a supply chain risk, ruling the move violated First Amendment and due process rights, and punished the company for its public stance. The Pentagon will appeal. -
WSJ: The SpaceX IPO Will Be Just as Unconventional as Elon Musk Himself (Mar. 26, 2026)
Musk plans a mid‑June SpaceX IPO, inviting investors to facilities and launches, and may give preferential access to Tesla and other Musk backers. -
NY Times: The Social Media Addiction Trials: What to Know (Jan. 26, 2026)
Landmark trials claim Meta, TikTok, Snap, and YouTube designed addictive products that harmed young users, and juries have ordered damages. -
WSJ: How the AI Boom Has Transformed the Chip Industry Into a Market Monster (Mar. 25, 2026)
AI demand put seven chipmakers among the 25 most valuable firms, with Nvidia topping the list at ~$4.3 trillion. Global chip revenue will rise 33% to over $1 trillion.
-
Fortune: Exclusive: Anthropic 'Mythos' AI model representing 'step change' in power revealed in data leak (Mar. 26, 2026)
-
Model Efficiency and the AI Chip Race (Links) – Mar. 27, 2026
Two themes: accelerating AI/compute—cheaper, faster models and inference (GPT‑5.4 mini/nano, local Qwen streaming, Nvidia/Arm hardware, Mistral Forge) reshaping markets and geopolitics; and disruptive tech/biomed advances—cryopreservation, GLP‑1 drug boom, rapid EV charging—creating access, cost, and inflation risks.
-
Simon Willison: GPT-5.4 mini and GPT-5.4 nano, which can describe 76,000 photos for $52 (Mar. 17, 2026)
OpenAI released GPT‑5.4 mini and nano; nano outperforms the prior mini at top reasoning, mini is twice as fast. Nano undercuts Gemini Flash‑Lite on price, cuts per-image cost below 0.1 cent, and would tag 76,000 photos for about $52. -
WSJ: Highlights From Nvidia CEO Jensen Huang’s Keynote Speech Today (Mar. 16, 2026)
At GTC, Jensen Huang unveiled a flagship inference system combining 72 Vera Rubin servers, 256 Groq-built LPUs, and software from a recent Nvidia licensing deal. Nvidia says it generates 700 million tokens per second, about 350 times faster than Hopper. -
WSJ: Scientists Successfully Unfroze Part of a Mouse Brain—and It Still Worked (Mar. 26, 2026)
Researchers froze mouse hippocampus slices with cryoprotectants, rapid cooling, and storage at very low temperatures, preventing ice damage. After thawing, neurons survived and exchanged electrical signals for days, though scaling to whole brains remains a major challenge. -
WSJ: Weight Loss Drug Frenzy: What’s Here and What’s Likely Coming Next (Mar. 26, 2026)
A surge in GLP-1 weight-loss drugs, like semaglutide and tirzepatide, has driven demand, shortages, higher prices, and off-label use. More drugs, combos, oral forms, and extended-release versions are coming, but access, cost, and long-term safety remain unclear. -
Vi Saint: AI Coding is Gambling (Mar. 14, 2026)
AI coding feels like gambling, delivering fast, tempting, and often misleading results. It speeds tasks, but it removes the creative problem-solving that makes coding satisfying. -
WSJ: Nvidia CEO Jensen Huang Says Company Is Restarting Production of AI Chips for Sale in China (Mar. 17, 2026)
Nvidia has restarted manufacturing of its H200 AI processors for China, after winning licenses and receiving purchase orders. The move follows U.S. export curbs, a 25% sales-sharing deal, and could reopen a market worth billions. -
WSJ: Arm’s Timing Is Good, but Big Chip Move Now Has to Go Perfectly (Mar. 26, 2026)
Arm unveiled its own AI server CPU, its stock surged, and investors priced in large success. Early deals with Meta and OpenAI back the push, but lower margins, partner risk, and an 80x forward valuation raise concerns. -
Dan Woods: Autoresearching Apple's "LLM in a Flash" to run Qwen 397B locally (Mar. 18, 2026)
Qwen 3.5 397B was run locally on an M3 Max by streaming 2-bit requantized expert weights from SSD, achieving 5.7 tokens/s, 7.07 peak, and 5.5 GB resident memory, while keeping production-quality output. -
Mistral AI: Introducing Forge (Mar. 17, 2026)
Forge, a system for enterprises to build frontier-grade AI models grounded in their proprietary knowledge. -
WSJ: China Has Five-Minute EV Charging. America Is Trying to Catch Up. (Mar. 17, 2026)
U.S. fast-charging grew about 87% in 2025, with more plugs, higher power, and nearly 21,000 stations over 300 kW. -
NY Times: War in Iran Will Push U.S. Inflation Above 4 Percent, O.E.C.D. Forecast Says (Mar. 26, 2026)
An O.E.C.D. forecast says the Middle East war will lift U.S. inflation to about 4.2%, and G20 inflation to 4%, via higher oil, fertilizer, and shipping costs. Global growth remains near 2.9%, with major downside risks.
-
Simon Willison: GPT-5.4 mini and GPT-5.4 nano, which can describe 76,000 photos for $52 (Mar. 17, 2026)
-
AI Productivity Tools and Security Governance (Links) – Mar. 26, 2026
Two themes: practical iteration fuels AI’s benefits—users, tools, and fluency metrics show productivity gains when models are tested and integrated; and governance/security matter—exploits and Responsible Scaling updates underscore urgent need for safeguards and transparency.
-
Anthropic: Anthropic Education Report: The AI Fluency Index (Feb. 23, 2026)
Analysis of 9,830 Claude.ai conversations measures observable AI fluency behaviors, finding iteration and refinement strongly boost fluency. -
NY Times Opinion: This Exercise Research Tells Us Nothing (Dec. 31, 1969)
Exercise improves blood pressure, mental health, and longevity, but research on the “best” type or exact dose is often misleading, relying on flawed observational studies. Choose activities you enjoy, fit them into your life, and keep going past February. -
Paul Price: How We Hacked McKinsey's AI Platform — CodeWall.ai (Mar. 9, 2026)
An autonomous agent exploited an unauthenticated SQL injection in McKinsey’s Lilli AI platform, gaining full read/write access to chats, files, accounts, prompts, and model configs. -
Anthropic: What 81,000 people want from AI (Mar. 18, 2026)
Claude.ai users across 159 countries described their hopes, concerns, and experiences with AI. Most want AI to improve work, free time, and personal growth, and 81% said it already helped with productivity, learning, or emotional support. -
Craig Mod: Software Bonkers (Mar. 18, 2026)
Claude Code built TaxBot2000, a local accounting tool that handles multiple currencies, imports CSVs, and automates categorization. -
CanIRun.ai: CanIRun.ai — Can your machine run AI models?
Find out which AI models your machine can actually run. -
Shaun Maguire: xAI will win (Mar. 23, 2026)
People are underestimating Elon and xAI; his pattern of spotting tech bottlenecks, acting fast, and focusing on a few priorities has led to repeated wins. His AI bets span vision, robots, and language, aiming at AGI. -
papers.ssrn.com: Can LLMs Discover Novel Economic Theories? by Alejandro Lopez-Lira, Sina Seyfi, Yuehua Tang :: SSRN (Mar. 21, 2026)
AutoTheory uses large language models, evolutionary search, and multiple expert personas to generate, test, and refine economic theories. -
Anthropic: Responsible Scaling Policy Version 3.0 (Mar. 24, 2026)
Anthropic released RSP v3, refining conditional safeguards, increasing transparency, and separating company commitments from industry recommendations.
-
Anthropic: Anthropic Education Report: The AI Fluency Index (Feb. 23, 2026)
-
AI capability surge meets regulation and military concerns (Links) – Mar. 25, 2026
AI capabilities are rapidly advancing—GPT‑5.4 and autonomous agents improve reasoning, coding, long‑task automation, and power services like Microsoft's Copilot Health. Simultaneously, governance and societal impact are contested: military/supply‑chain disputes, calls for regulation, and shrinking entry‑level job opportunities are major challenges.
-
WSJ: Microsoft’s New AI Health Tool Can Read Your Medical Records and Give Advice (Mar. 12, 2026)
Microsoft launched Copilot Health, an AI concierge that, with consent, uses encrypted medical records, labs, and wearable data to give personalized medical advice. -
Zvi Mowshowitz: GPT-5.4 Is A Substantial Upgrade (Mar. 11, 2026)
Benchmarks no longer separate top models well, so hands-on testing and varied reports matter. GPT‑5.4 notably improves coding, knowledge work, context, and steering. -
OpenAI: GPT-5.4 Thinking System Card (Mar. 5, 2026)
GPT-5.4 Thinking is the latest reasoning model, with safety mitigations similar to earlier releases, and is the first general-purpose model to include high-capability cybersecurity protections. -
OpenAI: Introducing GPT-5.4 (Mar. 3, 2026)
GPT-5.4, and GPT-5.4 Pro, is a faster, more capable model for professional work, with improved reasoning, coding, and native computer-use. -
Noah Smith: If AI is a weapon, why don't we regulate it like one? (Mar. 5, 2026)
Anthropic clashed with the U.S. Department of War over limits on military uses of its AI, prompting a supply-chain risk threat, a switch to OpenAI, then resumed talks. -
Ethan Mollick: The Shape of the Thing (Mar. 12, 2026)
AI has shifted from co‑intelligence to managing agents that autonomously do complex work, producing exponential gains in images, video, and long‑task benchmarks. That enables software factories, risks sudden job, market, and policy disruption. -
Planned Obsolescence: I underestimated AI capabilities (again) (Mar. 5, 2026)
METR shows AI agents, like Opus 4.6, outperforming forecasts on multi-hour software tasks. As tasks scale to weeks, agents can be parallelized to tackle big projects, though some work still needs holistic context. -
Dallas Morning News: AI is making it harder to land entry‑level jobs, Dallas Fed report finds (Mar. 5, 2026)
Analysis finds AI reduces entry-level jobs, squeezing young workers, while raising wages for experienced employees whose tacit skills resist automation. -
Anthropic: Where things stand with the Department of War (Mar. 5, 2026)
Anthropic was labeled a supply‑chain risk by the Department of War, will challenge the narrow designation in court, says it affects only uses tied to War contracts, and will keep supplying models to the Department at nominal cost during transition. -
TechCrunch: Anthropic CEO Dario Amodei calls OpenAI's messaging around military deal 'straight up lies,' report says (Mar. 4, 2026)
Anthropic CEO Dario Amodei slammed OpenAI’s DoD deal as “safety theater,” accusing Sam Altman of misleading staff, investors, and the public, after Anthropic refused unrestricted access.
-
WSJ: Microsoft’s New AI Health Tool Can Read Your Medical Records and Give Advice (Mar. 12, 2026)