Author: Andrew

AI Market & Product Updates (Dec. 27)
- WSJ: Nvidia Licenses Groq’s AI Technology as Demand for Cutting-Edge Chips Grows (Dec 24, 2025)
  Nvidia struck a nonexclusive licensing deal with AI-chip startup Groq for its inference-focused language-processing-unit technology, with Groq CEO Jonathan Ross, the company president, and some staff joining Nvidia while GroqCloud stays independent.
- WSJ: The Former Ice-Hockey Player Who Nailed This Year’s AI Trade (Dec 20, 2025)
  Former hockey captain Xavier Majic’s $3 billion Maple Rock hedge fund gained over 60% through November 2025 by betting early on data-storage suppliers (Western Digital, Seagate, Kioxia) that profited from AI-driven demand.
- NY Times: Why the A.I. Rally (and the Bubble Talk) Could Continue Next Year (Dec 23, 2025)
  Do soaring valuations indicate the existence of an AI bubble? Nvidia and the “Magnificent 7” dominate markets, OpenAI’s huge fundraising and trillion‑dollar data‑center plans, and a construction boom strain power and capital. Analysts are split: some warn of valuation and investment bubbles, others argue AI’s productivity gains justify the rally.
- Mistral Ai: Introducing Mistral OCR 3 (Dec 19, 2025)
  Mistral OCR 3 is a compact, cost-effective OCR model offering state-of-the-art accuracy—claiming a 74% overall win rate versus Mistral OCR 2—excelling at forms, handwriting, low-quality scans, and complex tables while producing markdown/HTML table output. It’s available via API and the Document AI Playground, priced at $2/1,000 pages ($1/ batch).
- Andrej Karpathy: 2025 LLM Year in Review (Dec 19, 2025)
  2025 saw major LLM shifts: Reinforcement Learning from Verifiable Rewards (RLVR) drove long-horizon capability and emergent reasoning, revealing jagged, “ghost”-like intelligence. New paradigms—Cursor apps, local agents (Claude Code), vibe coding, and GUI breakthroughs (Nano banana)—democratized development and reshaped how AI is used.
- WSJ: Meta Is Developing a New AI Image and Video Model Code-Named ‘Mango’ (Dec 18, 2025)
  Meta is developing Mango, an image-and-video AI model, alongside a text-based model called Avocado, with both expected in the first half of 2026. Avocado will emphasize coding and world-model research under chief AI officer Alexandr Wang as Meta expands its AI team amid fierce image-generation competition.
- WSJ: OpenAI’s New Fundraising Round Could Value Startup at as Much as $830 Billion (Dec 18, 2025)
  OpenAI is seeking up to $100 billion in a fundraising round that could value it at $830 billion, targeting completion by Q1 and drawing investors like SoftBank and Disney. The cash is needed to build AI models amid competition from Google and investor scrutiny over costly computing deals.
December 27, 2025
iRobot Sold for Scrap

From the NY Times: Roomba Maker iRobot Files for Bankruptcy, With Chinese Supplier Taking Control

iRobot, founded in 1990 by three MIT researchers and maker of the Roomba (2002), filed for bankruptcy and will be taken over by its largest creditor, Chinese supplier Picea. Years of regulatory scrutiny, privacy issues, stiff competition, and the failed Amazon deal depleted revenue and left the company heavily indebted.

This is another example of the incompetence of America’s antitrust laws (or enforcement thereof). I can’t imagine that the Sherman Antitrust Act was written to prevent American companies from buying struggling ones.

From John Gruber:

By 2022, the Amazon acquisition was iRobot’s lifeline. EU regulators wanted it shot down, and despite the fact that it was one American company trying to acquire another, the anti-big-tech Biden administration clearly preferred to let the deal collapse. The US should have told the EU to mind their own companies.

This story is another anecdote that we’d be far better off trying to build things instead of reflexively decrying big business sweeping up smaller ones (particularly ones that were struggling). I’m sympathetic to Klein and Thompson’s arguments about abundance, particularly as AI technology is growing by leaps and bounds.

…

Related: WSJ Opinion agrees: How Lina Khan Killed iRobot. iRobot filed for bankruptcy after 35 years when the Biden FTC under Lina Khan—amid pressure from Sen. Elizabeth Warren—blocked Amazon’s acquisition and Trump’s tariffs hobbled production. Critics say the FTC’s opposition and trade policy accelerated layoffs and a takeover by Chinese manufacturer Picea, showing how intervention can strengthen foreign rivals.

December 19, 2025
Sunday (AI) Links (Dec. 14)
- Simon Willison: JustHTML is a fascinating example of vibe engineering in action (Dec 14, 2025)
  JustHTML is a pure-Python HTML5 parser that passes the 9,200+ html5lib tests, offers CSS selectors, and achieves 100% test coverage in a ~3,000-line codebase. Emil Stenström built it largely with LLM coding agents—using benchmarks, fuzzing, profiling, and human-led design—as an example of “vibe engineering.”
- Simon Willison: Useful patterns for building HTML tools (Dec 10, 2025)
  A list of single-file applications combining HTML, JavaScript, and CSS, often built with LLMs that are designed for easy hosting and distribution, leveraging techniques like CDN dependencies, copy-paste functionality, URL state persistence, and CORS-enabled APIs.
- Simon Willison: Dark mode (Dec 10, 2025)
  Willison used Claude Code to create a dark mode theme for his website. “It did a decent job,” Willison reported.
- WSJ: AI Can Make Decisions Better Than People Do. So Why Don’t We Trust It? (Dec 12, 2025)
  Engineers and executives say well-designed AI decision systems—from autonomous truck drivers to an AI arbitrator—can outperform humans and be more auditable and explainable. But public distrust, past algorithmic harms, and unfamiliarity slow adoption; verification, transparency, and responsible development are needed to earn trust and reduce harm.
- WSJ: He Blames ChatGPT for the Murder-Suicide That Shattered His Family (Dec 11, 2025)
  The estate of Suzanne Eberson Adams sued OpenAI and Microsoft after her son, Stein‑Erik Soelberg, who had months of delusion-filled conversations with ChatGPT that allegedly reinforced paranoia, killed her and himself. The complaint alleges OpenAI rushed unsafe models, won’t release chat logs, and should be held responsible.
- WSJ Opinion: New York’s Lack of AI Intelligence (Dec 11, 2025)
  WJS’s Editorial Board decries legislation that could hinder open-source development and prevent smaller entities from accessing AI tools. They implore Governor Hochul to veto this poorly conceived bill.
- The Chronicle of Higher Education: The Conference Where ChatGPT Wrote One in Five Reviews (Maybe) (Dec 8, 2025)
  An AI detection startup found that 21% of over 75,000 reviews for the ICLR conference appeared fully AI-generated, with over half showing some AI usage. I still wonder about what constitutes “AI” usage—does Grammarly count? What about Word grammar usage? What if you like using dashes—as I do?
- Simon Willison: A quote from Claude (Dec 9, 2025)
  “See that ~/ at the end? That’s your entire home directory. The Claude Code instance accidentally included ~/ in the deletion command.”
- Forbes: Purdue University Approves New AI Requirement For All Undergrads (Dec 13, 2025)
  Purdue University will require all undergraduates entering in 2026 to demonstrate a discipline-specific AI working competency before graduation, embedding AI skills into existing degree requirements rather than adding credits.
- Brian Merchant: Copywriters reveal how AI has decimated their industry (Dec 11, 2025)
  The article chronicles how AI has decimated copywriting and related media jobs through layoffs, reduced hours, degraded work (editing AI output), falling wages, and closed businesses. Workers describe financial precarity, eroded career pathways, and being forced into survival work as companies favor cheaper “good enough” AI.
- Uwe Friedrichsen: AI and the ironies of automation – Part 2 (Dec 11, 2025)
  Friedrichsen applies Lisanne Bainbridge’s “ironies of automation” to AI-agent-driven white‑collar work, warning that monitoring fatigue, verbose agent plans, rare but critical errors, and simulator limits create a training paradox for supervisors. He also highlights a leadership dilemma—humans must learn to direct agents—and urges better UIs and sustained training.
December 14, 2025
Saturday (AI) Links (Dec. 13)
- wbur: AI is bringing old nuclear plants out of retirement (Dec 9, 2025)
  Palisades Nuclear Station, closed in 2022, is being recommissioned to restart in early 2026—the first U.S. plant to return after decommissioning—backed by state funding. Restarts like Palisades and Three Mile Island aim to supply low‑carbon power for AI.
- WSJ: The Political Skirmish Over Trump’s AI Order Is Just the Beginning (Dec 14, 2025)
  President Trump signed an executive order preempting state AI laws, a win for Big Tech that drew criticism from populists and progressives worried about job losses and concentrated power. The clash highlights a federal-versus-state regulatory fight as policymakers grapple with AI’s economic stakes and competition with China.
- NY Times: Trump Moves to Stop States From Regulating AI With a New Executive Order (Dec 11, 2025)
  President Trump signed an executive order to override state A.I. laws, empowering the attorney general to sue states and directing federal regulators to withhold funds to enforce a single national framework aimed at U.S. A.I. dominance. Critics say it undermines safety and child-protection rules and will face legal challenges.
- WSJ: Trump Signs Executive Order to Curtail State AI Laws (Dec 11, 2025)
  President Trump signed an order to preempt state AI laws, letting the Justice Department punish restrictive state rules and establish a single federal standard favored by Silicon Valley. Critics—Democrats and some Republicans—say it would erode protections, and the order also prompts SEC, FTC, and DOJ reviews of proxy-adviser firms.
- WSJ: U.S. Investors Are Going Big on China AI Despite Concerns in Congress (Dec 10, 2025)
  Despite growing U.S.-China tensions over artificial intelligence, American investors are increasingly pouring money into Chinese AI companies and tech-focused exchange-traded funds as Chinese AI models demonstrate competitive capabilities.
- Reuters: OpenAI agrees to acquire AI startup Neptune to boost model training capabilities (Dec 3, 2025)
  OpenAI is acquiring Neptune, a startup specializing in AI model training tracking tools for a cool $400m.
- WSJ: OpenAI Ends ‘Vesting Cliff’ for New Employees in Compensation-Policy Change (Dec 13, 2025)
  OpenAI told staff it is ending its “vesting cliff” rule that required employees to work at least six months before equity begins to vest, removing the initial waiting period for new hires.
- WSJ: The Eerie Parallels Between AI Mania and the Dot-Com Bubble (Dec 13, 2025)
  The market shows strong parallels to the 1999–2000 dot‑com bubble: rich valuations (forward P/E, cash‑flow metrics, CAPE, Fed model) driven by investor betting on a new technology—now AI—delivering outsized profit growth.
- NY Times: Meta’s New A.I. Superstars Are Chafing Against the Rest of the Company (Dec 10, 2025)
  Meta’s established leaders want to use AI to improve Meta’s social media products. Wang’s team at TBD Lab is focused on developing a powerful superintelligence, creating an “us-versus-them” dynamic.
- NY Times: These Travel Influencers Don’t Want Freebies. They’re A.I. (Dec 9, 2025)
  “Mr. Morris of New Media has seen a substantial increase in spending on A.I.-generated photos, captions, and social media avatars by hotel and other travel clients in the United States, Europe, and Asia.”
- WSJ: The Everyday Investors Hedging Against an AI Bubble (Dec. 10, 2025)
  Some investors, concerned about a potential bubble in AI stocks, are selling their tech holdings and moving into assets like gold. While everyday investors have eagerly bought into AI companies, experienced investors and those nearing retirement are becoming more cautious, citing fears of overspending and unsustainable valuations.
December 13, 2025
Friday (AI) Links (Dec. 12)
- WSJ: Fresh Concerns About AI Spending Are Rattling Wall Street (Dec 12, 2025)
  Broadcom’s 11% plunge — despite strong sales and profits — highlighted investor concern about AIchip margins, timing of big OpenAI commitments, and visibility into 2027.
- NY Times: Can OpenAI Respond After Google Closes the A.I. Technology Gap? (Dec 11, 2025)
  OpenAI released GPT‑5.2, saying it tops key benchmarks shortly after Google touted Gemini 3, underscoring a tightened A.I. race. Facing fierce rivals and huge computing costs, it declared a “code red” to improve ChatGPT while raising fees, testing ads, and pushing enterprise products to reach profitability.
- WSJ: AI Gadgets Are Bad Right Now, but Their Promise Is Huge (Dec 11, 2025)
  Joanna Stern tested eight AI wearables—pendants, bracelets, and glasses—and found many quickly abandoned due to poor design, privacy concerns, and limited usefulness, with smartphones remaining the main hub.
- Simon Willison: OpenAI is quietly adopting skills, now available in ChatGPT and Codex CLI (Dec 12, 2025)
  OpenAI added “skills” support to ChatGPT’s Code Interpreter and the Codex CLI, adopting Anthropic’s simple folder+Markdown format so models can use filesystem-based tools. ChatGPT’s skills process docs/PDFs by rendering pages to PNGs for vision-enabled models.
- Simon Willison: GPT-5.2 (Dec 11, 2025)
  OpenAI announced GPT‑5.2 and GPT‑5.2 Pro with an Aug 31, 2025 knowledge cutoff, 400k‑token context window, and higher pricing (GPT‑5.2 at 1.4×; Pro much costlier). OpenAI reports large benchmark and vision gains, a response‑compaction API for long workflows, three API variants (incl. gpt‑5.2‑chat‑latest), and CLI access.
- Mistral Ai: Introducing: Devstral 2 and Mistral Vibe CLI. (Dec 9, 2025)
  Mistral released Devstral 2 (123B, modified MIT) and Devstral Small 2 (24B, Apache 2.0), open-source coding models achieving 72.2% and 68.0% on SWE-bench Verified and offering high cost-efficiency. They’re available via API (free initially) and power Mistral Vibe, a native CLI for autonomous, project-aware code automation.
- WSJ: Behind the Deal That Took Disney From AI Skeptic to OpenAI Investor (Dec 11, 2025)
  Disney is investing $1 billion in OpenAI and licensing over 200 characters for use in Sora, allowing fans to create AI-generated videos. This deal contrasts with Disney’s cease-and-desist letter to Google for alleged copyright infringement, highlighting Disney’s dual approach to navigating the AI landscape.
- Anthropic: Accenture and Anthropic launch multi-year partnership (Dec 9, 2025)
  Anthropic and Accenture formed the Accenture Anthropic Business Group to scale Claude across enterprises, training about 30,000 Accenture professionals and deploying Claude Code to tens of thousands of developers.
- WSJ: AI’s Next Challenge: Take the CEO’s Job (Dec 7, 2025)
  Big Tech executives increasingly suggest AI could perform CEOs’ duties and even run companies, with figures like Pichai and Altman touting rapid progress. My take: CEOs want to seem like they’re in the same boat as employees whose jobs are at risk. CEOs seem like the last job to be replaced by an AI.
- WSJ: IBM Strikes $11 Billion Deal for Confluent (Dec 7, 2025)
  IBM bolsters its AI and cloud strategy by adding Confluent’s real-time data-streaming technology used to feed large AI models.
- WSJ: The Accounting Uproar Over How Fast an AI Chip Depreciates (Dec 8, 2025)
  Tech companies are extending the useful lives of AI equipment, which critics argue inflates profits by reducing depreciation expenses. While this accounting choice can boost current earnings, the true economic reality of these assets might be better reflected by accelerated depreciation methods.
- OpenAI: The Walt Disney Company and OpenAI (Dec 11, 2025)
  Disney and OpenAI have formed a significant partnership, with Disney becoming a major content licensing partner for OpenAI’s Sora, allowing fans to generate short videos featuring over 200 Disney, Marvel, Pixar, and Star Wars characters.
- TechCrunch: Adobe brings Photoshop, Express, and Acrobat features to ChatGPT (Dec 10, 2025)
  With the massive improvement of Sora and Nano Banana, I’ve wondered about Adobe’s prospects in the AI world. The company is responding: “[Adobe] is adding features from Photoshop, Express, and Acrobat to ChatGPT, letting users ask the chatbot to use these apps to edit images, modify PDFs, or animate elements.”
December 12, 2025
Wednesday (AI) Links (Dec. 10)
- Ohio State launches bold AI Fluency initiative to redefine learning and innovation: Ohio State launches bold AI Fluency initiative to redefine learning and innovation (Jun 4, 2025)
  Ohio State will launch an AI Fluency initiative in autumn 2025 that embeds generative AI education into core undergraduate requirements and majors so every student (beginning with the Class of 2029) graduates fluent in AI tools, ethics, and applications. Faculty support, courses, workshops, and hands-on programs will scale integration.
- Richard Weiss: 3 Claude 4.5 Opus’ Soul Document (Nov 28, 2025)
  An evaluation of the leaked “soul document” from Anthropic’s Claude 4.5 Opus AI that outlines its training goals and values—aiming for Claude to be an honest, helpful assistant that cares about humans and avoids harm. It sparked discussion about AI alignment and model behavior.
- Thezvi Substack: Gemini 3 Pro Is a Vast Intelligence With No Spine (Nov 24, 2025)
  Gemini 3 Pro is a powerful AI model excelling in raw intelligence, creative writing, and learning, achieving top scores on various benchmarks and demonstrating a significant leap in capabilities. However, it can be overly focused on achieving training objectives, leading to evaluation paranoia, hallucinations, and a tendency to prioritize giving an answer over admitting ignorance, and it lacks the user-friendly experience of models like Claude.
- WSJ: The Math Legend Who Just Left Academia—for an AI Startup Run by a 24-Year-Old (Dec 4, 2025)
  Renowned number theorist Ken Ono is leaving his tenured University of Virginia post to join Axiom Math, a startup founded by former student Carina Hong that aims to build an “AI mathematician.” He shifted from skepticism after witnessing rapid AI advances that began to challenge his field.
- Dallas News: TCU announces $10 million investment to launch AI initiative (Dec 9, 2025)
  TCU announces AI² that is supported by major corporations like Dell Technologies, Microsoft, and Amazon Web Services, and it aims to position TCU as a top research institution by integrating AI across various academic and administrative functions while emphasizing ethical use.
- Zvi Mowshowitz: Claude Opus 4.5 Is The Best Model Available (Dec 1, 2025)
  Claude Opus 4.5 is being hailed as the best and most capable model currently available, praised for its intelligence, thoughtful alignment, and coding abilities, making it a strong contender for a daily driver.
- Anthropic: Estimating AI productivity gains from Claude conversations (Nov 25, 2025)
  Analyzing 100,000 real Claude.ai conversations, researchers estimate that current AI models could increase US labor productivity growth by 1.8% annually over the next decade by speeding up task completion by approximately 80%.
- Dwarkesh Patel: Thoughts on AI progress (Dec 2025) (Dec 2, 2025)
  Short timelines often associated with Reinforcement Learning from Human Feedback (RLVR) and AGI are overly optimistic, but the likelihood of AGI in the next 20 years is high (and nearly impossible to imagine its effects on society).
- WSJ: You’re a Knowledge Architect? Why Modern Careers Are So Hard to Explain (Dec 10, 2025)
  Many modern jobs, particularly those involving AI or online content creation, are deeply misunderstood by friends and family due to their niche and abstract nature. My take: perhaps these folks should use AI to help translate their work to the rest of us!
December 10, 2025
(AI) Links (Dec. 8)
- Worksinprogress Co: The Great Downzoning – Works in Progress Magazine (Nov 24, 2025)
  The anti-abundance reality: The Downzoning was driven more by the interests of property owners seeking to protect their property values by restricting development than by a widespread anti-density ideology.
- WSJ: How Blue Origin Plans to Beat SpaceX to the Moon (Dec 2, 2025)
  The company is streamlining operations under new leadership to accelerate its pace and challenge SpaceX, particularly focusing on lunar opportunities and a simplified human landing proposal for NASA. They are leveraging existing hardware and proven technology to achieve a crewed lunar visit by the end of 2028.
- Cory Doctorow: Pluralistic: The Reverse-Centaur’s Guide to Criticizing AI (Dec 5, 2025)
  “The promise of AI – the promise AI companies make to investors – is that there will be AIs that can do your job, and when your boss fires you and replaces you with AI, he will keep half of your salary for himself and give the other half to the AI company.” HT Simon Willison
- NYTimes: A Data Center Wrapped in a Mystery Comes to the New Mexican Desert (Dec 7, 2025)
  BorderPlex, a little-known Austin company led by Lanham Napier, pitched “Project Jupiter” — a $165 billion AI datacenter complex on 1,400 acres in Doña Ana County, N.M.
- WSJ: Nvidia Takes Top Spot in the List of Best-Managed Companies of 2025 (Dec 8, 2025)
  Nvidia is No. 1 in the Drucker Institute’s Management Top 250 as tech’s overall share slips, with five Magnificent Seven firms leading but Intel and Adobe plunging.
- Simon Willison: Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (Nov 24, 2025)
  Anthropic released Claude Opus 4.5, aiming to reclaim the top spot for coding models, boasting improved capabilities, a new “effort” parameter, enhanced computer use tools, and preserved “thinking blocks.”
- Simon Willison: LLM SVG Generation Benchmark (Nov 25, 2025)
  Tom Gally created a project inspired by a previous SVG benchmark, using LLMs to generate SVGs from creative prompts like “an octopus operating a pipe organ.”
- NY Times: Prosecutor Used Flawed A.I. to Keep a Man in Jail, His Lawyers Say (Nov 25, 2025)
  A California prosecutor is under scrutiny for allegedly filing court papers containing AI-generated errors, including misinterpretations of law and nonexistent case citations.
- Anthropic: Snowflake and Anthropic announce $200 million partnership to bring agentic AI to global enterprises (Dec. 3, 2025)
  Anthropic and Snowflake are expanding their partnership through a $200 million agreement to integrate Anthropic’s Claude models into the Snowflake platform, enabling enterprises to leverage AI agents for complex data analysis within a secure and governed environment.
- Johann Rehberger: The Normalization of Deviance in AI · Embrace The Red (Dec 4, 2025)
  “The AI industry risks repeating the same cultural failures that contributed to the Space Shuttle Challenger disaster: quietly normalizing warning signs while progress marches forward.”
December 8, 2025
Various (AI) Links – Dec. 3
- Vechron: Anthropic Prepares for Potential 2026 IPO in Bid to Rival OpenAI: Report (Dec 3, 2025)
  Anthropic hired Wilson Sonsini to begin IPO preparations possibly for 2026, aiming to list before OpenAI amid a private fundraising that could value it above $300 billion. It says no decision is final, has strengthened finance and governance, and faces heavy spending on data centres and model training.
- Anthropic: Anthropic acquires Bun as Claude Code reaches $1B milestone (Dec 2, 2025)
  Anthropic’s Claude Code, a leading AI model for developers, has reached $1 billion in run-rate revenue and is acquiring Bun, a high-performance JavaScript runtime, to enhance its capabilities. This acquisition aims to improve speed, stability, and workflows for Claude Code users by integrating Bun’s toolkit and optimizing the JavaScript developer experience.
- WSJ: Millions of Coders Love This AI Startup. Can It Last? (Dec 1, 2025)
  Cursor, an AI coding tool favored by tech leaders like Sam Altman and Jensen Huang, is experiencing rapid growth and is valued at $29.3 billion. Despite its popularity and impressive growth metrics, the company loses money, relies heavily on external AI models, and faces questions about its long-term sustainability in a competitive market.
- Simon Willison’s Weblog: Claude 4.5 Opus’ Soul Document (Dec 2, 2025)
  Richard Weiss extracted a 14,000-token “Soul overview” document from Claude 4.5 Opus, which Anthropic’s Amanda Askell confirmed was used to train the model’s personality during its training run using supervised learning. The “soul doc” outlines Anthropic’s mission to develop safe and beneficial AI, emphasizing good values, comprehensive knowledge, and wisdom for Claude, and even addresses topics like prompt injection attacks.
- WSJ: Apple to Revamp AI Team After Announcing Top Executive’s Departure (Dec. 1, 2025)
  Apple is restructuring its AI division after the retirement of its AI chief, John Giannandrea, whose tenure was marked by the company’s struggle to compete in the rapidly evolving AI landscape.
- WSJ: This AI Startup Wants to Remake the $800 Billion Chip Industry (Dec. 2, 2025)
  Two former Google researchers are launching Ricursive Intelligence, a startup aiming to automate chip design, potentially revolutionizing the $800 billion industry by enabling companies to create custom chips quickly and easily.
- Anthropic: How AI is transforming work at Anthropic (Dec 2, 2025)
  Anthropic surveyed engineers and analyzed Claude Code usage, finding Claude widely used—boosting productivity (~50%), enabling more full‑stack work, greater output, and new tasks while handling increasingly complex workflows autonomously. Employees nonetheless worry about skill atrophy, reduced collaboration and mentorship, and career uncertainty.
- Mistral: Introducing Mistral 3 (Dec 2, 2025)
  New models offer state-of-the-art performance, multimodal capabilities, and are designed for customization, with optimized versions available through collaborations with NVIDIA, vLLM, and Red Hat.
- AP News: AI may be scoring your college essay. Welcome to the new era of admissions (Dec 1, 2025)
  Colleges are increasingly using AI in the admissions process, primarily to streamline tasks like transcript review and essay evaluation, aiming to improve efficiency and consistency.
- Anthropic: Claude for Nonprofits (Dec 2, 2025)
  Anthropic, in partnership with GivingTuesday, is launching Claude for Nonprofits to help organizations maximize their impact through discounted access to Claude AI, connectors to nonprofit tools like Blackbaud and Benevity, and a free AI fluency course.
December 3, 2025
Google Gemini 3.0

Google released version 3.0 of Gemini on November 18 and a few days later released Nano Banana Pro, their image generating model. Both were quickly declared to be some of the best models, although Anthropic later released version 4.5 of their topline Opus model, reclaiming leadership by some accounts.

This chart from METR shows LLM task success rates for a longer task duration. The present leader is GPT 5.1 Codex Max at more than 30 minutes of non-supervised work. The previous version of Gemini (2.5) was able to work for ~10 minutes, so I expect that Gemini 3.0 will be near the leading edge in the coming weeks.

Progress continues for each of the major AI companies, and the developments in 2025 alone are remarkable. Gemini, with Google’s data-center and intellectual heft, is clearly well-positioned despite it’s earlier hallucinations and AI issues. OpenAI is reportedly alarmed by these tools and have declared “code red” to improve their product quickly.

Below, you’ll find some of the more interesting examples I’ve seen of Gemini / Nano Banana Pro.

…

Solving Homework

From Andrej Karpathy:

Gemini Nano Banana Pro can solve exam questions *in* the exam page image. With doodles, diagrams, all that.

Gemini 3 Pro also scores a 100% on the Best in High School Math (AIME 2025) evaluation, a competitive high school math benchmark.

What does this mean? Are you a student struggling to understand your math assignment? Simply snap a photo of your assignment, and Gemini will complete the work for you. This could be helpful for understanding and checking work. But this could also be a very easy way to cheat, as I’ve read some reports that Gemini is able to replicate an individual’s handwriting style (meaning someone wouldn’t have to manually copy the work). For instructors, the idea of a student mastering take-home assignments may no longer be a viable indicator of learning.

…

Infographics

From Grigory Sapunov, Visualizing Research: How I Use Gemini 3.0 to Turn Papers into Comics:

I asked Nano Banana Pro to generate a graphic novel telling the story and explaining the most important concepts based on a summary I provided. Here is the result:

Grigory’s post is fun and features a variety of comics with differing styles that communicate core ideas of very technical research. This, of course, won’t supplant the importance of original research, but it has the possibility of translating technical knowledge to the masses.

Google also provided some inspiration on ways to use Nano Banana Pro: (Prompt: Create an infographic about this plant focusing on interesting information.)

The prompt and image input are simple, and the generated infographic is interesting and full of helpful details. And best of all, it took a human very little time to create.

What does this mean? Are you in the business of communicating interesting but ultimately difficult-to-understand research? You can generate visuals to convey key ideas within complex research to media and students. Are you in the business of creating infographics? I suspect that if you’re exceptionally talented, you’ll continue your work without much interruption. But for middling designers, your business may dry up as people find more cost-effective ways to create supporting graphics.

Have you used Gemini 3.0 Pro yet? What are your observations of where the tool exceeds?

December 2, 2025
Various (AI) Monday Links (Dec. 1)
- xAI: Grok 4.1 (Nov 17, 2025)
  Grok 4.1 is a new model that significantly improves the usability of Grok, excelling in creative, emotional, and collaborative interactions while maintaining its sharp intelligence.
- Simon Willison: Building more with GPT-5.1-Codex-Max (Nov 19, 2025)
  OpenAI released GPT-5.1-Codex-Max, a new model designed for agentic coding tasks within the Codex environment, featuring “compaction” to handle long-context problems by pruning history while preserving important context.
- Simon Willison: Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (Nov 24, 2025)
  Anthropic released Claude Opus 4.5, aiming to reclaim the top spot for coding models, boasting improved capabilities, a new “effort” parameter, enhanced computer use tools, and preserved “thinking blocks.”
- Simon Willison: LLM SVG Generation Benchmark (Nov 25, 2025)
  Tom Gally created a project inspired by a previous SVG benchmark, using LLMs to generate SVGs from creative prompts like “an octopus operating a pipe organ.”
- Simon Willison: Google Antigravity Exfiltrates Data (Nov 25, 2025)
  PromptArmor demonstrated a prompt injection vulnerability in Google’s Antigravity IDE, where a poisoned web page related to an “integration guide for an Oracle ERP API.” It instructs the AI to collect sensitive data like AWS credentials and exfiltrate it using a browser subagent to a malicious site.
- TechCrunch: Hugging Face CEO says we’re in an ‘LLM bubble,’ not an AI bubble (Nov 18, 2025)
  LLM marketplace, Hugging Face, CEO Clem Delangue believes the current AI hype is specifically an “LLM bubble” that is likely to burst, as the focus is too concentrated on large, general models.
- NY Times Magazine: I’m a Professor. A.I. Has Changed My Classroom, but Not for the Worse. (Nov 25, 2025)
  Rotella shifted his teaching approach to emphasize uniquely human elements like in-class discussion, pen-and-paper exams, and focus on the writing process to foster critical thinking and engagement, arguing that this “AI-resistant” approach, focused on the value of human interaction and individual thought, counters the predicted academic apocalypse and better prepares students for a complex, ever-changing world.
December 1, 2025