Choose the Right AI Tool

The AI landscape has changed a bit since I wrote What Is Real AI? back in 2021. The advent of GenAI has enabled a new wave of dubious AI sales pitches. Here’s one that crossed my desk recently:

We’ve identified some key GenAI opportunities at PermaPlate … forecast revenue and claims across [products] and adjust staffing monthly.

This sounds like a good idea, except – it’s not a GenAI application. It’s a standard forecasting exercise that everybody does already. If I did want to switch to a learning model – even a deep neural net, which is architecturally similar to an LLM – it would still not be GenAI.

The thing to remember is that GenAI “generates” things, like blog posts and deepfakes. My favorite learning models, going back to AI-Based Risk Rating, are all quantitative in nature. Here again, there are plenty of good statistical methods. Even if you prefer to use a learning model, you may not choose a neural net.

Neural nets have a problem with explainability. They’re basically a black box. That’s why credit bureaus, which must be able to explain their ratings, use a two-step approach. They use AI for exploration and feature engineering, but then they put the features into a more-transparent logistic regression model.

I might consider GenAI in a forecasting application, to deal with unstructured data. On the other hand, I would ask why the data is unstructured. We did this exercise as a POC here at PermaPlate. I wrote a little program that would read a service contract, and then answer natural-language queries.

Which coverage did the customer select and does it include roadside assistance?

It was a cool demo, but – if you want coverage details available for automation, it makes a lot more sense to store them in machine-readable form, at origination time. And what kind of automation might that be? Well, it might be “agentic.”

Agentic AI means that the AI has “agency,” in the sense that it can make decisions and do things in the world. Cool, huh? We give AI agency by equipping it with tools, in the form of software APIs.

Imagine asking ChatGPT to organize your next trip. It can’t, because it’s trapped inside your web browser. But if you invoke ChatGPT as part of an agentic workflow, with interfaces to the airlines and hotels, it can actually book the trip.

Agentic workflows often divide the work among tool-using LLMs, with a mastermind LLM directing the others. For systems that don’t have APIs, the agent can use Robotic Process Automation to operate the system’s user interface – just like you would at the keyboard. It’s not surprising that UIPath, one of the leading RPA vendors, has moved into Agentic AI.

Here is a short list of the latest AI techniques:

  • Large Language Model (LLM) – Like Grok and ChatGPT, these are AI models that can read and write (and plan, and execute).
  • Generative AI – Broad class of AI models that can create things, including LLMs but also diffusion models for video and other media.
  • Deep Neural Net (DNN) – Core technology behind GenAI, and many other learning models, as in my earlier article.
  • Retrieval Augmented Generation (RAG) – As the name implies, GenAI “augmented” by the ability to find and read your documents. See Unguided RAG for Text Comparison.
  • Robotic Process Automation (RPA) – Not AI, but frequently used by Agentic AI. As I wrote in Applied AI for Auto Finance, you can derive a lot of efficiency from RPA alone.
  • Agentic AI – AI agents that can make decisions and act autonomously.

Now that you know the lingo, you can choose the right tool for the job – or your AI sales pitch. I, for one, will not be using GenAI to predict claims volume … but I may use Agentic AI to dispatch the technicians.

The Cybernetic Teammate

Here is a recent HBS study on the role of GenAI as a collaborator in a team work environment. What I liked most about the study is that it is field work – real-world tasks in a real company, Procter & Gamble (read more about field work in my review of Gary Klein’s book). It must have been a fun field trip for the Harvard kids. By the way, you may recognize Karim Lakhani as the author of Competing in the Age of AI.

GenAI’s ability to engage in natural language dialogue enables it to participate in the kind of open-ended, contextual interactions that characterize effective teamwork

The introduction recaps the literature on team work, and points to some testable hypotheses about using GenAI as a “cybernetic teammate.” They then proceed to a product development exercise using the company’s standard methods, with a large sample (n=776) of employees in randomly-assigned groups.

The image shows a chart for one of the outcomes, proposal “quality.” For quality, AI-augmented teams were more likely to produce proposals ranking in the top decile. This chart is a little scary, if you think about it, because the bump from adding AI is bigger (and cheaper) than the bump from adding more people.

In a nutshell, teams do better than individuals, but individuals using AI do better than teams. I see this on my LinkedIn feed all the time, and I can vouch for it myself. Shrewd founders see AI as a force multiplier, allowing them to go farther alone before needing to bring in partners.

The study also found that using AI produced proposals better balanced between marketing and technical orientation. Apparently, this is a big skills divide at P&G. Marketers will produce groovy ideas that aren’t feasible, and vice-versa for the tech people. Note the bimodal curve in Figure 11. So, the basic team needs at least one of each skill – unless you’re using AI. AI had the effect of bringing solutions more toward the middle ground.

Finally, test subjects self-evaluated for emotional bien-être, and discovered that working with AI was almost as satisfying as working with other people. So, if you can’t afford a marketing colleague for your lonely, overworked engineer, you can at least get him a cybernetic teammate.

AI Datacenters Are Eating the World

AI datacenters are categorically different from traditional “hyperscale” cloud providers. The older datacenters were optimized for networking and storage – think streaming video and commerce websites.

AI datacenters are optimized for computing, and density is king. The goal is to pack as many parallel processing chips – Nvidia GPUs and Google TPUs – into a rack, and as many racks into the building, as possible.

This means that power consumption and cooling requirements are through the roof. One rack in a typical AWS datacenter might draw 20 kilowatts, while the latest Nvidia rack draws 132 kW. Pack the building full of those, and…

Project/Company Target Capacity Status & Timeline Key Details
xAI Colossus Expansion 1.2 GW Expansion underway; 150 MW substation completing Q4 2025, full 1.2 GW by late 2025–early 2026 Building on the existing 250–300 MW Colossus cluster (built in 122 days). Involves on-site natural gas plant and grid upgrades; faces EPA scrutiny but leverages pre-existing factory infrastructure for speed.
OpenAI Texas Campus 1 GW (phased from 300 MW) Phase 1 (300 MW) operational; Phase 2 construction started Jan 2025, full GW by mid-2026 Houses hundreds of thousands of GPUs; includes 210 substations and on-site electrical upgrades. Already straining ERCOT grid—could equal ~10% of regional peak during heat waves.
Meta 1–5 GW (supercluster) Ground broken 2024; first 1 GW online 2026, scaling to 5 GW by 2030 Zuckerberg’s “gigawatt-plus” initiative; Meta’s largest yet, with $64–72B spend in 2025 alone. Focuses on liquid cooling for high-density AI racks; part of broader multi-GW campus plans.
Microsoft OpenAI “Stargate” 1–5 GW Planning advanced; construction to start late 2025, launch 2028 $500B joint venture with Oracle/Nvidia; aims for massive AI sovereignty. Includes SMR nuclear pilots; power sourcing via PPAs and on-site generation to bypass grid delays.

Microsoft has two 300 MW datacenters. This is comparable to peak load for the city of Tacoma, during their summer AC season. Within a few years, all the leading AI vendors will have datacenters above 1GW. That’s why Microsoft just made a deal to restart the infamous Three-Mile Island nuclear power plant.

A cynic might observe that, while the TMI facility was deemed unsafe to power homes and businesses in Pennsylvania, regulators were willing to reconsider once Microsoft came knocking. Likewise, in Europe, nuclear-powered France is winning the datacenter race over green Germany.

After years of woodburning and windmills, the voracious demands of AI are forcing the world to take another look at nuclear power.

Reality-Based Management

On May 29, 1919, a team of astronomers led by Sir Arthur Eddington photographed the star field behind a solar eclipse. Comparing the position of these stars at night, versus their position during the eclipse, they proved Einstein’s theory that starlight was deflected by the Sun’s gravity.

This experiment made a profound impression on Karl Popper, a young philosopher studying the scientific method. In order to be “scientific,” Popper wrote, a theory must make predictions that can be tested by experiment.

If Eddington’s team had not found the predicted result, Einstein’s theory would have been dead. As Popper wrote, “confirmations should count only if they are the result of risky predictions.” In his famous essay on falsifiability, he contrasts this with the work of Marx and Freud, also popular at the time.

In those theories, Popper found only confirmation bias: “you saw confirmed instances everywhere: the world was full of [post hoc] verifications of the theory.” A theory that is “irrefutable” is not scientific, he wrote. A scientific theory, like Einstein’s, must make definite predictions that could be disproved.

One hundred years later, Scott Adams would warn his readers against confirmation bias, directing them instead to test their ideas based on predictive power: “The best way to judge the accuracy of an idea is not by logic but by its predictive power. If an idea predicts the future accurately, it is a useful idea.”

Business leaders I have worked with pride themselves on “reality based” management. You can’t plan a strategy or launch a new investment based on an incorrect understanding of your market. Maintaining an accurate model of reality takes concerted effort. Read Popper’s full essay here