Blockchain CouncilGlobal Technology Council
ai9 min read

AI Apps and AI Models Compared

Michael WillsonMichael Willson
AI Apps and AI Models Compared

Artificial intelligence is advancing so quickly that every week brings a new headline, a new breakthrough, or a new debate that forces people to rethink the entire landscape. Yet some developments stand out because they signal a deeper shift. Cursor’s recent announcement is one of those moments. The AI coding platform raised 2.3 billion dollars at a valuation of 29.3 billion, a level usually associated with the biggest AI model companies rather than application layer startups.

At first glance, it looks like a high profile funding story. But the implications go far beyond one company’s success. Cursor’s raise opens a fresh discussion about the balance of power between AI applications and foundational AI model. For years, many people assumed the model layer would dominate everything built above it. This announcement suggests that idea may be incomplete. It raises a fundamental question. Can both layers thrive together, or will one eventually overshadow the other? To understand this, we need to explore the events surrounding Cursor’s rise, the breakthroughs across the model layer, and the tensions building between different parts of the AI ecosystem.

Before diving into the debate, it helps to review the news that shaped the last ten days. These updates reveal how fast the environment is changing and why the industry is now reconsidering the relationship between AI apps and models.

A New Wave of Agentic Cyber Attacks

The first major story came from Anthropic. The company reported that in mid September, they detected the first real use of autonomous AI agents in a cyber espionage operation. The threat actor was identified as a Chinese state supported hacking group. The striking part was not only the attack itself, but the role AI played in it.

Anthropic shared that the hackers used Claude Code to carry out most of the attack. The system performed between eighty and ninety percent of the operational steps. Human hackers stepped in only for major decisions. The attack targeted thirty organizations, including financial firms, government agencies, technology companies, and chemical manufacturers. A small number of infiltration attempts even succeeded.

The attackers bypassed guardrails by breaking harmful activities into multiple smaller tasks that appeared harmless on their own. Anthropic monitored the operation for ten days, shut down the accounts involved, and worked with authorities. Their assessment concluded that AI agents can now match the work of highly skilled human teams during cyber operations.

This incident was far more advanced than the earlier “vibe hacking” that required constant human steering. This time, agents took initiative. Anthropic also warned that future agent systems will increase both the scale and volume of AI driven attacks. It is a glimpse into a world where AI delivers massive productivity gains but also introduces new risks. This context influences how we think about AI models and AI apps because both are becoming more powerful in different ways, and the stakes are rising.

Anthropic Commits 50 Billion Dollars to Data Centers

Soon after the cyber incident, Anthropic made another major announcement. The company plans to invest fifty billion dollars to build data centers across the United States. Until now, Anthropic relied heavily on compute purchased from Amazon and Google. That strategy helped the company scale but came with limitations. These included rate limits caused by compute shortages, reliance on in house chips from cloud providers, and even some customer churn during periods of overcrowding.

The new plan involves developing new sites in Texas and New York with the help of Fluidstack. The first locations will go live next year. CEO Dario Amodei positioned this shift as essential for ensuring US leadership in advanced AI. The infrastructure will support large scale models, scientific breakthroughs, and thousands of new jobs.

The scale of this investment suggests a future where model companies are not only research organizations but also major infrastructure operators. This type of growth directly affects how applications evolve because the economics of training and running models influence the opportunities available for app companies like Cursor.

Thinking Machines Lab Joins the Hypervalued Group

Another major event came from Thinking Machines Lab. Bloomberg reported that the company is raising funding at a valuation between fifty and sixty billion dollars. This is a dramatic increase from twelve billion just months ago. The business is still pre revenue. Their product, Tinker, is used by university researchers and a few enterprise clients, but their valuation comes primarily from their team and research velocity.

This mirrors the trajectory of Safe Superintelligence, which also achieved a multibillion valuation before releasing a product. These valuations show how rapidly the model layer is attracting capital. Yet even with this momentum, Cursor, which sits at the application layer, now matches valuations typically associated with foundational research labs. This is what turns Cursor’s story into a pivotal moment.

Google Expands Notebook LM With Deep Research

Google’s updates showed how fast major platforms can evolve. Notebook LM added a new capability called deep research. Users can enter a broad question like “latest quantum physics breakthroughs” and let the system gather documents, extract insights, and build structured research summaries. It also gained the ability to generate short video overviews in various artistic styles, including pixel art, pop art, and art nouveau.

These updates pushed Notebook LM from a summarization tool to a comprehensive research assistant. It also increases pressure on AI application developers because large companies can introduce new features quickly, raising user expectations.

DeepMind Releases SEMA 2

DeepMind introduced SEMA 2, an upgraded agent that learns through large scale self play and can operate in simulated worlds it has never encountered before. The jump in performance is significant. SEMA 1 achieved a thirty one percent success rate across evaluation tasks. SEMA 2 reached sixty five percent. Humans score around seventy six percent. The improvement on unseen environments was equally notable, rising from two percent to thirteen percent.

DeepMind tested SEMA 2 inside entirely new environments generated by Genie 3. The agent could understand instructions, orient itself, and take meaningful action. These updates support the idea that world models may be a key path toward more general intelligence. They also highlight how foundational model research is accelerating.

GPT 5.1 API and New Prompting Guidance

OpenAI released GPT 5.1 through its API. Developers noticed that the model is more verbose unless directed otherwise, more controllable, and better at following subtle instructions. It also behaves differently when used inside agent frameworks. Updated prompting guides make it easier to migrate from GPT 4 and GPT 5.

These advancements again highlight how quickly foundational research evolves, and they fuel the debate about whether AI apps can survive in a world dominated by fast moving model labs.

The Central Debate: Will AI Models Overtake AI Apps

The conversation exploded after investor Yishan Wong published a post that drew twenty million views. His argument is straightforward and provocative. He believes most AI application startups are unlikely to survive because foundational model companies innovate too quickly.

His thesis includes several points:

Model labs innovate at extreme speed.
New capabilities arrive every nine to twelve months.
App startups do not have time to build durable businesses.
Most AI apps will be replaced before they mature.
The likely outcomes are short term cash or acquisition.

He argues that only app companies with specialized data barriers tied to the physical world have a chance to scale sustainably.

This argument sparked widespread discussion among founders, engineers, and investors.

Counterarguments: Why Many Believe the App Layer Still Has Power

Not everyone agrees with Yishan. Many industry experts offered counterpoints, and their responses fall into several core themes.

Vertical applications require deep workflow engineering

Applications must handle integration, domain expertise, compliance, human in the loop processes, and environment specific needs. Model companies do not have the focus or incentives to tackle these details.

The final ten percent of product quality is extremely hard

A model can be ninety percent correct, but businesses need near perfect consistency. App companies specialize in delivering the last mile of reliability.

Behavioral exhaust creates a defensible moat

Natasha Malpani highlighted that the most valuable data is not training data but behavioral data created by real users. This includes edits, workflow patterns, intent signals, and actions. This data does not flow back to model labs. It stays with the apps.

Model labs cannot specialize in every domain

The model layer focuses on general intelligence. The app layer focuses on depth and specialization. This creates room for both.

Some apps will grow into model companies

Cursor is the clearest example of this hybrid model.

Cursor’s Raise Changes the Landscape

Cursor’s valuation of 29.3 billion and its 2.3 billion dollar raise surprised the entire industry. The company also announced one billion dollars in annual recurring revenue. This makes Cursor the fastest company in history to reach that milestone.

Their proprietary model, Composer 1, is now one of the most used models on the platform. In April, the model leaderboard looked very different. Today, Composer 1 ranks above many established systems. Cursor’s CEO, Michael Trul, confirmed that they are moving into full model development. This makes Cursor the first application company to successfully transition into a combined app and model company.

This hybrid strategy gives Cursor access to reinforcement learning data, behavioral feedback, and domain specific improvements that model labs cannot easily replicate.

Comparing the App Layer and the Model Layer

Here is a clear contrast between the strengths of each layer:

Innovation speed
Apps move slowly but gain depth.
Models move fast and push new research.

Data access
Apps collect behavioral exhaust.
Models gather broad training data.

Business defensibility
Apps offer trust, UX, integration, and compliance.
Models offer scale and capability.

Weakness
Apps face obsolescence risk.
Models face cost, compute, and safety challenges.

Why the Debate Matters for Professionals

This debate affects career paths, team structure, product decisions, and investment strategies. People who want to understand AI deeply often build foundational skills through structured paths like a Tech certification. Engineers and researchers exploring world models or advanced agent systems often pursue advanced learning in areas such as reinforcement learning or emerging architectures. Leaders responsible for applying AI across marketing, product, operations, and strategy develop frameworks through structured programs like a Marketing and business certification.

This combination of human capability and system capability shapes how AI spreads across organizations.

The Future Belongs to Companies That Can Operate in Both Layers

The most compelling insight from the recent news cycle is that the distinction between apps and models may blur in the future. Cursor demonstrates that app companies can evolve into model companies when they gather enough behavioral intelligence. Meanwhile, model labs are expanding into infrastructure, research, and massive scale deployment.

The next generation of AI companies may not choose between the layers. They may combine them. The world is changing rapidly, and even the experts admit they are still learning what the future will look like.

Cursor’s rise does not signal the end of the app layer. It signals the start of a new chapter where app companies explore how far they can go and how close they can get to the model layer.

AI Apps and AI Models