2026

Loop's guide to using AI

What you should be using now, what you should be investing in, and what's coming next.

Keeping up with AI is exhausting.

It feels like there's something new every day, half of it is hype, and working out what actually matters takes time.

With years of experience building AI products, I've written this guide to walk you through the essentials - which models to use, tools that can genuinely change how you work, and how to get real value from AI in your business.

Liam McCormick

Senior AI Engineer

Generative AI models

Choose the right model for your use case

Anthropic

Recommended

Claude 4.5 Sonnet

Balances intelligence and cost perfectly. Your everyday workhorse for most tasks.

Lower cost makes this ideal for regular use. Only reach for Opus when you need the absolute best.

Anthropic

Recommended

Claude 4.5 Opus

The heavyweight. Excels at planning, complex reasoning, and challenging tasks.

Higher cost means you'll want to be selective. Reserve this for when quality matters most.

Google

Recommended

Gemini 3 Pro

Matches Opus-level reasoning at half the price. Strong alternative for complex work.

Significantly cheaper than Claude 4.5 Opus whilst delivering comparable performance.

OpenAI

GPT-5.1

Solid benchmark performance, though I typically favour Claude and Gemini.

Slightly cheaper than Gemini 3 Pro. Good for high-volume tasks where cost matters.

Recommended tools

The tools I use daily to get work done

Coding

Claude Code

Work smarter with Claude Code, a terminal-based AI assistant that understands your entire codebase. From autonomous file editing to creating new components, Claude Code handles complex development tasks while you stay in flow.

Whether you're refactoring legacy code, debugging tricky issues, or building new features, Claude Code works alongside you with full context of your project. It's included free with your Claude subscription, making it an essential tool for modern development.

Explore

Coding

GitHub Copilot

Transform the way you write code with GitHub Copilot. This AI-powered coding assistant understands context from your comments and code to suggest entire lines or blocks of code as you type.

Whether you're learning a new language, exploring unfamiliar frameworks, or simply want to write code faster, Copilot adapts to your style and helps you maintain flow. It's like having an experienced developer looking over your shoulder, ready to help whenever you need it.

Explore

Coding

Vercel v0

Build interfaces at the speed of thought with Vercel v0. Describe the UI you want in plain English, and watch as v0 generates production-ready React components using shadcn/ui and Tailwind CSS that you can copy directly into your project.

Whether you're prototyping a new feature or exploring different design directions, v0 dramatically accelerates frontend development. Iterate on designs through conversation, refine components in real-time, and ship polished interfaces faster than ever before.

Explore

Writing

Claude Sonnet 4.5

Experience AI writing that sounds genuinely human with Claude Sonnet 4.5. This model excels at nuanced, conversational content that captures your unique voice, whether you're crafting long-form articles or polishing business communications.

From maintaining British English conventions to handling complex technical writing, Claude Sonnet 4.5 brings intelligence and attention to detail that other models miss. It's the go-to choice when writing quality matters more than speed.

Explore

Writing

ChatGPT

Meet the AI assistant that's everywhere you need it. ChatGPT combines powerful writing capabilities with seamless browser integration, voice conversations, and built-in web search to help you work smarter across all your tasks.

Whether you're drafting emails, researching topics, or brainstorming ideas, ChatGPT adapts to your workflow. The voice mode makes it feel like talking to a colleague, while web search ensures you're always working with current information.

Explore

Research

NotebookLM

Transform how you work with information using NotebookLM from Google. Upload PDFs, paste website links, or add your notes, and watch as it generates structured summaries, answers questions, and helps you make connections you might have missed.

Whether you're researching for a project, analysing documents, or trying to make sense of complex information, NotebookLM acts as your personal research assistant. Best of all, it's completely free to use with no limits on uploads.

Explore

Research

Claude with research mode

Get the best of both worlds with Claude's web search capabilities. Combine Claude's exceptional reasoning and writing abilities with real-time access to current information, complete with proper citations you can trust and verify.

Whether you're fact-checking claims, researching current events, or need up-to-date information on rapidly evolving topics, Claude synthesises multiple sources into clear, well-reasoned responses. It's research assistance that thinks critically about what it finds.

Explore

Design

Nano Banana Pro

Create stunning visuals with perfectly rendered text using Nano Banana Pro, Google DeepMind's latest image generation model powered by Gemini 3 Pro. From mockups and posters to detailed infographics, it excels at combining high-fidelity imagery with legible text across multiple languages.

Whether you're designing marketing materials, creating educational diagrams, or building brand assets, Nano Banana Pro delivers up to 4K resolution with advanced creative controls. Available free in the Gemini app and integrated into Adobe Firefly and Photoshop for seamless workflows.

Explore

Design

Canva AI

Design like a pro without being one. Canva AI combines an extensive template library with intelligent AI assistance to help you create professional graphics, presentations, and social media content in minutes instead of hours.

Whether you're creating a quick Instagram post or designing a full pitch deck, Canva AI suggests layouts, generates images, and helps you maintain consistent branding. The free tier gives you plenty to work with, making professional design accessible to everyone.

Explore

Design

Veo 3.1

Witness the future of video creation with Veo 3.1, Google's cutting-edge AI video generator that creates photorealistic footage with synchronised audio that looks and sounds like it was filmed with professional equipment.

From product demonstrations to creative storytelling, Veo 3.1 is genuinely game-changing for content creators. What used to require expensive filming and production can now be generated from text descriptions, opening up entirely new creative possibilities for videos, ads, and visual narratives.

Explore

Productivity

Notion AI

Notion AI brings powerful writing capabilities directly into your workspace. Write faster, think bigger, and augment your creativity with an AI assistant that understands your context.

From drafting documents to summarizing meeting notes, Notion AI helps you work more efficiently without leaving your workspace. It's the writing assistant that knows where you are and what you're working on.

Explore

Loop Newsletter

Get weekly AI insights delivered to your inbox

Subscribe for free

Read by executives at:

Building AI Agents

AI agents can complete tasks that would take your team hours—research, data analysis, code generation, workflow automation. But most projects fail. Not because the technology isn't capable, but because teams jump straight to building without understanding what they're actually trying to solve.

I've spent the last two years building production agents for enterprise clients. Here's what actually works.

What AI agents actually are

There's a lot of confusion here, so let's clear it up.

An AI assistant responds to your prompts. You ask a question, it gives an answer. That's ChatGPT, Claude, Gemini—useful, but passive.

An AI agent is different. It takes actions, uses tools, and completes multi-step tasks on its own. You give it a goal, and it figures out how to get there.

Here's a simple example: an assistant tells you how to book a meeting. An agent checks calendars, finds availability, sends invites, and confirms attendance—without you touching anything.

The difference matters because agents can do things that simply aren't possible with a chatbot. They can execute code, call APIs, search databases, scrape websites, and even control your computer.

When to build vs when to buy

Not every situation needs a custom agent. Sometimes the off-the-shelf options are exactly right.

Use existing agents when

The use case is common (coding, writing, research)
You need speed to value
Your workflow fits the tool's assumptions

Build custom when

You need integration with proprietary systems
The workflow is unique to your business
You require specific controls or compliance

If you're just getting started, I'd recommend exploring Claude Code for development work, Notion AI for workspace tasks, or Replit Agent for building applications. These give you a feel for what's possible before you invest in custom builds.

How to scope an agent project

This is where most teams go wrong. They jump straight to the technology without understanding the problem.

Start with the workflow, not the technology.

Map out exactly how the task is done today—every step, every decision point, every handoff. Where are the bottlenecks? Where do errors happen? Where do people waste time on repetitive work?

Identify the 3-5 use cases that actually matter.

Not every process benefits from agents. Look for high volume, repeatable steps, clear success criteria, and tolerance for occasional errors. Avoid high-stakes decisions, processes requiring nuanced judgment, or anything customer-facing without robust fallbacks.

Define the human-agent boundary.

What does the agent do autonomously? Where does it pause for human review? This isn't a limitation—it's how you build trust and catch errors before they compound.

I've seen teams try to automate everything at once. It never works. Start narrower than you think necessary. A tightly-scoped agent that works reliably beats an ambitious one that fails unpredictably.

The technical building blocks

You don't need to be a developer to understand how agents work. Here's what's under the hood:

Foundation models

The reasoning engine. This is Claude, GPT, or Gemini providing the intelligence that powers decisions.

Tool use

How agents interact with external systems. APIs, databases, applications—whatever you give them access to.

Orchestration

Managing multi-step workflows, handling failures, maintaining state across a conversation.

Memory

Short-term (conversation context) and long-term (learned preferences, historical data).

Guardrails

Input validation, output filtering, scope limitations. The safety net that prevents agents from going off-course.

If you're building custom agents, it's worth understanding Anthropic's Model Context Protocol (MCP) and their Agent SDK. These are becoming industry standards and will save you significant development time.

What I've learned building production agents

I've shipped agents that handle real-time journey planning, generate client proposals from workshop notes, and help enterprises identify valuable AI features across their systems. Here's what I wish I'd known earlier:

Data quality matters more than model choice.

Your agent is only as good as the information it can access. I've seen teams spend weeks debating Claude vs GPT when their real problem was messy documentation and unreliable integrations. Fix the data first.

Design for failure.

Agents will make mistakes. The question is whether you've built systems to catch errors, alert humans, and recover gracefully. Circuit breakers, confidence thresholds, and human-in-the-loop checkpoints aren't optional—they're essential.

Change management is half the work.

The technical build might take three months. Getting teams to actually use the agent, trust it, and integrate it into their workflows? That takes longer. Budget for training, iteration, and ongoing feedback.

Scope creep kills projects.

Every stakeholder will have ideas for "just one more feature." Resist this. Prove value with a focused use case first, then expand. Small wins build momentum for larger initiatives.

A real example

For a transport application, we built an agent that provides real-time bus tracking and AI-powered journey planning. Users ask natural questions—"How do I get to the airport by 3pm?"—and the agent figures out the rest.

The approach

We connected the agent to live transit data, gave it tools to query routes and schedules, and designed fallbacks for when data wasn't available. The key was keeping the scope tight: journey planning only, with clear handoffs for edge cases.

The result

The prototype contributed to a £20 million contract. More importantly, it proved that agents could handle real-world complexity—variable data, unexpected queries, time pressure—without falling over.

What we'd do differently

We underestimated how much time we'd spend on data quality. The transit feeds were inconsistent, and we ended up building more validation logic than expected. Next time, I'd audit the data sources properly before writing any agent code.

Before you build: readiness checklist

Clear problem definition (not "we want to use AI")

Mapped current workflow with identified bottlenecks

Defined success metrics and baseline measurements

Identified data sources and confirmed access

Established human oversight process

Designed fallback mechanisms for failures

Allocated budget for iteration and maintenance

Secured stakeholder buy-in for workflow changes

If you can't tick most of these, you're not ready to build. That's not a criticism—it's how you avoid wasting time and money on projects that stall.

Common failure modes

"Boiling the ocean"

Trying to automate everything at once. Pick one workflow. Prove it works. Then expand.

Skipping the data work

Assuming existing documentation is agent-ready. It almost never is.

No fallback plan

Building agents without clear failure modes. What happens when the LLM hallucinates?

Treating agents like APIs

Expecting deterministic outputs from probabilistic systems. Design for variance.

Ignoring latency

Not accounting for LLM response time in user workflows. Speed matters.

Explore agent tools

Start with existing solutions or build your own

Notion AI

Summarise pages, generate content within your workspace

Claude Code

AI-powered terminal coding assistant for file editing and features

ChatGPT with integrations

Spotify, Zillow, Figma support with generative UI

Replit Agent

End-to-end application building and deployment

Microsoft Copilot

Integrated with Microsoft 365, 150M+ enterprise users

Google Gemini

Integrated in Google Workspace with deep search

Before you start implementing AI

Key questions and principles to guide your AI implementation

Step 01

Ask the right questions

What specific problem are you solving?

Identify the exact business need, not just "we want to use AI." The more precise you are, the better your solution will be.

Do you have the right data?

AI models are only as good as the data they're trained on. You need sufficient volume, but more importantly, quality and relevance to your use case.

Is AI the best solution?

Sometimes traditional software, automation, or process changes work better. Don't force AI where simpler solutions exist.

What does success look like?

Define clear, measurable outcomes before you start. How will you know if this is working? Set realistic benchmarks.

Who will use this and how?

Map out the user journey. Consider adoption barriers, training needs, and how this fits into existing workflows.

What's your timeline and budget?

AI projects often take longer and cost more than expected. Factor in data preparation, iteration, and ongoing maintenance.

Step 02

Build AI in a responsible way

Transparency & disclosure

Users should always know when they're interacting with AI. Make it clear what the AI does, how it makes decisions, and what its limitations are. Don't hide AI behind a facade of human interaction.

Bias detection & mitigation

AI systems can perpetuate or amplify existing biases. Test with diverse datasets, monitor outputs for unfair outcomes, and have processes in place to address bias when it's discovered.

Human oversight & control

Keep humans in the loop, especially for high-stakes decisions. AI should augment human judgment, not replace it entirely. Build in review processes and override capabilities.

Data privacy & security

Respect user data. Follow regulations like GDPR, implement proper consent mechanisms, and be transparent about how data is used. Consider data minimisation—collect only what you need.

Safety & reliability

Plan for failure modes. What happens when the AI makes a mistake? Build in fallback mechanisms, monitoring systems, and circuit breakers to prevent harm.

Accountability & governance

Establish clear ownership and responsibility. Who's accountable when something goes wrong? Create governance frameworks and document your AI systems' capabilities and limitations.

Step 03

Avoid these common pitfalls

Starting with too ambitious a scope

The "boil the ocean" approach rarely works. Start with one specific, well-defined use case. Prove value there, then expand. Small wins build momentum and buy-in for larger initiatives.

Ignoring data quality and preparation

Garbage in, garbage out. Most AI projects fail because of poor data, not poor algorithms. Budget significant time for data cleaning, labelling, and validation. It's unglamorous but critical.

Underestimating change management

AI isn't just a technology problem—it's a people and process problem. Users need training, workflows need redesigning, and stakeholders need ongoing communication. Don't skip this.

Building without clear success metrics

If you can't measure it, you can't improve it. Define success criteria upfront: what metrics will move, by how much, and by when? Without this, you'll never know if your AI is actually working.

Over-relying on AI without validation

AI makes mistakes. Always include human review for critical decisions, especially early on. Trust but verify. Build feedback loops so the system can learn from errors.

Neglecting ongoing maintenance

AI systems drift over time as data and environments change. Plan for continuous monitoring, retraining, and updates. This isn't a "set it and forget it" technology.

Step 04

Measure success

Efficiency & productivity metrics

Track time saved per task, cost reduction, process speed improvements, and throughput increases. For example: "Customer support resolution time reduced from 4 hours to 45 minutes." These show immediate operational impact.

Quality & accuracy metrics

Measure error rates, prediction accuracy, precision and recall, and user satisfaction scores. Compare AI performance to baseline (human or previous system). Track improvements over time as the model learns.

Adoption & engagement metrics

Monitor daily active users, feature usage rates, user retention, and time spent. Low adoption often signals UX issues or lack of trust. High engagement indicates the AI is providing real value to users.

Business impact metrics

Calculate ROI, revenue impact, customer lifetime value changes, retention rates, and competitive advantage gained. These tie AI initiatives to bottom-line results that executives care about.

System health metrics

Track uptime, response latency, model drift, data quality scores, and incident rates. Technical health metrics ensure your AI remains reliable and performs consistently over time.

Start simple, iterate often

Don't try to measure everything at once. Pick 3-5 key metrics that align with your goals. Review regularly and adjust as you learn what actually matters for your use case.

What businesses should invest in

Strategic priorities for AI transformation

Workforce transformation

The biggest opportunity isn't replacing people—it's enabling them to work at a higher level. Companies that invest in upskilling their workforce and redesigning workflows around AI assistance will see productivity gains that compound over time.

This means training programmes, new processes, and cultural change. The organisations winning with AI aren't just buying tools; they're fundamentally rethinking how work gets done.

AI agents

This is where the real value lies. AI agents that can complete tasks autonomously are delivering 2.1x greater ROI than other AI initiatives, according to BCG research. We're talking about systems that don't just suggest—they execute.

Start with high-impact use cases: customer service automation (think Klarna's approach, done properly), internal workflow automation for meeting notes and data analysis, or code generation and testing like Uber's doing. The key is focusing on 3-5 use cases that matter, not spreading yourself thin across dozens of experiments.

Companies like HCA Healthcare are seeing 40% reductions in review cycles. But here's what matters: you need to redesign workflows around agents, not force-fit them into existing processes. And human oversight? Still critical, no matter what the hype suggests.

Generative UI

Static interfaces are becoming obsolete. Generative UI creates dynamic interfaces based on context, delivering better user experiences than anything we could design manually. ChatGPT's app integrations with Spotify and Zillow prove the concept works—interfaces that adapt to what users actually need, when they need it.

Think context-aware interface changes and personalised user journeys. Companies like Context.ai are reimagining productivity software entirely around this concept. For React developers, frameworks like CopilotKit AG-UI make it practical to implement.

This isn't science fiction—it's happening now. The question isn't whether to explore generative UI, but how quickly you can start experimenting with it in your products.

Expert data collection

Your AI agents are only as good as the knowledge they can access. The companies winning with AI aren't just deploying models—they're systematically capturing expertise from their best people and making it accessible to their agents. This is the difference between an AI that gives generic responses and one that actually understands your business.

Start documenting decision-making processes, common scenarios, and edge cases from your top performers. Record how your experts handle complex situations, what questions they ask, and what factors they consider. Structure this knowledge so AI agents can retrieve and apply it contextually—not just as static documentation, but as living decision trees and examples.

This isn't a one-time project. Build systems where experts can continuously feed knowledge into your AI infrastructure. The organisations that treat expert knowledge as a strategic asset—actively collecting, structuring, and maintaining it—will have AI agents that actually reflect their institutional intelligence, not just what's in public training data.

Emerging technologies

Worth watching, but not ready for production

World models

AI systems that build internal representations of how the world works, enabling better reasoning and prediction.

Quantum computing

Computers that leverage quantum mechanics for exponentially faster processing.

Brain-computer interfaces

Direct communication pathways between the brain and external devices.

Fusion energy

Clean energy generation by replicating the sun's power.