
Loop's guide to using AI
What you should be using now, what you should be investing in, and what's coming next.
Keeping up with AI is exhausting.
It feels like there's something new every day, half of it is hype, and working out what actually matters takes time.
With years of experience building AI products, I've written this guide to walk you through the essentials - which models to use, tools that can genuinely change how you work, and how to get real value from AI in your business.
Liam McCormick
Senior AI Engineer
Generative AI models
Choose the right model for your use case

Claude 4.5 Sonnet
Balances intelligence and cost perfectly. Your everyday workhorse for most tasks.
Lower cost makes this ideal for regular use. Only reach for Opus when you need the absolute best.

Claude 4.5 Opus
The heavyweight. Excels at planning, complex reasoning, and challenging tasks.
Higher cost means you'll want to be selective. Reserve this for when quality matters most.

Gemini 3 Pro
Matches Opus-level reasoning at half the price. Strong alternative for complex work.
Significantly cheaper than Claude 4.5 Opus whilst delivering comparable performance.

GPT-5.1
Solid benchmark performance, though I typically favour Claude and Gemini.
Slightly cheaper than Gemini 3 Pro. Good for high-volume tasks where cost matters.
Recommended tools
The tools I use daily to get work done
Coding

Claude Code
Work smarter with Claude Code, a terminal-based AI assistant that understands your entire codebase. From autonomous file editing to creating new components, Claude Code handles complex development tasks while you stay in flow.
Whether you're refactoring legacy code, debugging tricky issues, or building new features, Claude Code works alongside you with full context of your project. It's included free with your Claude subscription, making it an essential tool for modern development.

GitHub Copilot
Transform the way you write code with GitHub Copilot. This AI-powered coding assistant understands context from your comments and code to suggest entire lines or blocks of code as you type.
Whether you're learning a new language, exploring unfamiliar frameworks, or simply want to write code faster, Copilot adapts to your style and helps you maintain flow. It's like having an experienced developer looking over your shoulder, ready to help whenever you need it.

Vercel v0
Build interfaces at the speed of thought with Vercel v0. Describe the UI you want in plain English, and watch as v0 generates production-ready React components using shadcn/ui and Tailwind CSS that you can copy directly into your project.
Whether you're prototyping a new feature or exploring different design directions, v0 dramatically accelerates frontend development. Iterate on designs through conversation, refine components in real-time, and ship polished interfaces faster than ever before.
Writing

Claude Sonnet 4.5
Experience AI writing that sounds genuinely human with Claude Sonnet 4.5. This model excels at nuanced, conversational content that captures your unique voice, whether you're crafting long-form articles or polishing business communications.
From maintaining British English conventions to handling complex technical writing, Claude Sonnet 4.5 brings intelligence and attention to detail that other models miss. It's the go-to choice when writing quality matters more than speed.

ChatGPT
Meet the AI assistant that's everywhere you need it. ChatGPT combines powerful writing capabilities with seamless browser integration, voice conversations, and built-in web search to help you work smarter across all your tasks.
Whether you're drafting emails, researching topics, or brainstorming ideas, ChatGPT adapts to your workflow. The voice mode makes it feel like talking to a colleague, while web search ensures you're always working with current information.
Research

NotebookLM
Transform how you work with information using NotebookLM from Google. Upload PDFs, paste website links, or add your notes, and watch as it generates structured summaries, answers questions, and helps you make connections you might have missed.
Whether you're researching for a project, analysing documents, or trying to make sense of complex information, NotebookLM acts as your personal research assistant. Best of all, it's completely free to use with no limits on uploads.

Claude with research mode
Get the best of both worlds with Claude's web search capabilities. Combine Claude's exceptional reasoning and writing abilities with real-time access to current information, complete with proper citations you can trust and verify.
Whether you're fact-checking claims, researching current events, or need up-to-date information on rapidly evolving topics, Claude synthesises multiple sources into clear, well-reasoned responses. It's research assistance that thinks critically about what it finds.
Design

Nano Banana Pro
Create stunning visuals with perfectly rendered text using Nano Banana Pro, Google DeepMind's latest image generation model powered by Gemini 3 Pro. From mockups and posters to detailed infographics, it excels at combining high-fidelity imagery with legible text across multiple languages.
Whether you're designing marketing materials, creating educational diagrams, or building brand assets, Nano Banana Pro delivers up to 4K resolution with advanced creative controls. Available free in the Gemini app and integrated into Adobe Firefly and Photoshop for seamless workflows.

Canva AI
Design like a pro without being one. Canva AI combines an extensive template library with intelligent AI assistance to help you create professional graphics, presentations, and social media content in minutes instead of hours.
Whether you're creating a quick Instagram post or designing a full pitch deck, Canva AI suggests layouts, generates images, and helps you maintain consistent branding. The free tier gives you plenty to work with, making professional design accessible to everyone.

Veo 3.1
Witness the future of video creation with Veo 3.1, Google's cutting-edge AI video generator that creates photorealistic footage with synchronised audio that looks and sounds like it was filmed with professional equipment.
From product demonstrations to creative storytelling, Veo 3.1 is genuinely game-changing for content creators. What used to require expensive filming and production can now be generated from text descriptions, opening up entirely new creative possibilities for videos, ads, and visual narratives.
Productivity

Notion AI
Notion AI brings powerful writing capabilities directly into your workspace. Write faster, think bigger, and augment your creativity with an AI assistant that understands your context.
From drafting documents to summarizing meeting notes, Notion AI helps you work more efficiently without leaving your workspace. It's the writing assistant that knows where you are and what you're working on.

Loop Newsletter
Get weekly AI insights delivered to your inbox
Subscribe for freeRead by executives at:

Building AI Agents
AI agents can complete tasks that would take your team hours—research, data analysis, code generation, workflow automation. But most projects fail. Not because the technology isn't capable, but because teams jump straight to building without understanding what they're actually trying to solve.
I've spent the last two years building production agents for enterprise clients. Here's what actually works.
What AI agents actually are
There's a lot of confusion here, so let's clear it up.
An AI assistant responds to your prompts. You ask a question, it gives an answer. That's ChatGPT, Claude, Gemini—useful, but passive.
An AI agent is different. It takes actions, uses tools, and completes multi-step tasks on its own. You give it a goal, and it figures out how to get there.

Here's a simple example: an assistant tells you how to book a meeting. An agent checks calendars, finds availability, sends invites, and confirms attendance—without you touching anything.
The difference matters because agents can do things that simply aren't possible with a chatbot. They can execute code, call APIs, search databases, scrape websites, and even control your computer.
When to build vs when to buy
Not every situation needs a custom agent. Sometimes the off-the-shelf options are exactly right.
Use existing agents when
- The use case is common (coding, writing, research)
- You need speed to value
- Your workflow fits the tool's assumptions
Build custom when
- You need integration with proprietary systems
- The workflow is unique to your business
- You require specific controls or compliance
If you're just getting started, I'd recommend exploring Claude Code for development work, Notion AI for workspace tasks, or Replit Agent for building applications. These give you a feel for what's possible before you invest in custom builds.
How to scope an agent project
This is where most teams go wrong. They jump straight to the technology without understanding the problem.
Start with the workflow, not the technology.
Map out exactly how the task is done today—every step, every decision point, every handoff. Where are the bottlenecks? Where do errors happen? Where do people waste time on repetitive work?
Identify the 3-5 use cases that actually matter.
Not every process benefits from agents. Look for high volume, repeatable steps, clear success criteria, and tolerance for occasional errors. Avoid high-stakes decisions, processes requiring nuanced judgment, or anything customer-facing without robust fallbacks.
Define the human-agent boundary.
What does the agent do autonomously? Where does it pause for human review? This isn't a limitation—it's how you build trust and catch errors before they compound.
I've seen teams try to automate everything at once. It never works. Start narrower than you think necessary. A tightly-scoped agent that works reliably beats an ambitious one that fails unpredictably.
The technical building blocks
You don't need to be a developer to understand how agents work. Here's what's under the hood:
The reasoning engine. This is Claude, GPT, or Gemini providing the intelligence that powers decisions.
How agents interact with external systems. APIs, databases, applications—whatever you give them access to.
Managing multi-step workflows, handling failures, maintaining state across a conversation.
Short-term (conversation context) and long-term (learned preferences, historical data).
Input validation, output filtering, scope limitations. The safety net that prevents agents from going off-course.
If you're building custom agents, it's worth understanding Anthropic's Model Context Protocol (MCP) and their Agent SDK. These are becoming industry standards and will save you significant development time.
What I've learned building production agents
I've shipped agents that handle real-time journey planning, generate client proposals from workshop notes, and help enterprises identify valuable AI features across their systems. Here's what I wish I'd known earlier:
Data quality matters more than model choice.
Your agent is only as good as the information it can access. I've seen teams spend weeks debating Claude vs GPT when their real problem was messy documentation and unreliable integrations. Fix the data first.
Design for failure.
Agents will make mistakes. The question is whether you've built systems to catch errors, alert humans, and recover gracefully. Circuit breakers, confidence thresholds, and human-in-the-loop checkpoints aren't optional—they're essential.
Change management is half the work.
The technical build might take three months. Getting teams to actually use the agent, trust it, and integrate it into their workflows? That takes longer. Budget for training, iteration, and ongoing feedback.
Scope creep kills projects.
Every stakeholder will have ideas for "just one more feature." Resist this. Prove value with a focused use case first, then expand. Small wins build momentum for larger initiatives.
A real example
For a transport application, we built an agent that provides real-time bus tracking and AI-powered journey planning. Users ask natural questions—"How do I get to the airport by 3pm?"—and the agent figures out the rest.
The approach
We connected the agent to live transit data, gave it tools to query routes and schedules, and designed fallbacks for when data wasn't available. The key was keeping the scope tight: journey planning only, with clear handoffs for edge cases.
The result
The prototype contributed to a £20 million contract. More importantly, it proved that agents could handle real-world complexity—variable data, unexpected queries, time pressure—without falling over.
What we'd do differently
We underestimated how much time we'd spend on data quality. The transit feeds were inconsistent, and we ended up building more validation logic than expected. Next time, I'd audit the data sources properly before writing any agent code.
Before you build: readiness checklist
If you can't tick most of these, you're not ready to build. That's not a criticism—it's how you avoid wasting time and money on projects that stall.
Common failure modes
"Boiling the ocean"
Trying to automate everything at once. Pick one workflow. Prove it works. Then expand.
Skipping the data work
Assuming existing documentation is agent-ready. It almost never is.
No fallback plan
Building agents without clear failure modes. What happens when the LLM hallucinates?
Treating agents like APIs
Expecting deterministic outputs from probabilistic systems. Design for variance.
Ignoring latency
Not accounting for LLM response time in user workflows. Speed matters.
Explore agent tools
Start with existing solutions or build your own

Notion AI
Summarise pages, generate content within your workspace

Claude Code
AI-powered terminal coding assistant for file editing and features

ChatGPT with integrations
Spotify, Zillow, Figma support with generative UI

Replit Agent
End-to-end application building and deployment

Microsoft Copilot
Integrated with Microsoft 365, 150M+ enterprise users

Google Gemini
Integrated in Google Workspace with deep search
Before you start implementing AI
Key questions and principles to guide your AI implementation
Ask the right questions
What specific problem are you solving?
Identify the exact business need, not just "we want to use AI." The more precise you are, the better your solution will be.
Do you have the right data?
AI models are only as good as the data they're trained on. You need sufficient volume, but more importantly, quality and relevance to your use case.
Is AI the best solution?
Sometimes traditional software, automation, or process changes work better. Don't force AI where simpler solutions exist.
What does success look like?
Define clear, measurable outcomes before you start. How will you know if this is working? Set realistic benchmarks.
Who will use this and how?
Map out the user journey. Consider adoption barriers, training needs, and how this fits into existing workflows.
What's your timeline and budget?
AI projects often take longer and cost more than expected. Factor in data preparation, iteration, and ongoing maintenance.
Build AI in a responsible way
Transparency & disclosure
Users should always know when they're interacting with AI. Make it clear what the AI does, how it makes decisions, and what its limitations are. Don't hide AI behind a facade of human interaction.
Bias detection & mitigation
AI systems can perpetuate or amplify existing biases. Test with diverse datasets, monitor outputs for unfair outcomes, and have processes in place to address bias when it's discovered.
Human oversight & control
Keep humans in the loop, especially for high-stakes decisions. AI should augment human judgment, not replace it entirely. Build in review processes and override capabilities.
Data privacy & security
Respect user data. Follow regulations like GDPR, implement proper consent mechanisms, and be transparent about how data is used. Consider data minimisation—collect only what you need.
Safety & reliability
Plan for failure modes. What happens when the AI makes a mistake? Build in fallback mechanisms, monitoring systems, and circuit breakers to prevent harm.
Accountability & governance
Establish clear ownership and responsibility. Who's accountable when something goes wrong? Create governance frameworks and document your AI systems' capabilities and limitations.
Avoid these common pitfalls
Starting with too ambitious a scope
The "boil the ocean" approach rarely works. Start with one specific, well-defined use case. Prove value there, then expand. Small wins build momentum and buy-in for larger initiatives.
Ignoring data quality and preparation
Garbage in, garbage out. Most AI projects fail because of poor data, not poor algorithms. Budget significant time for data cleaning, labelling, and validation. It's unglamorous but critical.
Underestimating change management
AI isn't just a technology problem—it's a people and process problem. Users need training, workflows need redesigning, and stakeholders need ongoing communication. Don't skip this.
Building without clear success metrics
If you can't measure it, you can't improve it. Define success criteria upfront: what metrics will move, by how much, and by when? Without this, you'll never know if your AI is actually working.
Over-relying on AI without validation
AI makes mistakes. Always include human review for critical decisions, especially early on. Trust but verify. Build feedback loops so the system can learn from errors.
Neglecting ongoing maintenance
AI systems drift over time as data and environments change. Plan for continuous monitoring, retraining, and updates. This isn't a "set it and forget it" technology.
Measure success
Efficiency & productivity metrics
Track time saved per task, cost reduction, process speed improvements, and throughput increases. For example: "Customer support resolution time reduced from 4 hours to 45 minutes." These show immediate operational impact.
Quality & accuracy metrics
Measure error rates, prediction accuracy, precision and recall, and user satisfaction scores. Compare AI performance to baseline (human or previous system). Track improvements over time as the model learns.
Adoption & engagement metrics
Monitor daily active users, feature usage rates, user retention, and time spent. Low adoption often signals UX issues or lack of trust. High engagement indicates the AI is providing real value to users.
Business impact metrics
Calculate ROI, revenue impact, customer lifetime value changes, retention rates, and competitive advantage gained. These tie AI initiatives to bottom-line results that executives care about.
System health metrics
Track uptime, response latency, model drift, data quality scores, and incident rates. Technical health metrics ensure your AI remains reliable and performs consistently over time.
Start simple, iterate often
Don't try to measure everything at once. Pick 3-5 key metrics that align with your goals. Review regularly and adjust as you learn what actually matters for your use case.
What businesses should invest in
Strategic priorities for AI transformation
Workforce transformation
The biggest opportunity isn't replacing people—it's enabling them to work at a higher level. Companies that invest in upskilling their workforce and redesigning workflows around AI assistance will see productivity gains that compound over time.
This means training programmes, new processes, and cultural change. The organisations winning with AI aren't just buying tools; they're fundamentally rethinking how work gets done.

AI agents
This is where the real value lies. AI agents that can complete tasks autonomously are delivering 2.1x greater ROI than other AI initiatives, according to BCG research. We're talking about systems that don't just suggest—they execute.
Start with high-impact use cases: customer service automation (think Klarna's approach, done properly), internal workflow automation for meeting notes and data analysis, or code generation and testing like Uber's doing. The key is focusing on 3-5 use cases that matter, not spreading yourself thin across dozens of experiments.
Companies like HCA Healthcare are seeing 40% reductions in review cycles. But here's what matters: you need to redesign workflows around agents, not force-fit them into existing processes. And human oversight? Still critical, no matter what the hype suggests.

Generative UI
Static interfaces are becoming obsolete. Generative UI creates dynamic interfaces based on context, delivering better user experiences than anything we could design manually. ChatGPT's app integrations with Spotify and Zillow prove the concept works—interfaces that adapt to what users actually need, when they need it.
Think context-aware interface changes and personalised user journeys. Companies like Context.ai are reimagining productivity software entirely around this concept. For React developers, frameworks like CopilotKit AG-UI make it practical to implement.
This isn't science fiction—it's happening now. The question isn't whether to explore generative UI, but how quickly you can start experimenting with it in your products.

Expert data collection
Your AI agents are only as good as the knowledge they can access. The companies winning with AI aren't just deploying models—they're systematically capturing expertise from their best people and making it accessible to their agents. This is the difference between an AI that gives generic responses and one that actually understands your business.
Start documenting decision-making processes, common scenarios, and edge cases from your top performers. Record how your experts handle complex situations, what questions they ask, and what factors they consider. Structure this knowledge so AI agents can retrieve and apply it contextually—not just as static documentation, but as living decision trees and examples.
This isn't a one-time project. Build systems where experts can continuously feed knowledge into your AI infrastructure. The organisations that treat expert knowledge as a strategic asset—actively collecting, structuring, and maintaining it—will have AI agents that actually reflect their institutional intelligence, not just what's in public training data.

Emerging technologies
Worth watching, but not ready for production
World models
AI systems that build internal representations of how the world works, enabling better reasoning and prediction.
Quantum computing
Computers that leverage quantum mechanics for exponentially faster processing.
Brain-computer interfaces
Direct communication pathways between the brain and external devices.
Fusion energy
Clean energy generation by replicating the sun's power.



Loop