Is AGI just “BS” adding to the hype train? A fair question from Reddit
“Unless it has its hand in the physical world, what will it actually solve?”
That line from this Reddit post captures a growing mood. If we still need people to build homes, grow food, and drive kids to school, what’s the point of “AGI” – artificial general intelligence – in 2025?
Short answer: the hype is real, and so are the limits. But even without humanoid robots, advanced AI can make a dent in real-world bottlenecks by tackling the digital work that slows the physical work down.
What people mean by AGI in 2025 (and what we actually have)
AGI is loosely defined as an AI system that can match or exceed human performance across most cognitive tasks. We don’t have that. What we do have are powerful large language models (LLMs) – systems based on the transformer architecture – that are good at language, code, and planning, and can use tools like web browsers, databases, and APIs.
You’ll see terms like:
- LLM: a model that predicts text, now extended to images and audio.
- Agents: LLMs that take actions via tools (e.g., run code, call APIs, click UI elements) to complete tasks.
- RAG (retrieval-augmented generation): a method where the model looks up documents at query time so it talks from your data, not just its training. See the original paper by Lewis et al. (arXiv).
Frontier models are impressive, but they still hallucinate, struggle with long chains of reasoning without verification, and need guardrails. See model documentation for current capabilities and limits: OpenAI research, Anthropic, Google DeepMind.
What AGI-like systems can solve without touching the physical world
Most “real life” work has two halves: physical labour and digital coordination. AI is already useful in the digital half – where delays, errors, and costs pile up.
- Paperwork and compliance: drafting, checking, summarising, and routing documents. In the UK this includes planning applications, contracting, KYC/AML checks, and standard reporting under UK GDPR. The regulator’s guidance is clear on obligations: ICO UK GDPR resources.
- Scheduling and logistics: allocating jobs, vehicles, and stock; replanning when something breaks. That shortens project-critical paths even if AI never lifts a brick.
- Design and estimation: generating options, catching obvious spec or building regs conflicts, producing bills of materials, and cost scenarios.
- Software and IT operations: code generation, test creation, automated runbooks, and first-line incident triage.
- Customer operations: triaging tickets, drafting replies with context, and escalating with a summary and next steps.
- Research and synthesis: scanning policy, scientific literature, or case law, then producing a verifiable brief with cited sources via RAG.
- Education and training: personalised tutoring, practice questions, and assessable feedback for apprenticeships and CPD.
These are the friction points that slow down construction, healthcare, farming, and transport. Remove friction, and the physical work moves faster, cheaper, and with fewer errors.
UK-specific wins you can expect sooner
- NHS and social care: automating letters, referral summaries, coding, and admin validation. Clinical use remains tightly regulated and needs human oversight.
- Local government: faster planning validation, standard checks against building regs, and better citizen comms. Human officers still decide, but the queue shortens.
- SMEs: consistent proposals, invoices, and stock updates. A practical starter is linking models to spreadsheets and forms – see my guide on connecting ChatGPT with Google Sheets.
- Financial and legal services: document review, clause extraction, and risk flags to support expert judgement, with clear audit trails for FCA and SRA obligations.
Where the “it doesn’t build houses” critique is right
AI won’t replace bricklayers, drivers, or farmers. Robotics is improving in warehouses and controlled environments, but outdoor, unstructured tasks are still hard. Regulation and safety constraints keep fully autonomous vehicles and care robots cautious in the UK.
Still, coordination is half the battle:
- Construction: AI can compress design iterations, identify clashes before site work, and streamline procurement. That reduces delays and overruns.
- Agriculture: yield forecasting, input optimisation, and scheduling harvests around weather and logistics – paired with existing precision equipment.
- Transport: dynamic routing, staff rostering, and predictive maintenance. Less downtime, better capacity utilisation.
- Energy: demand forecasting and grid balancing support for operators and suppliers.
Think “human + machine”. AI preps the work, checks constraints, and calls out risks. Humans execute safely and skilfully.
Limits that matter: accuracy, safety, and integration
- Hallucinations and reliability: LLMs confidently state falsehoods. Use RAG, verification steps, and clear escalation to humans.
- Evaluation: measure impact with task-level metrics (throughput, error rate, time-to-resolution), not just “it feels clever”.
- Data protection: map data flows, apply DPIAs, and use data minimisation. Prefer vendors with UK/EU data residency options and robust DPA terms.
- Security and misuse: control tool access, rate-limit actions, log every step, and keep humans in the loop for high-risk actions.
- Change management: the tech is often easier than the process redesign and training.
Costs and availability in the UK
API pricing varies by model and usage and is subject to change (see vendor pricing pages; details often not disclosed in press releases). Many providers now offer EU/UK data residency and enterprise agreements; confirm in the service documentation and data processing addendum.
Open-source models running on your own servers are an option for sensitive workloads, but they require MLOps expertise and careful benchmarking. Frontier APIs are typically faster to deploy and stronger on reasoning, with higher per-token costs.
How to make it practical in your organisation
- Pick a narrow, high-friction process (e.g., drafting and validating standard letters) with clear success metrics.
- Add retrieval from your own documents (RAG) and keep prompts/tool calls transparent and logged.
- Enforce human review for anything regulatory, contractual, financial, or safety critical.
- Integrate with the tools your team already uses (email, spreadsheets, ticketing). For simple wins, try a spreadsheet-driven workflow – this guide walks through a practical setup.
- Iterate with real users; measure time saved, error rates, and satisfaction; scale only after it sticks.
Bottom line: not a panacea, but not pointless
AGI as a sci‑fi endstate is still speculative. The 2025 reality is narrower: strong language-and-tools systems that can automate a chunk of the digital work behind real-world services. That won’t lay bricks or milk cows, but it will pull weeks of admin out of a project plan.
If you’re in the UK and sceptical, you’re right to be. Start with the dull, measurable tasks that sap time and money. If the hype doesn’t translate into fewer errors, faster throughput, and better service, bin it. If it does, you’ll feel the impact – even before the robots turn up.