• news-banner

    Expert Insights

AI and Data Protection

min read

 

Key Takeaways

  • AI should be trained and deployed in ways that uphold core data protection principles.
  • Web scraping for AI training presents challenges when identifying an appropriate lawful basis and fulfilling transparency obligations.
  • Agentic AI introduces new complexities including unclear controllership roles and elevated cyber risks due to greater autonomy.

Understanding the ICO's Approach to Responsible AI

The UK Government and UK Information Commissioner’s Office (ICO) recognise that AI has enormous potential to transform how businesses operate. Rather than rushing to introduce strict AI-specific laws, the UK Government has chosen a more pragmatic path. Its focus is on giving businesses practical guidance to manage data protection risks throughout the entire AI lifecycle, from development to deployment. This approach differs from the European Union, which has introduced dedicated AI legislation.

The ICO is also keeping a close eye on new AI technologies. In its January Tech Futures report, the regulator examined the privacy challenges posed by agentic AI, which refers to AI systems capable of acting more autonomously.

Data Protection Considerations in AI Training

Data protection law applies not just to what an AI tool produces, but also to the personal data used to train it. This creates several practical challenges for organisations. Before using personal data to train AI, businesses must consider whether they have a lawful basis to do so under the UK GDPR, such as relying on legitimate interests.

Informing Individuals and Providing Transparency

Organisations face the difficult task of informing individuals when their personal data is being used to train AI. This can be particularly challenging when there is no direct relationship with those individuals. Businesses must also think carefully about how to communicate their use of AI in a way that builds trust rather than creating suspicion or scepticism among the public.

Lawful Basis for Using Personal Data in AI Training

Under data protection law, organisations need a valid lawful basis to process personal data for AI training. The ICO has confirmed that legitimate interests is the only realistic lawful basis when using personal data scraped from the internet to train generative AI models. However, the ICO has warned that meeting the necessary balancing test is difficult in these circumstances, given the high-risk nature of such processing and the fact that it often happens without individuals' knowledge. The ICO's findings on web scraping align broadly with the European Data Protection Board's view that such techniques can breach transparency, data minimisation, and accuracy obligations.

Key Findings from the ICO's GenAI Outcomes Report

Following a consultation series on generative AI that ran from January to September 2024, the ICO published an outcomes report covering five important areas that will drive regulatory focus:

  • Purpose limitation.
  • Accuracy of training data and AI outputs.
  • Allocation of controllership responsibilities across the GenAI supply chain.
  • Embedding individual rights into AI model design.
  • Assessing the lawful basis for web scraping.

Data Protection Risks Relating to AI Outputs

Core data protection principles such as lawfulness, fairness, accuracy, and accountability can be difficult to apply to AI systems. This is because there is often little visibility into how these systems reach their conclusions, a problem commonly referred to as the "black box" issue. Organisations that use AI without careful thought risk unknowingly breaching fundamental data protection rules.

Risks Linked to Data Quality and Provenance

AI systems, particularly large language models, are often trained on enormous datasets whose origins may be unclear. This means that important decisions about people could be based on data that is low quality, outdated, inaccurate, or biased. The consequences can be serious, including not only poor and unreliable outputs but also results that discriminate against individuals based on protected characteristics such as gender, race, or age.

Additional Risks Introduced by Agentic AI

The ICO's recent report highlights that agentic AI introduces further risks. These include:

  • uncertainty about which organisation is responsible for data protection when multiple parties are involved
  • agents accessing more personal data than necessary;
  • agents drawing inferences about sensitive personal information; and
  • new cybersecurity threats arising from the autonomous nature of these systems.

Our thinking

  • The Playbook to Superscale: Hacks 1-3

    Events

  • From Prime Time to Match Day: Engaging the Female Audience

    Events

  • Women in Leadership: In conversation with Wendy Edwards and Karen Ellis

    Claudine Morgan

    Events

  • Nicola Thorpe and Sally Ashford comment in Law.com International on the importance of trusted, long term relationships in advising high-net-worth clients

    Nicola Thorpe

    In the Press

    min read
  • Upward only Rent Review Ban: Could Your Lease Be Caught Retrospectively?

    Sarah Keens

    Quick Reads

    min read
  • Protecting what matters: Your guide to wills and Powers of Attorney

    Abbie Hook

    Insights

    min read
  • James Riby comments in Today’s Family Lawyer about family, household, and cohabitation trends in the UK

    James Riby

    In the Press

    min read
  • Shona Alexander and Maddie Dunn contribute to Family Law Journal, examining how disputes and relationship breakdowns can impact family farms

    Shona Alexander

    In the Press

    min read
  • 5 Things You Need to Know about Greenwashing

    Kerry Stares

    Insights

    min read
  • Jamie Kennaugh comments in Investors’ Chronicle on how couples can safeguard their finances

    Jamie Kennaugh

    In the Press

    min read
  • EU ESG Ratings Regulation: what providers need to know ahead of the July 2026 deadline

    Kerry Stares

    Insights

    min read
  • Charles Russell Speechlys is shortlisted for Team of the Year: Legal Transformation at The Lawyer Awards 2026

    Tessa Bartley

    News

    min read
  • Anti-greenwashing in the UK and EU: the risk landscape and best practice guidance

    Kerry Stares

    Insights

    min read
  • TCC allows Building Liability Order based on an Adjudicator’s Decision and an ‘Anticipatory’ Building Liability Order

    Michael O'Connor

    Insights

    min read
  • Corporate human rights due diligence – episode 2: practical insights from the experts

    Kerry Stares

    Podcasts

  • The Sky’s the Limit: Arbitrating Aviation Disputes

    Patrick Gearon FCIArb

    Insights

    min read
  • Mike Barrington comments on the impact of Standard Life's Aegon acquisition for the insurance market, in Insurance Business, IFA Magazine, Wealth DFM, Professional Adviser, and International Adviser

    Mike Barrington

    In the Press

    min read
  • eprivateclient features an article by Matt Foster and Sarah Moore on untangling crypto assets in divorce

    Matt Foster

    In the Press

    min read
  • Bloomberg Tax quotes Sally Ashford on the forthcoming HMRC requirement for lawyers to register as tax advisers

    Sally Ashford

    In the Press

    min read
  • Nicola Thorpe comments in The Telegraph on the importance of certainty for non-doms considering moving to the UK

    Nicola Thorpe

    In the Press

    min read
Back to top