Skip to main content
BVDNETBVDNET
ServicesWorkLibraryAboutPricingBlogContact
Contact
  1. Home
  2. AI Woordenboek
  3. Core Concepts
  4. What is Catastrophic Forgetting?
book-openCore Concepts
Advanced
2026-W17

What is Catastrophic Forgetting?

Catastrophic forgetting is when training a neural network on new data overwrites previously learned knowledge, causing it to lose earlier capabilities.

Also known as:
catastrophic interference
catastrofaal vergeten
knowledge forgetting
AI Intel Pipeline
What is Catastrophic Forgetting?

What is Catastrophic Forgetting?

Catastrophic forgetting (also called catastrophic interference) is a phenomenon where a neural network, when trained on new data or tasks, loses previously learned knowledge. The new training overwrites the weights that encoded earlier capabilities, causing dramatic performance degradation on the original tasks.

Why It Matters

Catastrophic forgetting is a fundamental challenge for AI systems that need to learn continuously. It explains why you can't simply keep fine-tuning the same model on new data without risk — the model may lose its general capabilities. This is a core concern for continual learning, personalization, and keeping models up-to-date without full retraining.

How It Works

Why it happens:

  • Neural networks store knowledge distributed across weights (parameters)
  • When trained on new data, gradient updates modify these shared weights
  • If the new task's optimal weights differ from the old task's, old knowledge is overwritten
  • Smaller models are more susceptible (fewer parameters to share)

When it occurs:

  • Fine-tuning — adapting a pre-trained model to a specific task may degrade general knowledge
  • Sequential task learning — training on task A, then task B, then task C causes forgetting of A
  • Data distribution shift — updating a model with data from a different distribution

Mitigation strategies:

  • Regularization methods (EWC, SI) — add a penalty for changing weights that were important for previous tasks
  • Replay methods — mix a small amount of old training data with new data
  • Architecture methods — allocate separate model capacity for different tasks
  • LoRA / adapters — add small trainable layers while freezing the base model (the dominant approach for LLM fine-tuning)
  • Model merging — merge separately fine-tuned models instead of sequential training

Example

A company fine-tunes GPT-4 to be an expert legal document reviewer. After training on thousands of legal documents, the model excels at legal analysis but has forgotten how to write marketing copy, do math, or answer general knowledge questions. The legal training catastrophically overwrote the general capabilities. Using LoRA adapters instead would preserve the base model's knowledge.

Sources

  1. McCloskey & Cohen – Catastrophic Interference in Connectionist Networks (1989)
  2. Kirkpatrick et al. – Overcoming Catastrophic Forgetting in Neural Networks (EWC)

Need help implementing AI?

I can help you apply this concept to your business.

Get in touch

Related Concepts

Tokenizer
A tokenizer converts raw text into tokens — the discrete units a language model processes — using subword algorithms like BPE or SentencePiece.
Artificial Intelligence (AI)
Artificial intelligence is the field of computer science that builds systems capable of performing tasks normally requiring human intelligence, such as learning, reasoning, and perception.
Batch Size
Batch size (examples per update) and learning rate (step size for weight updates) are the two most important hyperparameters controlling how neural networks train.
Benchmark (AI Evaluation)
A benchmark is a standardized test used to measure and compare AI model performance, providing reproducible scores across tasks like reasoning, coding, and knowledge.

AI Consulting

Need help understanding or implementing this concept?

Talk to an expert
Previous

Binex

Next

Chain-of-Thought Prompting

BVDNETBVDNET

Web development and AI automation. Done properly.

Company

  • About
  • Contact
  • FAQ

Resources

  • Services
  • Work
  • Library
  • Blog
  • Pricing

Connect

  • LinkedIn
  • Email

© 2026 BVDNET. All rights reserved.

Privacy Policy•Terms of Service•Cookie Policy