100 AI Model Training Dataset Size = 250% Annual Growth Over Fifteen Years, per Epoch AI Note: In AI language models, tokens represent basic units of text (e.g., words or sub-words) used during training. Training dataset sizes are often measured in total tokens processed. A larger token count typically reflects more diverse and extensive training data, which can lead to improved model performance – up to a point – before reaching diminishing returns. Source: Epoch AI (5/25) AI Model Training Dataset Size (Tokens) by Model Release Year – 6/10-5/25, per Epoch AI Training Dataset Size, Tokens CapEx Spend – Big Technology Companies = Inflected With AI’s Rise +250% / Year
