Recent Posts

Cross Entropy Loss Connection to GPT Models

  1 min read

Cross-entropy loss isn’t a heuristic — it is maximum likelihood estimation with a sign flip. It also shows how the same math powers GPT training.