Posts by Tag

llm

Cross Entropy Loss Connection to GPT Models

  1 min read

Cross-entropy loss isn’t a heuristic, it is maximum likelihood estimation with a sign flip. It also shows how the same math powers GPT training.

Back to Top ↑

python

Banco Data Model

  1 min read

this line will show up as preview on the posts page

Back to Top ↑

machine learning

Banco Data Model

  1 min read

this line will show up as preview on the posts page

Back to Top ↑

scikit-learn

Banco Data Model

  1 min read

this line will show up as preview on the posts page

Back to Top ↑

deep-learning

Back to Top ↑

r

Back to Top ↑

superml

Back to Top ↑

text matching

Back to Top ↑

sqlite

Back to Top ↑

asyncio

Back to Top ↑

parallel

Back to Top ↑

vllm

Back to Top ↑

gpu

Back to Top ↑

pytorch

Back to Top ↑

transformers

Back to Top ↑

rope

Back to Top ↑

learning-to-rank

Back to Top ↑

lightgbm

Back to Top ↑

fine-tuning

Back to Top ↑

deeplearning

Cross Entropy Loss Connection to GPT Models

  1 min read

Cross-entropy loss isn’t a heuristic, it is maximum likelihood estimation with a sign flip. It also shows how the same math powers GPT training.

Back to Top ↑