Posts tagged with 'ai'
-
Style vs Substance in Chatbot Arena
How LMSYS corrects for "pretty formatting" bias when ranking LLMs.
-
Most LLMs Still Lean on AdamW
From BERT to today's open-source GPT clones, the decoupled weight-decay trick in AdamW remains the default. (T5 is the main outlier, preferring Adafactor at pre-train time.)
-
Dead Internet Theory - Bots Run the Web
TIL about the claim that bots now outnumber humans online—and what the numbers actually say.
-
Key Concepts Behind QLoRA Fine-Tuning
Quantization + low-rank adapters let you fine-tune huge LLMs on a single GPU.
-
Why logits.exp() Equals Counts
Interpreting logits as log-counts: exponentiating recovers counts, and working in the log domain turns multiplicative interactions into additive ones.
-
Understanding Perplexity in Language Model Evaluation
A concise guide to the perplexity metric: how it's calculated and why it matters for LLMs.
-
KL Divergence and Cross-Entropy Loss
How cross-entropy loss is just KL divergence in disguise—and when to use each.
-
Why sharing GPU power between AI servers and personal PCs doesn't really work
A practical look at why latency, hardware heterogeneity, and security concerns make distributed GPU computing impractical.
-
Chain of Draft to Speed Up LLM Reasoning
Chain of Draft (CoD) prompts LLMs to use short, minimal reasoning steps, achieving near-CoT accuracy with far lower token use and latency.
-
FAISS vs pgvector - why one's a library and the other's a database
FAISS is a rocket-fast in-memory index; pgvector is Postgres with vectors. Here's when to pick each, with code.