AI Glossary
Key AI/ML terms explained simply
LLM (Large Language Model)
A large AI model trained on massive text data to understand and generate human language. Examples: GPT-4, Claude, Llama.
Transformer
The neural network architecture behind modern language models. Uses an "attention" mechanism to weigh how relevant each token is to every other token in the input.
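The core "attention" operation can be sketched in a few lines: score every position against a query, softmax the scores, then take a weighted mix of the values. This is a single-query, single-head toy version; real transformers run many attention heads over learned projections of the input.

```python
import math

def attention(query, keys, values):
    # Scaled dot-product attention for one query vector:
    # 1) score each key against the query (dot product, scaled by sqrt(dim)),
    # 2) softmax the scores into weights,
    # 3) return the weighted mix of the value vectors.
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    m = max(scores)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(len(values[0]))]

keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]
out = attention([5.0, 0.0], keys, values)  # query matches the first key, so output leans to the first value
```

Because the query aligns with the first key, most of the attention weight goes to the first value vector.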
Token
The basic unit AI processes. Can be a word, part of a word, or a single character. In English text, 1 token is roughly 4 characters on average.
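The 4-characters-per-token rule of thumb gives a quick cost estimate without running a real tokenizer. A minimal sketch (the function name and heuristic are illustrative, not any library's API):

```python
def estimate_tokens(text: str) -> int:
    # Rough estimate: English text averages about 4 characters per token.
    # Real tokenizers (e.g. BPE-based ones) vary by language and vocabulary.
    return max(1, len(text) // 4)

print(estimate_tokens("Large language models process text as tokens."))  # 11
```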
Fine-tuning
Training a pre-trained model on specific data to improve it for a particular task.
Prompt
The input you give to an AI model. Output quality depends heavily on prompt quality.
Embedding
Converting text into a vector of numbers that captures its meaning. Used for search and similarity comparison.
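Similarity between embeddings is usually measured with cosine similarity. A toy sketch with hand-made 3-dimensional vectors (real embeddings come from a model and have hundreds or thousands of dimensions):

```python
import math

def cosine_similarity(a, b):
    # 1.0 = vectors point the same way (similar meaning); near 0 = unrelated.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hand-made toy "embeddings" chosen so related words point the same way.
cat = [0.9, 0.1, 0.0]
kitten = [0.85, 0.15, 0.05]
car = [0.1, 0.0, 0.95]
```

Here "cat" scores much closer to "kitten" than to "car", which is exactly what embedding-based search relies on.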
Temperature
Controls randomness when sampling the next token. Higher = more varied and creative, lower = more predictable.
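Under the hood, temperature divides the model's raw scores (logits) before the softmax. A sketch with made-up logit values:

```python
import math

def temperature_probs(logits, temperature):
    # Scale logits by 1/temperature, then softmax into probabilities.
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]                 # made-up scores for three candidate tokens
cold = temperature_probs(logits, 0.1)    # top token takes almost all the probability
hot = temperature_probs(logits, 10.0)    # distribution flattens toward uniform
```

At low temperature the best-scoring token dominates (near-deterministic output); at high temperature the probabilities flatten, so sampling becomes more varied.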
Context Window
The maximum number of tokens the model can process at once (prompt plus response). GPT-4o, for example, has a 128K-token context window.
Zero-shot Learning
AI performs a task without specific training, using general knowledge.
Few-shot Learning
Giving AI a few examples in the prompt to understand the task.
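Few-shot prompting is just formatting labelled examples into the prompt itself. A sketch for a sentiment task (the template and labels are arbitrary choices, not a required format):

```python
def few_shot_prompt(examples, query):
    # Lay out each (text, label) example, then the real query with the label left blank.
    blocks = [f"Review: {text}\nSentiment: {label}" for text, label in examples]
    blocks.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(blocks)

examples = [
    ("Great product, works perfectly.", "positive"),
    ("Broke after two days.", "negative"),
]
print(few_shot_prompt(examples, "Exceeded my expectations."))
```

The prompt ends at "Sentiment:" so the model's natural continuation is the label itself.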
RAG (Retrieval Augmented Generation)
Retrieving relevant documents and including them in the prompt so the model generates more accurate, grounded answers.
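A minimal RAG pipeline retrieves relevant text, then prepends it to the prompt. The sketch below uses naive keyword overlap for retrieval; production systems use embedding similarity over a vector store instead.

```python
import re

def _words(text):
    # Lowercased word set, ignoring punctuation.
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(query, documents, k=2):
    # Score each document by word overlap with the query; keep the top k.
    q = _words(query)
    return sorted(documents, key=lambda d: len(q & _words(d)), reverse=True)[:k]

def rag_prompt(query, documents, k=2):
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "The Eiffel Tower is in Paris.",
    "Python is a popular programming language.",
    "Paris is the capital of France.",
]
print(rag_prompt("Where is the Eiffel Tower?", docs, k=1))
```

The model answers from the retrieved context rather than from memory alone, which reduces hallucination.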
Hallucination
When AI generates confident but false information. A known limitation.
Inference
Running an AI model to generate output. Different from training.
Parameters
Internal values the model learns. More parameters usually = more capable.
Quantization
Reducing model size by storing weights in fewer bits (e.g. 8-bit integers instead of 16- or 32-bit floats). Lets the model run in less RAM, often faster, at a small cost in accuracy.
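The core idea can be shown with symmetric int8 quantization: map each float weight to an 8-bit integer plus one shared scale factor. A toy sketch (real schemes are per-channel and more sophisticated):

```python
def quantize_int8(weights):
    # One shared scale maps the largest |weight| to 127, the int8 maximum.
    scale = max(abs(w) for w in weights) / 127
    return [round(w / scale) for w in weights], scale

def dequantize(quantized, scale):
    # Recover approximate floats; the rounding error is the accuracy cost.
    return [q * scale for q in quantized]

weights = [0.12, -0.5, 0.33, 0.9]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)  # close to the originals, stored in a quarter of float32's bits
```

Each weight now fits in one byte instead of four, and the reconstruction error is bounded by half the scale factor.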
System Prompt
Instructions that set the AI's persona and behavior for an entire conversation.
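In chat APIs, the system prompt is typically the first message in the conversation. A generic sketch (the role/content structure follows the common chat-message convention; exact schemas vary by provider):

```python
# Typical chat-message structure: the "system" message shapes all later replies,
# while "user" messages carry the actual requests.
messages = [
    {"role": "system", "content": "You are a concise technical assistant."},
    {"role": "user", "content": "Explain tokens in one sentence."},
]
```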