Attention Is All You Need · 2026 A1
COMP10002 Foundations of Algorithms
The core ideas you need for the assignment, plus optional demos, tooling, and background if you want extra help.
A mechanism that lets one position decide how much other positions should influence it.
How a single active slot can represent a category or label.
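That idea fits in a few lines of C. A minimal sketch; the helper name `one_hot` is illustrative, not from the assignment code:

```c
#include <string.h>

/* Fill vec[0..n-1] with zeros, then turn on the single slot for `label`.
   Which slot is active is what encodes the category. */
void one_hot(float *vec, int n, int label)
{
    memset(vec, 0, (size_t)n * sizeof *vec);
    if (label >= 0 && label < n)
        vec[label] = 1.0f;
}
```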
How one vector is transformed into query, key, and value versions used by attention.
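Each of the three versions is just the same input vector multiplied by a different weight matrix. A hedged sketch of that single building block (`matvec` and the row-major layout are assumptions of this example):

```c
/* out = W * x, where W has d_out rows and d_in columns, stored row-major.
   Applying three different matrices W_q, W_k, W_v to the same x gives
   its query, key, and value vectors. */
void matvec(const float *W, const float *x, float *out, int d_out, int d_in)
{
    for (int i = 0; i < d_out; i++) {
        float sum = 0.0f;
        for (int j = 0; j < d_in; j++)
            sum += W[i * d_in + j] * x[j];
        out[i] = sum;
    }
}
```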
A way to measure how strongly two vectors line up, often used to build attention scores.
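The standard such measure is the dot product, which is all of a few lines in C:

```c
/* Dot product of two n-vectors: large when the vectors line up,
   near zero when they are unrelated. Raw attention scores are
   typically built from exactly this quantity. */
float dot(const float *a, const float *b, int n)
{
    float sum = 0.0f;
    for (int i = 0; i < n; i++)
        sum += a[i] * b[i];
    return sum;
}
```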
A function that turns raw scores into non-negative weights that add up to 1.
Rules that prevent certain positions from being used, such as future tokens or padding.
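One common way to implement such a rule, sketched here under the assumption that a softmax runs afterwards: set the score of every forbidden position to negative infinity, so it receives exactly zero weight.

```c
#include <math.h>

/* mask[i] == 0 marks a forbidden position (a future token, or padding).
   expf(-INFINITY) is 0, so after softmax a masked position gets zero
   weight and cannot influence the result. */
void apply_mask(float *scores, const int *mask, int n)
{
    for (int i = 0; i < n; i++)
        if (!mask[i])
            scores[i] = -INFINITY;
}
```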
A store of previously computed keys and values so generation can reuse past work.
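A minimal version of such a cache can be sketched as a struct holding the key and value vectors seen so far; the capacity, dimension, and names below are assumptions of this example, not the assignment's layout:

```c
#include <string.h>

#define KV_MAX 64   /* capacity: assumed for this sketch */
#define KV_DIM 4    /* vector dimension: assumed for this sketch */

/* Keys and values for positions 0..len-1, computed once and then
   reused on every later generation step instead of being recomputed. */
typedef struct {
    float keys[KV_MAX][KV_DIM];
    float vals[KV_MAX][KV_DIM];
    int len;
} kv_cache_t;

/* Append the key/value pair for the newest position.
   Returns 1 on success, 0 if the cache is full. */
int kv_append(kv_cache_t *c, const float *k, const float *v)
{
    if (c->len >= KV_MAX)
        return 0;
    memcpy(c->keys[c->len], k, sizeof c->keys[0]);
    memcpy(c->vals[c->len], v, sizeof c->vals[0]);
    c->len++;
    return 1;
}
```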
A guided reading of the provided `a1.c`, with direct links from the main spec and tooltip-style explanations for the names, arrays, and helper functions you keep seeing.
A small concrete example of the assignment input format, showing what a real input file looks like and how to read each line.
How to compile with stronger warnings, run useful checking tools, and catch bugs in your own `a1.c` before submission.
The small amount of C string handling you may need for Stage 1: storing tokens, comparing them, sorting them, and copying them safely.
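The copying step is the one that most often goes wrong. A hedged sketch of a safe fixed-buffer copy (the limit `MAX_TOK` is illustrative, not the assignment's):

```c
#include <string.h>

#define MAX_TOK 30   /* illustrative token-length limit */

/* Copy src into a fixed-size buffer without overflowing it, always
   leaving dest '\0'-terminated even when src is too long. */
void copy_token(char dest[MAX_TOK + 1], const char *src)
{
    strncpy(dest, src, MAX_TOK);
    dest[MAX_TOK] = '\0';
}
```

Comparing two stored tokens is then just `strcmp(a, b)`: negative, zero, or positive for before, equal, or after.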
A practical guide to using `qsort(...)` for Stage 1 and understanding the comparison function it calls.
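The part that trips most people up is the comparison function's types. `qsort` passes the comparator pointers to two *elements*; if the array holds `char *`, each `void *` argument is really a `char **`:

```c
#include <stdlib.h>
#include <string.h>

/* Comparator for qsort over an array of strings (char *). Each void*
   points at an element of the array, so it must be dereferenced once
   before strcmp sees the actual strings. */
int cmp_str(const void *a, const void *b)
{
    const char *sa = *(const char *const *)a;
    const char *sb = *(const char *const *)b;
    return strcmp(sa, sb);
}
```

A call then looks like `qsort(tokens, n, sizeof tokens[0], cmp_str);` for an array `char *tokens[]` of `n` entries.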
A concrete numeric walkthrough that connects the assignment data back to token positions and attention weights.
How text is split into tokens, then converted into numeric vectors a model can work with.
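The splitting half can be sketched with `strtok` over a writable buffer (the whitespace-only separator set is an assumption of this example; real tokenisers use subword schemes):

```c
#include <string.h>

/* Split a writable line into whitespace-separated tokens in place.
   strtok overwrites each separator with '\0' and returns pointers
   into the original buffer. Returns the number of tokens found. */
int tokenize(char *line, char *tokens[], int max_tokens)
{
    int n = 0;
    for (char *t = strtok(line, " \t\n");
         t != NULL && n < max_tokens;
         t = strtok(NULL, " \t\n"))
        tokens[n++] = t;
    return n;
}
```

The second half, turning each token into a numeric vector, is then a lookup of the token in a vocabulary table.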
How to read the compact notation used in the original Transformer paper and later ones.
A one-page view of the larger model architecture so you can see where this assignment fits.
How to read the boxes, arrows, repeated stacks, and omissions in the architecture figures that appear in papers.
Why attention mattered historically, including the 2017 paper and why the impact took time to become obvious.
What terms like “7B parameters” mean, where those numbers live, and what they imply for memory.
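The memory side of that is plain arithmetic: parameter count times bytes per parameter. A hedged sketch (2 bytes per parameter assumes 16-bit weights; other formats change the factor, and activations add more on top):

```c
/* GiB needed just to hold the weights: n_params parameters at
   bytes_per_param bytes each. Ignores activations and overhead. */
double weights_gib(double n_params, double bytes_per_param)
{
    return n_params * bytes_per_param / (1024.0 * 1024.0 * 1024.0);
}
```

For example, 7 billion parameters at 2 bytes each is roughly 13 GiB of weights alone.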
Why the assignment was developed as a human-AI collaboration, what stayed human, and what AI helped with.
The main assignment page links the needed material inline at the point where it matters, so you can read straight through this page or jump in from the spec as needed.