Essays · Notes · Technical Deep Dives

WRITINGS

Articles

Updated: June 24, 2026

01
Understanding the Transformer Architecture
June 24, 2026 · 59 min read · Deep Learning
A ground-up walkthrough of the original encoder–decoder Transformer—from masked self-attention and cross-attention through the full encoder and decoder stacks, training-time parallelization, and autoregressive inference.
02
Positional Encoding: Explained from First Principles
June 20, 2026 · 21 min read · Deep Learning
A first-principles explanation of positional encoding in Transformers—from why self-attention needs order information, through sinusoidal encodings at multiple frequencies, to the full PE formula and a worked example.
03
Understanding Self-Attention from First Principles
June 18, 2026 · 58 min read · Deep Learning
A first-principles walkthrough of self-attention—from static embeddings and manual context weighting to Query, Key, and Value projections, scaled dot-product attention, and the full matrix formulation behind modern Transformers.
04
How Attention Learns Where to Look
June 17, 2026 · 28 min read · Deep Learning
A detailed exploration of Bahdanau and Luong attention, including alignment scores, additive vs. multiplicative attention, context vectors, attentional hidden states, and the mathematical foundations behind sequence-to-sequence attention mechanisms.
05
Breaking the Seq2Seq Bottleneck
June 16, 2026 · 12 min read · Deep Learning
An intuition-first explanation of the attention mechanism in sequence-to-sequence models, covering context vectors, alignment scores, softmax, and dynamic attention in neural machine translation.
06
The Encoder-Decoder Architecture
June 15, 2026 · 16 min read · Deep Learning
A walkthrough of the sequence-to-sequence encoder-decoder model, from context vectors and teacher forcing to the improvements introduced in the original paper.
07
The Art of Transfer Learning
June 4, 2026 · 10 min read · Deep Learning
How pre-trained CNNs, feature extraction, and fine-tuning let you adapt deep models without training from scratch.
08
The Anatomy of a RAG System
June 3, 2026 · 12 min read · GenAI
Exploring how retrieval, embeddings, and generation work together to ground LLM responses.
09
The Machinery Behind Bitcoin
August 15, 2025 · 6 min read · Blockchain
An exploration of the processes and machinery that power Bitcoin mining.