
Reimplemented the famous "Show, Attend and Tell" paper in PyTorch and extended it with adaptive attention (λ-tuned sentinel gating) and visual question answering (VQA). Analyzed the tradeoffs of soft attention mechanisms for vision-language tasks and presented a poster.
Cornell CS 4782: Deep Learning · team of 3 · May 2026





