Visualizing ModernBERT Efficiency Features
Padding
Unpadding
Sequence Packing
Local Attention
Global Attention
Reset
Click a button above to see the animation and explanation.
Legend:
Token
Pad
Attending
Attended
Local Window