Grouped-query attention
Home
Wiki
Redirect to:
Transformer (deep learning architecture)#Grouped-query attention