Module candle_transformers::utils

source ·

Functions§

  • Repeats a key or value tensor for grouped query attention The input tensor should have a shape (batch, num_kv_heads, seq_len, head_dim),