View source: R/attention-utils.R
Split channels (dimension 2) into multiple heads (becomes dimension 1). x Tensor shape: [batch, length, channels] num_heads integer
1 | split_heads(x, num_heads)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.