nn_ft_cls | R Documentation |
Concatenates a CLS token to the input as the last feature.
The input shape is expected to be (batch, n_features, d_token)
and the output shape is
(batch, n_features + 1, d_token)
.
This is used in the FT-Transformer.
nn_ft_cls(d_token, initialization)
d_token |
( |
initialization |
( |
Devlin, Jacob, Chang, Ming-Wei, Lee, Kenton, Toutanova, Kristina (2018). “Bert: Pre-training of deep bidirectional transformers for language understanding.” arXiv preprint arXiv:1810.04805.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.