nn#
Experimental API. See the NNX page for more details.
- Activation functions
- Attention
MultiHeadAttention
MultiHeadAttention.num_heads
MultiHeadAttention.dtype
MultiHeadAttention.param_dtype
MultiHeadAttention.qkv_features
MultiHeadAttention.out_features
MultiHeadAttention.broadcast_dropout
MultiHeadAttention.dropout_rate
MultiHeadAttention.deterministic
MultiHeadAttention.precision
MultiHeadAttention.kernel_init
MultiHeadAttention.bias_init
MultiHeadAttention.use_bias
MultiHeadAttention.attention_fn
MultiHeadAttention.decode
MultiHeadAttention.normalize_qk
MultiHeadAttention.init_cache()
combine_masks()
dot_product_attention()
make_attention_mask()
make_causal_mask()
- Initializers
- Linear
- Normalization
- Stochastic