Feature Space
3
classes
1
feature(s)
Softmax
Weights
Weights
(Scroll down to see how)
(The weight vector values are in the top right corner)
But we can scale the weight vector for Class b
(But scaling the weight for Class d this way makes it argmaxable again)
See Demeter et al (2020) and Cover (1967)
For instance, this region ranks d > b > c > a
We want points for which the probability of Class d is greater than that of a, b and c
In our paper, we search for unargmaxable classes in 150 Language Models and Machine Translation Models.
We find that unargmaxable classes are infrequent and unlikely to impact model quality.
Phenomena may vary across models, our code is available here if you want to test your own.