Skip to content

Models

Approach

For a specific token classification dependent problem, an artificial intelligence engineer can


Selecting the best model

For this exercise, the best model was selected by comparing a testing phase metric, specifically Matthews Correlation Coefficient (MCC):

\[MCC = \frac{(tn \bullet tp) - (fn \bullet fp)}{{\Large{[}}(tp + fp)(tp + fn)(tn + fp)(tn + fn){\Large{]}}^{0.5}}\]
\[MCC \in [-1, \quad +1]\]

wherein tn, tp, fn, and fp denote true negative, true positive, false negative, and false positive, respectively.


Warning

Note, the best model of a set must undergo (a) mathematical evaluation, and (b) business/cost evaluation. The latter is critical because an acceptable mathematical metric, e.g., \(precision > 0.9\) does not necessarily lead to excellent business/cost metrics.