A Fully Connected Neural Net (FCNN) training model using 1,538 otoliths with 20 random full fold models was run to compare Near Infrared Scans (NIRS) to the Traditional Method of Aging (TMA). Fifteen otoliths were not used for training to look for bias. The total number of oties predicted was 1,553. (See the notes below for other acronym definitions.)
Metadata was also added to the scans for a better fit.
The best prediction, with an impressive R-squared of 0.9669 (APE = 6.257), was found using NIRS scans with the metadata of otolith weight, fish length, fish weight, depth, and latitude. The stats are:
Correlation R_squared RMSE MAE SAD APE N
0.983321 0.966921 1.9744 1.01095 1570 6.25705 1553
FSA (Simple Fisheries Stock Assessment Methods) package's agePrecision() stats:
n validn R PercAgree ASD ACV AAD APE
1553 1553 2 54.09 0.7148 8.849 0.5055 6.257
Other metadata looked at, but not included due to poor performance for this particular FCNN model, were sex, month, and days-into-the-year.
The same model, except without latitude, has an R-squared of 0.9526 (APE = 6.636). Putting back latitude and only removing fish length gives an R-squared of 0.9652 (APE = 7.718). Adding back fish length and removing fish weight results in a model with a lower R-squared of 0.9268, but a reduction in the APE value to 7.127. The spectra only model (no metadata) has an R squared of 0.9378 (APE = 8.788), with a predicted N of 1,556 since there is no missing metadata.
Weight vs length from the 2017, 2018, 2019, 2021, and 2022 Combo Surveys were plotted to understand why including both fish length and fish weight improves the Sablefish NN model. The figure shows that the large females (dark pink circles) above a weight of 5.15 kg no longer fit the standard allometric weight-length relationship (W = a * L^b; gold line), nor does a lowess fit with a reasonable degree of smoothness (green line). Polynomial models with degrees 3, 4, and 5 are very similar to the lowess line fit and are not shown. A model with only female Sablefish shows an almost identical result. A hockey stick type model might be one classic approach that would acheive more balanced residuals on the top end.
Interestingly, a plot of weight vs TMA by length category reveals that the oldest Sablefish are not the heaviest nor the longest. Splitting the females and males into similar separate figures shows the same pattern with, of course, the males being overall smaller than the females.
A FCNN training model using 750 randomly selected otoliths with 20 random full fold models was conducted. The total number of oties predicted was 1,553.
A model, with an R Squared of 0.9476, was found using the NIRS scans along with the metadata of otolith weight, fish length, fish weight, and depth. The same model including latitude was found to fit slightly less well (R squared = 0.9418). This figure highlights in red those otoliths that were not part of the training model but were only predicted. Note that there is no bias. Models with 500 and 250 otoliths in the training model did not perform as well, even when stratified random sampling of otoliths was tried.
A FCNN training model using 1,513 otoliths with 20 random full fold models was executed. Forty otoliths were not used for training to look for bias. The total number of oties predicted was 1,553.
The best model, only using metadata (no scans), had an R squared of 0.8770 using otolith weight, fish length, fish weight, and depth. Adding latitude to the metadata only model did not work well. The stats for the metadata only model are:
Correlation R_squared RMSE MAE SAD APE N
0.936458 0.876953 3.79319 1.93883 3011 11.1122 1553
FSA (Simple Fisheries Stock Assessment Methods) package's agePrecision() stats:
n validn R PercAgree ASD ACV AAD APE
1553 1553 2 42.82 1.371 15.71 0.9694 11.11
For 2022 Combo Sablefish there were 396 TMA double reads for which ager 'NWFSC_1' was the original ager on exactly half of the reads, and the original ager on the other half was 'NWFSC_2'. Of those double reads, 369 had NN predicted ages. NWFSC_1 was the original ager for ~52% and NWFSC_2 was the original ager for ~48% of those 369. This figure, with a APE score of 3.704% (R2 = 0.9707), plots the double reads of NWFSC_2 vs NWFSC_1. This figure, with a APE score of 6.378% (R2 = 0.9573), plots the NN predicted age vs NWFSC_1. TMA age readers strive to achieve an APE score of less than 5% for their double reads of rockfish. With sablefish it really varies and can be higher than that. (Patrick McDonald, personal communication).
The final predicted data has the NN predicted median, the 0.025 lower quantile, and the 0.975 upper quantile over the given number of full 10 fold models (20 in this case). The TMA is also given, if available. The rounded age is the NN predicted median with the Delta added and then rounded (see above).
filenames NN_Pred_Median Lower_Quantile_0.025 Upper_Quantile_0.975 Num_of_Full_10_Fold_Models TMA Age_Rounded
1 SABL_COMBO2022_NIR0022A_PRD_1_102157421_O1 12.7389 7.6131 15.3572 20 14 13
2 SABL_COMBO2022_NIR0022A_PRD_10_102157430_O1 5.5777 4.4615 6.7766 20 6 6
3 SABL_COMBO2022_NIR0022A_PRD_100_102157520_O1 6.8165 5.4652 8.2063 20 6 7
4 SABL_COMBO2022_NIR0022A_PRD_11_102157431_O1 12.2754 9.6108 15.3800 20 16 12
5 SABL_COMBO2022_NIR0022A_PRD_12_102157432_O1 4.9055 3.5046 6.0811 20 6 5
...
The quantiles are a reflection of the NN models precision based on the 20 full 10-fold randomized models, not the accuracy to the TMA Age. This figure, based on the best prediction model above, shows the quantile range on a subset (for clarity) of the predicted data where TMA ages are sorted and this figure shows the same except the predicted ages are sorted. Note that the median, as is well known, is a robust measure of central tendency.
Using 40 full 10-fold randomized models is often a small improvement over 20 full 10-fold models, but 60 full 10-fold models has been seen, in limited testing, to not work as well.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.