TY - JOUR
T1 - A Comparison of Machine Learning Techniques for Taxonomic Classification of Teeth from the Family Bovidae
AU - Matthews, Gregory J
AU - Brophy, Juliet K
AU - Luetkemeier, Maxwell P
AU - Gu, Hongie
AU - Thiruvathukal, George K.
N1 - G. J. Matthews, J.K. Brophy, M. P. Luetkemeier, H. Gua, and G. K. Thiruvathukal, A comparison of machine learning techniques for taxonomic classification of teeth from the Family Bovidae, Journal of Applied Statistics (2018), https://doi.org/10.1080/02664763.2018.1441381
PY - 2018/3/5
Y1 - 2018/3/5
N2 - This study explores the performance of machine learning algorithms on the classification of fossil teeth in the Family Bovidae. Isolated bovid teeth are typically the most common fossils found in southern Africa and they often constitute the basis for paleoenvironmental reconstructions. Taxonomic identification of fossil bovid teeth, however, is often imprecise and subjective. Using modern teeth with known taxons, machine learning algorithms can be trained to classify fossils. Previous work by Brophy et al. [ Quantitative morphological analysis of bovid teeth and implications for paleoenvironmental reconstruction of plovers lake, Gauteng Province, South Africa , J. Archaeol. Sci. 41 (2014), pp. 376–388] uses elliptical Fourier analysis of the form (size and shape) of the outline of the occlusal surface of each tooth as features in a linear discriminant analysis (LDA) framework. This manuscript expands on that previous work by exploring how different machine learning approaches classify the teeth and testing which technique is best for classification. In addition to LDA, four other machine learning techniques were considered (neural networks, nuclear penalized multinomial regression,random forests, and support vector machines) with support vector machines and random forests performing the best in terms of log loss and classification rate.
AB - This study explores the performance of machine learning algorithms on the classification of fossil teeth in the Family Bovidae. Isolated bovid teeth are typically the most common fossils found in southern Africa and they often constitute the basis for paleoenvironmental reconstructions. Taxonomic identification of fossil bovid teeth, however, is often imprecise and subjective. Using modern teeth with known taxons, machine learning algorithms can be trained to classify fossils. Previous work by Brophy et al. [ Quantitative morphological analysis of bovid teeth and implications for paleoenvironmental reconstruction of plovers lake, Gauteng Province, South Africa , J. Archaeol. Sci. 41 (2014), pp. 376–388] uses elliptical Fourier analysis of the form (size and shape) of the outline of the occlusal surface of each tooth as features in a linear discriminant analysis (LDA) framework. This manuscript expands on that previous work by exploring how different machine learning approaches classify the teeth and testing which technique is best for classification. In addition to LDA, four other machine learning techniques were considered (neural networks, nuclear penalized multinomial regression,random forests, and support vector machines) with support vector machines and random forests performing the best in terms of log loss and classification rate.
KW - Classification
KW - machine learning
KW - anthropology
UR - https://ecommons.luc.edu/math_facpubs/28
UR - http://export.arxiv.org/pdf/1802.05778
U2 - 10.1080/02664763.2018.1441381
DO - 10.1080/02664763.2018.1441381
M3 - Article
SN - 0266-4763
VL - 45
JO - Journal of Applied Statistics
JF - Journal of Applied Statistics
IS - 12
ER -