Automatic identification of small skeletal remains from Ardales Cave, Málaga, Spain using a Vision Transformer model

Authors

  • Carmina P. Baylon ⋅ PH Data Science Program, University of the Philippines Diliman
  • Chara Deanna Punzal ⋅ PH Data Science Program, University of the Philippines Diliman
  • Patricia Cabrera ⋅ PH School of Archaeology, University of the Philippines Diliman
  • Jos´e Ramos-Mu˜noz ⋅ ES University of C´adiz, C´adiz
  • Gerd-Christian Weniger ⋅ DE University of Cologne, Cologne
  • Pedro Cantalejo Duarte ⋅ ES Ardales Caves and Rinc´on de la Victoria, M´alaga
  • Juan Rofes ⋅ PH School of Archaeology, University of the Philippines Diliman and Arch´eozoologie, Arch´eobotanique Soci´et´es, Pratiques et Environnements (AASPE), CNRS/MNHN
  • Giovanni A. Tapang ⋅ PH National Institute of Physics, University of the Philippines Diliman

Abstract

A Vision Transformer (ViT) model was used to classify small zooarchaeological bone assemblages from Ardales Cave, Málaga, Spain, across three taxonomic orders (Rodentia, Lagomorpha, and others), achieving 76% validation accuracy after approximately 100 epochs. The model was fine-tuned from an ImageNet-21k-pretrained backbone on a dataset of 417 images with severe class imbalance. This work applies the ARCHAEOVISION pipeline from Philippine tropical fauna to European Paleolithic assemblages, demonstrating the generalizability of its architecture to other archaeological sites.

Published

2026-06-06

How to Cite

[1]
CP Baylon, CD Punzal, P Cabrera, J Ramos-Mu˜noz, G-C Weniger, PC Duarte, J Rofes, and GA Tapang, Automatic identification of small skeletal remains from Ardales Cave, Málaga, Spain using a Vision Transformer model, in Proceedings of the 44th Samahang Pisika ng Pilipinas Physics Conference (Philippines, 2026), SPP-2026-1A-06. URL: https://proceedings.spp-online.org/article/view/SPP-2026-1A-06