Combining global and local vision foundation models for explainable tattoo matching
conference paper
Tattoo matching is important for criminal investigations. Vision foundation models have recently improved performance on tasks such as image classification and image retrieval; for retrieval, the gains come from both global models (e.g., CLIP) and local approaches (e.g., OmniGlue). In this paper, we show the added value of combining global and local approaches for explainable tattoo matching. We also investigate a sketchification approach that facilitates the matching process for more abstract tattoos. Finally, we highlight the potential of the local OmniGlue approach as a more explainable alternative to global image-based matching methods such as CLIP.
TNO Identifier
1019350
Source title
Artificial Intelligence for Security and Defence Applications III, September 2025, Madrid, Spain, Proc. SPIE, vol. 13679