Autonomous reconstruction of strip-shredded documents via self-supervised deep learning and global optimization

Yi-Chang Wu, Pei-Shan Chiang, Yao-Cheng Liu

Abstract


Autonomous reconstruction of mechanically shredded documents is a labor-intensive challenge in forensic and archival workflows, particularly for scripts with complex structures such as Simplified Chinese. While traditional manual reassembly is tedious, existing digital tools typically rely on extensive human intervention. This paper presents an automated reassembly framework that integrates a lightweight convolutional feature extractor with global combinatorial optimization. By adapting the established SqueezeNet v1.1 backbone, we employ a task-specific self-supervised learning strategy trained on synthetically shredded samples, enabling the adapted model to capture local stroke continuity and edge-geometry cues without manual annotation. The framework infers pairwise relationships from calibrated edge-region inputs, organizing compatibility scores into an asymmetric traveling salesman problem (ATSP) formulation. The optimal fragment sequence is solved deterministically using the Concorde TSP solver, yielding a globally consistent reconstruction. Experimental results on physically shredded documents demonstrate reconstruction accuracies of 86.5% for Simplified Chinese and 94.8% for Western scripts. These results indicate that the proposed pipeline effectively generalizes from synthetic training data to real-world scenarios, providing a practical, high-throughput foundation for automated document recovery under computational constraints typical of robotic or embedded systems.

Keywords


Autonomous reconstruction; Chinese text processing; Forensic science; Fully convolutional neural networks; Global optimization; Self-supervised learning; Strip-shredded documents

Full Text:

PDF


DOI: http://doi.org/10.11591/ijra.v15i1.pp107-121

Refbacks

  • There are currently no refbacks.


Copyright (c) 2026 Yi-Chang Wu, Pei-Shan Chiang, Yao-Cheng Liu

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Robotics and Automation (IJRA)
ISSN 2089-4856, e-ISSN 2722-2586

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

Web Analytics Made Easy - Statcounter IJRA Visitor Statistics