Convolutional Autoencoder for Reconstruction of Historical Document Images: Ancient Manuscript Babad Lombok

Fahmi Syuhada, Asno Azzawagama Firdaus, Ana Tsalitsatun Ni'mah, Yuan Sa’adatai, Muhammad Tajuddin

Abstract

The Babad Lombok is an ancient literary or manuscripts document that generally contains stories about the origins of the people of Lombok. This document is written on a lontar leaf, which in the past was used to write manuscripts, letters, and documents. At present, the Babad Lombok document can be seen in the form of photos or scans, so it can be viewed without having to go to a museum or cultural heritage site where the document is usually exhibited. However, because this document is an ancient artifact that has been around for hundreds of years, it has naturally experienced fading in the original document or its scanned versions. This makes the text inside less clear. This paper proposes to automatically reconstruct/repair the Babad Lombok document using a neural network. The type of neural network used is an Autoencoder or Convolutional Autoencoder (CAE). The CAE model is built sequentially and trained using original images of Babad Lombok as its training data and manually corrected images of Babad Lombok as the target or ground truth data. In the process, the two types of data are iteratively cropped to a size of 64x64 along the original size of the Babad Lombok image. This process results in input and target data for the CAE training process in this research, each consisting of 48,288 images. Testing the trained autoencoder model shows that the Babad images have been successfully repaired, making the text quality clearer before reconstruction. Ultimately, the proposed CAE has achieved training and validation accuracies of 89.09% and 94.57%, with corresponding loss values of 0.0418 and 0.0226.

Keywords

Ancient;Manuscripts;Lombok;Reconstruction;Convolutional;Autoencoder

Full Text:

PDF

References

An, Jinwon, and Sungzoon Cho. 2015. “Variational Autoencoder Based Anomaly Detection Using Reconstruction Probability.” Special lecture on IE 2(1): 1–18.

Dixit, Umesh D, and M S Shirdhonkar. 2015. “A Survey on Document Image Analysis and Retrieval System.” International Journal on Cybernetics & Informatics (IJCI) 4(2): 259–70.

Freeman, William T, Egon C Pasztor, and Owen T Carmichael. 2000. “Learning Low-Level Vision.” International journal of computer vision 40: 25–47.

Hammouche, Rabah, Abdelouahab Attia, Samir Akhrouf, and Zahid Akhtar. 2022. “Gabor Filter Bank with Deep Autoencoder Based Face Recognition System.” Expert Systems with Applications 197: 116743.

Hinton, Geoffrey E, Alex Krizhevsky, and Sida D Wang. 2011. “Transforming Auto-Encoders.” In Artificial Neural Networks and Machine Learning--ICANN 2011: 21st International Conference on Artificial Neural Networks, Espoo, Finland, June 14-17, 2011, Proceedings, Part I 21, , 44–51.

Kaur, Roopdeep, Gour Karmakar, and Muhammad Imran. 2023. “Impact of Traditional and Embedded Image Denoising on CNN-Based Deep Learning.” Applied Sciences. doi:10.3390/app132011560.

LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. 2015. “Deep Learning.” nature 521(7553): 436–44.

Liu, Tong, Jinzhen Wang, Qing Liu, Shakeel Alibhai, Tao Lu, and Xubin He. 2021. “High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data.” IEEE Transactions on Big Data 9(1): 22–36.

Liu, Weibo, Zidong Wang, Xiaohui Liu, Nianyin Zeng, Yurong Liu, and Fuad E Alsaadi. 2017. “A Survey of Deep Neural Network Architectures and Their Applications.” Neurocomputing 234: 11–26.

Mallik, Anupama, Santanu Chaudhury, Vijay Chandru, and Sharada Srinivasan. 2017. Digital Hampi: Preserving Indian Cultural Heritage. Springer.

Mao, Xiao-Jiao, Chunhua Shen, and Yu-Bin Yang. 2016. “Image Restoration Using Convolutional Auto-Encoders with Symmetric Skip Connections.” arXiv preprint arXiv:1606.08921.

Mintosih, Sri, Amurwani Dwi Lestarining, and others. 1999. Pengkajian Nilai Budaya Naskah Babad Lombok Jilid 1. Direktorat Jenderal Kebudayaan.

Muhammad Tajuddin Anwar, Syahroni Hidayat, Ahmat Adil. 2021. “Transformasi Lontar Babad Lombok Menuju Digitalisasi Berbasis Natural Gradient Flexible (NGF).” Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK) 8(2).

Otsu, Nobuyuki, and others. 1975. “A Threshold Selection Method from Gray-Level Histograms.” Automatica 11(285–296): 23–27.

Parida, Priyadarsan, and Nilamani Bhoi. 2018. “Feature Based Transition Region Extraction for Image Segmentation: Application to Worm Separation from Leaves.” Future Computing and Informatics Journal 3(2): 262–74. doi:10.1016/J.FCIJ.2018.08.001.

Raha, Poulami, and Bhabatosh Chanda. “Restoration of Historical Document Images Using Convolutional Neural Networks.” 7: 3–8.

Rama Lakshmi, Gali, G Divya, D Bhavya, Ch Sai Jahnavi, and B Akila. 2023. “A Review on Image Denoising Algorithms for Various Applications.” In Proceedings of Fourth International Conference on Communication, Computing and Electronics Systems: ICCCES 2022, , 839–47.

Sehgal, Rashmita, and Vandana Dixit Kaushik. 2022. “CT Image Denoising Using Bilateral Filter and Method Noise Thresholding in Shearlet Domain.” In Emerging Technologies in Data Mining and Information Security: Proceedings of IEMIS 2022, Volume 1, Springer, 99–106.

Sreedevi, Indu, Rishi Pandey, N Jayanthi, Geetanjali Bhola, and Santanu Chaudhury. 2013. “Ngfica Based Digitization of Historic Inscription Images.” International Scholarly Research Notices 2013.

Tajuddin, Muhammad. “Penelitian Naskah Kuno Babad Lombok.” https://tajuddin.web.id/detail-naskah/9.

Tajuddin, Muhammad, Ahmat Adil, Syahroni Hidayat, Zaenal Abidin, and R Fanny Priniti. 2019. “Naskah Lontar Sasak Di Era Industri 4.0 Berbasis Cots Method.” In Prosiding Seminar Sains Nasional Dan Teknologi,.

Tensmeyer, Chris, and Tony Martinez. 2020. “Historical Document Image Binarization: A Review.” SN Computer Science 1(3): 173.

Wang, Mei, and Weihong Deng. 2021. “Deep Face Recognition: A Survey.” Neurocomputing 429: 215–44. doi:10.1016/j.neucom.2020.10.081.

Yasenko, Lev, Yaroslav Klyatchenko, and Oksana Tarasenko-Klyatchenko. 2020. “Image Noise Reduction by Denoising Autoencoder.” In 2020 IEEE 11th International Conference on Dependable Systems, Services and Technologies (DESSERT), , 351–55.

You, Ning, Libo Han, Daming Zhu, and Weiwei Song. 2023. “Research on Image Denoising in Edge Detection Based on Wavelet Transform.” Applied Sciences 13(3): 1837.

DOI

https://doi.org/10.21107/rekayasa.v17i1.26101

Metrics

Refbacks

  • There are currently no refbacks.


Copyright (c) 2024 Fahmi Syuhada, Asno Azzawagama Firdaus, Ana Tsalitsatun Ni'mah, Yuan Sa’adatai, Muhammad Tajuddin

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.