YOLO and OCR-Based Automated Identity Document Verification

Authors: Mohanapriya V, Mohanapriya N, Nandhini P, Dinesh V, Dr. N. Venkatesvara Rao

DOI Link: https://doi.org/10.22214/ijraset.2025.68628

Abstract

Ensuring the authenticity of identity documents is crucial for secure digital transactions and regulatorycompliance. This paperpresentsa Comprehensive AutomatedDocumentVerificationSystemthatutilizesYOLO (YouOnlyLookOnce)for objectdetection andOCR(Optical Character Recognition) for dataextraction toverifyAadhaar cards,PANcards, and Voter IDcards. The systemautomates the verification process by detecting key document features, extractingrelevanttextualdata, andcross-verifyingitagainst predefined templates and databases. By integrating deep learning-based object detection with OCR, the proposed solution achieves high accuracy, efficiency, and scalability, reducing reliance on manual verification and minimizing fraud risks. Experimental results demonstrate the system’s robustness in detecting forged or tampered documents. This research contributes to improving digital security and streamlining identity verification in sectors such as banking, government services, and online KYC processes.

Introduction

Identity verification is essential in sectors like banking, e-governance, and finance. Traditional manual methods are slow, error-prone, and vulnerable to fraud. To address this, the study proposes an AI-based Comprehensive Automated Document Verification System (CADVS) that combines YOLO (You Only Look Once) for real-time object detection and OCR (Optical Character Recognition) for text extraction.

The system focuses on authenticating government-issued IDs (e.g., Aadhaar, PAN, Voter ID), aiming to minimize human intervention, improve accuracy, detect forged documents, and support large-scale deployment.

Key Features:

Architecture: Modular design with separate input, processing, and output modules to ensure efficiency, scalability, and security.
Technologies Used: Deep learning (YOLO), OCR, Python, OpenCV, TensorFlow/PyTorch, cloud services (AWS, GCP), and databases (MySQL, MongoDB).
Security: Includes encryption, access controls, and authentication for data privacy and integrity.
Deployment: Supports cloud, on-premise, and hybrid setups with Docker/Kubernetes for scalability and monitoring.

Related Works Review:

Existing methods face challenges such as poor dataset diversity, text segmentation, noise, and identity fraud risks.
Innovations like blockchain-steganography hybrids and deep learning-based text localization have been explored to boost ID security.
Studies highlight the importance of high-quality datasets and combining traditional and AI-based techniques for robust verification.

Implementation:

Steps: Data collection → preprocessing → model training → system integration → testing → deployment.
YOLO & OCR Modules: YOLO for detecting ID document features; OCR for extracting and validating textual content.
Performance: Processes documents in ~0.17 seconds with high accuracy.

Results & Analysis:

Tested on 1,500 ID documents:
- Aadhaar: 98.5% detection accuracy
- PAN: 96.8%
- Voter ID: 97.1%
- OCR accuracy: 96.5%
- Fraud detection: 95% success rate
Compared to manual verification (12% error rate) and rule-based OCR (6%), CADVS achieves over 10% improvement in detection accuracy.
Real-world deployment (e.g., banks) cut onboarding time by 40%.

Conclusion

This research presents a deep learning-based Automated Document Verification System that integrates YOLO for objectdetection andOCRfortextextraction.Theproposed system effectively automates document verification, reducinghuman intervention and mitigating fraudrisks.The experimentalresults confirm thatthe system achieves high accuracy in document detection, text extraction, and fraud detection, making it an efficient solution for large-scale identity verification applications. The significance of this study lies in its ability to enhance document authentication with minimal processing time, makingithighlyapplicablefor banking, e-governance,and secure identity management. The integration of deep learningmodelsensuresimproved accuracyandrobustness against document forgery, while the fraud detection mechanisms provide an additional layer of security. This research contributes to the growing field of automated identity verification, addressing the challenges of manual verificationandsecuritythreatsposedbydocumentforgery. Future enhancements to this system could include multilingualOCRsupporttoaccommodatevariousregional languages, thereby improving accessibility and usability. AI-driven anomaly detection techniques could be incorporated to enhance the fraud detection capabilities, makingthesystemmoreresilienttosophisticateddocument manipulations. Additionally, integrating blockchain technologyfor identitymanagement could further enhance the security and transparency of the verification process. Overall, the development of CADVS marks a significant advancement in automated document verification, paving thewayformoresecureandefficientidentityauthentication systems.Thefindingsofthisresearchdemonstratethatdeep learning-based verification systems are not only practical but also essential in combating identity fraud in today’s digital landscape. As technology continues to evolve, further refinements and optimizations will continue to enhance the system\'s efficiency, making it a reliable solution for various domains requiring secure and automated identity verification.

References

[1] Redmon,J.,&Farhadi,A.(2018).YOLOv3:An Incremental Improvement. arXiv preprint arXiv:1804.02767. [2] Smith,R.(2007).AnOverviewoftheTesseractOCR Engine. Proc. 9th Int. Conf. Document Analysis and Recognition (ICDAR), 629-633. [3] Patel, R., & Singh, A. (2022). PAN Card Fraud DetectionUsingDeepLearning.IEEEAccess,10, 142876-142890. [4] Breuel, T. M. (2017). High-Performance OCR Using LSTMNetworks.Proc.Int.Conf.DocumentAnalysis and Recognition (ICDAR), 127-131. [5] OpenCV.(2023).OpenSourceComputerVision Library. [6] Goodfellow,I.,Bengio,Y.,&Courville,A.(2016). Deep Learning. MIT Press. [7] Simonyan,K.,&Zisserman,A.(2015).VeryDeep Convolutional Networks for Large-Scale Image Recognition. arXiv preprint arXiv:1409.1556. [8] Kingma,D.P.,&Ba,J.(2015).Adam: AMethodfor Stochastic Optimization. Proc. 3rd Int. Conf. Learning Representations (ICLR). [9] Zhang,X.,Zou,J., He,K.,&Sun,J.(2016). AcceleratingVeryDeepConvolutionalNetworksfor Classification and Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(10), 1943-1955. [10] NIST.(2023).DocumentAuthenticationandForgery Detection: A Deep Learning Approach. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 770-778. [11] Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely Connected Convolutional Networks. Proc. IEEE Conf. Computer VisionandPatternRecognition(CVPR), 4700-4708. [12] Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., … &Houlsby, N. (2021). An Image is Worth 16x16 Words:TransformersforImageRecognitionatScale. arXiv preprint arXiv:2010.11929. [13] Jaderberg, M., Simonyan, K., Vedaldi, A., & Zisserman,A.(2016).ReadingText intheWildwith Convolutional Neural Networks. Int. J. ComputerVision,116(1),1-20.

Copyright

Copyright © 2025 Mohanapriya V, Mohanapriya N, Nandhini P, Dinesh V, Dr. N. Venkatesvara Rao. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download Paper

Paper Id : IJRASET68628

Publish Date : 2025-04-10

ISSN : 2321-9653

Publisher Name : IJRASET

DOI Link : Click Here