The project aims to automate invoice processing and expense tracking by integrating Optical Character Recognition (OCR), Natural Language Processing (NLP), and Machine Learning (ML). It extracts invoice data (e.g., date, items, cost) using Tesseract OCR, structures it into formats like JSON/CSV, and processes it using OpenAI\'s GPT models and XGBoost for categorization and future expense predictions. The application is built using the Django framework, styled with Tailwind CSS, and deployed via cloud platforms using Docker, AWS, and Apache Kafka for real-time data flow.
Key Components:
1. Data Acquisition & Preprocessing:
Invoices are uploaded and stored securely.
Preprocessing includes grayscale conversion, resizing, noise removal, and skew correction to enhance OCR accuracy.
2. OCR & Text Extraction:
Text is extracted using PyTesseract and processed with OpenCV.
Output is cleaned and structured for easier data extraction.
3. Data Extraction & Structuring:
Regular expressions and NLP extract entities like vendor name, date, total, and items.
Structured data is stored in relational databases or JSON format for analytics and retrieval.
4. NLP & Categorization:
GPT-3.5-turbo-instruct analyzes text and classifies expenses into predefined categories using dynamic prompts.
This reduces manual classification effort and improves accuracy.
5. Expense Prediction using Machine Learning:
Uses XGBoost for forecasting monthly expenses based on features like historical spending, day of the week, and vendor behavior.
Handles outliers well and offers insight into which features most affect predictions.
6. Data Visualization:
Expense trends are visualized using Matplotlib, Seaborn, and Plotly.
Forecasts are displayed alongside historical data on a Django-based dashboard, enhancing user insights and planning.
7. Django Framework & Admin Panel:
Follows the Model-View-Template (MVT) pattern.
Django ORM handles data modeling and retrieval.
Admin interface provides tools for invoice review, category management, and system monitoring.
8. System Deployment:
Uses Docker, AWS Elastic Beanstalk, Heroku, and AWS Lambda for scalable, serverless deployment.
Apache Kafka enables asynchronous, real-time data processing between services.
Role-based access, encryption, and modular microservices ensure high security and responsiveness.
Evaluation & Results:
Prediction Metrics:
MAE, RMSE, and R² used to evaluate accuracy.
Good prediction accuracy in categories like Transportation (MAE: 23.91) and Entertainment (MAE: 34.52).
Lower performance in Utilities and Clothing indicates room for improvement with enhanced features or models.
Invoice Accuracy:
OCR and GPT successfully extracted and matched invoice fields (date, item, amount, category) with high precision.
The system demonstrated effective, real-time processing for diverse invoice formats.
Literature Review:
References focus on Tesseract OCR, image preprocessing (binarization, resizing, skew correction), and early systems that used OCR for invoice extraction.
Prior systems lacked ML-based forecasting or NLP categorization, which this project integrates for improved functionality and scalability.
Introduction
1. Background and Need
The rise of online education has increased the need for secure and fair remote assessment solutions. Traditional exam supervision methods struggle to maintain integrity in virtual environments. To address this, AI and camera vision technologies are being applied to create smarter, more effective proctoring systems.
2. Proposed System – CamProctor
CamProctor is an AI-driven remote proctoring platform that uses Temporal Convolutional Networks (TCNs) and facial recognition for intelligent, real-time surveillance during online exams.
Key Features:
Facial Recognition: Confirms student identity before and during the exam.
TCNs: Analyze time-series video data to detect suspicious or dishonest behavior.
Automated Alerts: Notify proctors immediately when abnormal activity is detected.
3. Core Components & Methods
A. Exam Hall Surveillance
Real-time video monitoring using camera systems
Automated detection of rule violations and misconduct
B. Control Panels
Admin Panel: Manage users, subjects, and store facial data
Faculty Panel: Monitor students, receive alerts, supervise live exams
C. Face Recognition Workflow
Extract facial features using CNN
Train and deploy a facial identification model
Verify students during exams with live face matching
D. Student Verification
Capture live facial data during login
Match with stored data to confirm identity
Display seating info and exam room automatically
E. Malpractice Detection
Monitor behavioral changes over time (e.g., eye movement, head turning)
Identify and flag inconsistent or suspicious actions
F. Alert & Notification System
Audio alerts for in-room awareness
Notifications sent to relevant staff (admin, invigilators) for timely action
4. Results and Impact
Accurate Behavior Detection: System identified suspicious behaviors like head-turning and multiple people in the frame.
Reliable Monitoring: AI-driven approach proved more consistent and impartial than human invigilators.
Reduced False Positives: Behavioral models helped distinguish between normal and suspicious actions.
Discouraged Cheating: Continuous AI monitoring led to a noticeable drop in malpractice.
Detailed Reporting: All flagged incidents were logged and summarized for review.
Conclusion
The execution of this project has delivered notable outcomes, affirming its effectiveness in the domain of remote evaluation within academic environments. The system presents a robust and efficient solution for conducting online examinations, incorporating features such as real-time invigilation and intuitive administrative dashboards.
With smart resource utilization, the platform delivers consistent performance—even during periods of high user traffic—ensuring swift response for essential operations. Additionally, the CamProctor video surveillance mechanism enables the immediate identification of questionable conduct, thereby reinforcing the integrity of the examination process.
The system also produces comprehensive analytical data and exam reports, equipping academic staff with crucial insights regarding student behavior, participation, and overall performance. These insights support data-driven decision-making for academic planning and evaluation. Despite certain hurdles—such as ensuring equal access to digital infrastructure and managing privacy-related issues—the influence of the system on virtual learning environments is significant. By delivering a smooth, secure testing process and safeguarding assessment authenticity, this initiative stands out as a transformative solution in the landscape of digital education.
To summarize, the project has successfully created and deployed a reliable Online Examination Platform powered by CamProctor, offering scalability, security, and ease of use. Its solid performance and contribution to online learning underscore its relevance in today’s educational practices. Continued refinement and updates will further elevate its capacity to meet the shifting demands of digital assessments.
References
[1] X. Yang, D. Wu, X. Yi, J. H. M. Lee, and Tan Lee (2022) introduced iExam, an innovative system designed for online test supervision utilizing facial detection and identification technologies, published in Electrical Engineering and Systems Science.
[2] Jiyou Jia and Yunfan He (2021) presented the architecture, deployment, and initial testing of a smart invigilation platform for online exams in the journal Interactive Technology and Smart Education.
[3] A. Nigam, R. Pasricha, T. Singh, and P. Churi (2021) conducted a comprehensive review of artificial intelligence-supported remote proctoring systems, discussing their evolution and future directions in Education and Information Technologies, Vol. 26, No. 5.
[4] M. Labayen et al. (2021) proposed a student authentication and monitoring solution using multiple biometric methods, detailed in IEEE Access, Vol. 9, Pages 72398–72411.
[5] K. Garg, K. Verma, K. Patidar, and N. Tejra (2020) developed a virtual exam supervision solution leveraging convolutional neural networks, presented at the 4th International Conference on Intelligent Computing and Control Systems (ICICCS), IEEE.
[6] Proctoredu (2020) offers a platform for conducting reliable online assessments globally, accessible at: https://www.proctoredu.com (accessed October 4, 2020).
[7] Comprobo (2020) provides services for automated online exam invigilation, with more information available at: https://comprobo.co.uk (accessed October 4, 2020).
[8] M. Ghizlane, B. Hicham, and F. H. Reda (2019) proposed a continuous automated monitoring model for online examinations, discussed at the International Conference on Systems, Collaboration, Big Data, Internet of Things & Security.
[9] H. S. G. Asep and Y. Bandung (2019) designed a user verification mechanism for online proctoring in mobile learning environments, presented in the International Conference on Electrical Engineering and Informatics.