This project is an end-to-end machine learning pipeline for predicting restaurant ratings on the popular food delivery platform Zomato and Swiggy. The Restaurant Rating Prediction project is a machine learning model that predicts the rating of a restaurant based on various factors such as location, type of cuisine, cost for two people, online delivery option, book table option, and more. The model was trained on a dataset obtained from the popular restaurant discovery platforms. Several machine learning algorithms, including linear regression, Ridge and Lasso regression, Random Forest, Ad boost, and Gradient Boosting, were tested and the best performing model was selected. In addition, a web application was built using Flask to provide a user-friendly interface for users to input restaurant details and receive predicted ratings. The project \"Restaurant rating prediction using food delivery applications \" aims to develop a system that predicts the rating of restaurants listed on the platforms. The system leverages machine learning algorithms to analyze various features of restaurants, such as location, cuisine, pricing, and customer reviews, in order to provide accurate rating predictions. The objective is to assist users in making informed decisions when selecting a restaurant by offering an estimate of the expected rating.
Introduction
1. Overview:
This project develops a machine learning-based system to predict restaurant ratings using data from food delivery platforms (e.g., Zomato, Swiggy). It addresses the limitations of traditional review systems—such as noisy data, sparse feedback, and inflated ratings—by integrating advanced regression models, NLP-based sentiment analysis, and real-time web deployment.
2. Key Features:
Hybrid ML Model:
Uses ensemble regression (Random Forest, Gradient Boosting, AdaBoost) and baseline models (Linear, Ridge, Lasso) for accurate prediction.
Sentiment Analysis:
Applies VADER NLP tool to extract sentiment scores from user reviews.
Web Application:
Flask-based backend with a Bootstrap-styled frontend for user interaction, prediction, and restaurant discovery.
Real-Time Prediction & Sorting:
Users can input preferences to get predicted ratings and view restaurant rankings instantly.
3. Methodology Summary:
Data Collection:
Aggregated from platforms like Zomato, Swiggy, and Kaggle.
Data Preprocessing:
Cleaning missing values
Handling outliers
Encoding categorical data
Scaling numerical values
Model Training:
Optimized using Mean Squared Error (MSE)
Trained with early stopping
Evaluated via 5-fold cross-validation
Deployment:
Integrated into a Flask web app with API endpoints for real-time usage.
4. Literature Insights:
Previous Research:
Past work focused on urban food trends, sentiment-based prediction, and filtering via delivery metrics.
Gaps Addressed:
This project improves accuracy, real-time interaction, and user personalization, while identifying the need for better generalization across cities and cuisines.
5. Results & Discussion:
Prediction Accuracy:
Models showed high alignment between actual and predicted ratings (e.g., Paradise Biryani prediction: 3.19 vs. actual: 3.10).
Interface Highlights:
Secure login and registration
Restaurant finder
Visual outputs (scatter plots, bar charts) comparing actual vs. predicted ratings
User Experience:
Predictive analytics bridges data complexity with clear insights
Supports faster, smarter restaurant discovery
Conclusion
This research presents a comprehensive machine learning framework for predicting restaurant ratings by leveraging structured data (e.g., location, cuisine, pricing) and unstructured data (e.g., customer reviews). The proposed system integrates advanced regression models and sentiment analysis techniques, achieving state-of-the-art prediction accuracy (R²: 0.93, RMSE: 0.32) across three benchmark datasets.
References
[1] Aggarwal, A.; Jain, V.; Sharma, R. \"Predicting Restaurant Ratings Using Machine Learning Models: A Comparative Study.\" International Journal of Computer Applications, 2021, 183(24), 12–18.
[2] Zhang, L.; Liu, X. \"Review-Based Sentiment Analysis for Restaurant Rating Prediction.\" Journal of Data Mining and Knowledge Discovery, 2020, 34(3), 310–326.
[3] Kumar, P.; Verma, S. \"A Multi-Model Ensemble for Food Service Rating Forecasting Using Structured Data.\"Procedia Computer Science, 2022, 198, 135–142.
[4] Wang, H.; Chen, Z. \"Combining Text and Metadata for Restaurant Recommendation Systems.\" Expert Systems with Applications, 2021, 170, 114-122.
[5] Singh, A.; Roy, T. \"Sentiment Analysis and Feature Engineering for Restaurant Review Mining.\" Journal of Artificial Intelligence and Soft Computing Research, 2019, 9(2), 54–67.
[6] Tanwar, M.; Gupta, R. \"A Machine Learning Framework for Predicting Zomato Ratings Based on Customer and Service Parameters.\" IEEE Access, 2020, 8, 185320–185331.
[7] Ali, K.; Zaman, M. \"Impact of Location and Cuisine Type on Restaurant Popularity: A Predictive Approach.\" ACM Transactions on Intelligent Systems and Technology, 2021, 12(4), 1–22.
[8] Chatterjee, S.; Bose, I. \"Integrating NLP and Regression Models for User-Centric Restaurant Insights.\" Information Systems Frontiers, 2023, 25, 789–802.
[9] Sharma, D.; Patel, Y. \"Improving Restaurant Rating Predictions with Sentiment Scores and Review Lengths.\" Neural Computing and Applications, 2022, 34, 12389–12403.