Authors: Shivam Kadam, Sohum Kulkarni, Saakshi Padamwar, Prof. Shubhangi Bhagat
Certificate: View Certificate
Machine learning based on categorical classification has integrated usages in a variety of fields like prediction, finance, supply chain management, sales and operations as well as product analytics. This study shows how Support Vector Machine Learning Model from the Supervised Learning sub-branch of Machine Learning predicts the suicidal intent of a person’s “tweet” on the social media platform ‘Twitter’. This model basically indicates whether the text tweeted or posted by a person may be suicidal or not. Regularised data set for Modeling is divided into test data set and training data set at the rate of 3:7. Machine Learning for this study needs to use Pandas, Numpy , Pillow, ScikitLearn, Textblob and Nltk frameworks. Massive amount of actual Twitter data is used as a dataset for the training and testing purpose so the model can analyse the text with maximum accuracy.
Machine Learning algorithms are used for the classification of large data into smaller chunks. We aim to order the extremity of the tweet where it is either suicidal or not. On the off chance that the tweet is sad but not suicidal then the model would identify it as not suicidal. Basically the more predominant estimation sentiment to be picked. Various machine learning algorithms can be used to extract the features from the data.
A Machine Learning model which can accurately identify the suicidal intent of a person just by taking input the ‘Tweets’ he/she made on the Twitter platform is the main objective. This application can be used by all the therapists to track the virtual behaviour of the patient and can treat him/her accordingly. There are many Sentiment Analysis or Text Analysis models readily available, but a model specially tailored to address the issue of suicidal patients is far from practically or commercially available in a ready-to-use state for the therapists/psychaitrists.
The following are the challenges of this project:
II. SYSTEM PLANNING
A. Literature Review
In the modern era where advancement has been carried out in various sectors including Industries, Technology and Hospitality but the workload towards the human brain has also been building day by day due to the massive work functioning and hectic daily routine.
The authors have taken some initial steps towards the solution of the human problems which are causing some serious damage towards their health and their balanced lifestyle, However the remedy includes not only the analysis of the thoughts but also their guidance throughout their important life phases with a constant support from external psychological community builders to make the audience feel positive during their low feelings.
Lastly the authors have gone through the internal aspects of the working model and made use of a recursive trial and error technique to make it more accurate towards the result with constant aim linked with maximum correct result with minimum time consumption.
III. FEASIBILITY STUDY
In this phase, the project's practicality is evaluated, and a business proposal is presented that outlines a broad plan for the project and provides some cost approximations. In the system analysis phase, the feasibility of the suggested system is assessed to ensure that it does not impose an excessive burden on the company. It is necessary to have a basic understanding of the primary system requirements to conduct a feasibility analysis.
Three key components of the feasibility analysis are
A. User Requirements
B. Non-functional Requirements
In Computer engineering and Requirement engineering, a non-functional requirement is a requirement that specifies criteria that can be used to judge the operation of a system, rather than specific aspects. They are into two opposite directions with functional requirements that define specific behavior or functions. Non-functional requirements add massive value to business development .It is commonly misled by a lot of people. It should be mandatory for business stakeholders, and Clients to clearly develop the requirements and their high expectations in measurable terminology. If the non-functional requirements are not scalable then they should be revised or interpreted again to gain better clarity. For example, User stories help in dissolving the gap between developers and the user stakeholders in Agile Methodology.
Prioritize the End users tweet as per the sentiment analysis terminology and then segmenting the tweets into diversified importance while simultaneously working with the dataset.
Reliability refers to the level of confidence in a system that is established over time through its use. It indicates the extent to which software can perform its intended functions without encountering errors or issues within a specific timeframe.The number of issues and bugs encountered while execution of the working model were fixed with the help of reliability behavioral testing and also through exclusive discussion panels throughout the functioning of the model. Your goal should be to create point to point dense algorithms for the machine learning model and which makes the model easy to implement and familiar to the user of the working directory.
Under what circumstances and at what specific peak times, such as during stress periods like the end of the month or during payroll disbursement, should system response times be measured from any point? Additionally, are there times when the load on the system will be abnormally high?
V. IMPLEMENTATION OF SYSTEM
A. Existing System
The increasing number suicides is not a new issue and many projects have been conducted and implemented in the past to address this issue and tried to find a practical solution which will contribute in eventually reducing the number of suicides. Most of these projects involve models which make use of Sentiment Analysis or Text Analysis to segregate the texts into negative and positive behaviours.
The major disadvantages of these models include
These models fail to find a difference between the tweets which are just sad and those which are actually suicidal and may prove the person to have a suicidal intent.
C. Proposed System
This research proposes a Machine Learning model which will identify the intent of a person who may or may not be having thoughts about committing suicide using the tweets he/she posts on Twitter. This model can be implemented practically by therapists to analyse the behavioural patterns of their patients and treat them with proper medications accordingly. A Support Vector Machine (SVM) technique is used for the classification purpose. It involves mapping the input data points into a higher dimensional feature space, where a hyperplane is then used to separate the classes or predict the target values. It aims to find the optimal hyperplane that maximizes the margin, which is the distance between the hyperplane and the closest data points from each class.
D. Advantages of Proposed System
VI. MODULE DESCRIPTION
A. Machine Learning and its Categories
Machine learning is a subfield of artificial intelligence that involves developing algorithms and statistical behavioural models that enable computer systems to inculcate from data and make decisions without being explicitly programmed. In understandable terms, machine learning involves using statistical techniques to enable Virtual Machines to improve their performance on a specific task module by learning from data.Machine learning algorithms are used in diversified applications, such as image and speech recognition, natural language processing, predictive analytics, and autonomous vehicles.To function, machine learning algorithms require three key components: data, models, and optimization. The data provides the algorithm with the information necessary to learn and make decisions. The model is the algorithm that is trained on the data, enabling the system to recognize patterns and make predictions. Optimization is the process of adjusting the model to improve its accuracy and effectiveness.The increasing amount of available data has made machine learning more critical in recent years. It has the potential to transform numerous industries, from healthcare and finance to manufacturing and transportation, by providing more accurate predictions and faster decision-making. However, it also raises significant ethical and social concerns, including privacy, bias, and job displacement, that require careful consideration and resolution.
B. Supervised Machine Learning
Supervised learning involves training a model on a labelled dataset, where the output is known for each input, while unsupervised learning involves training a model on an unlabeled dataset to discover patterns and structures in the data.Supervised learning is a commonly used machine learning algorithm that involves training a model on a labelled dataset. In supervised learning, the model is provided with input data and the corresponding correct output data, allowing it to learn the relationship between the two. Once the model is trained, it can be used to make predictions on new input data by applying the learned relationship to the new data. This makes supervised learning a useful tool for tasks such as classification and regression.In this example, the input data are the number of bedrooms, bathrooms, square footage, and location, while the output data is the actual selling price of the house. The model is trained using the labelled dataset, and once it has learned the relationship between the features and the selling price, it can be used to predict the selling price of new houses that it has not seen before.
C. Support Vector Machine (SVM)
A Support Vector Machine (SVM) is a sort of machine learning algorithm that is conventionally employed for grouping and regression analysis. It entails projecting the input data points into an augmented dimensional feature space, where a hyperplane is then utilised to segregate the categories or prognosticate the target values. The SVM aspires to seek out the most favourable hyperplane that maximises the gap, which is the interval between the hyperplane and the nearest data points from each category. The SVM is taught exploiting a set of labelled data and can subsequently be employed to foretell the labels or target values of recent, unlabeled data points. The formula has been broadly employed in numerous applications, comprising image identification, text classification, and bioinformatics. In the suggested framework, the Twitter data employed to train the SVM is amassed from bona fide sources (institutions) in which factual tweets from individuals who are grappling with anxiety or depression issues are employed, and a dichotomous target variable is attributed to each tweet, with 0 or 1 representing non-self-destructive or self-destructive, respectively.
VII. FUTURE WORK
This study has enormous possibilities in enhancing the techniques of premature recognition of an individual devising or contemplating suicide. There are numerous means to enhance the current model. Utilisation of Real-time Tweets can be performed by leveraging the Twitter Developer account to extract real-time data and scrutinise it.
One of the uses of this inquiry can culminate in producing a program for physicians/psychotherapists, wherein they can merely integrate their patients' Twitter accounts into the software to track everyday musings and trigger warnings if any tweet is vital. It can be employed for a vast-scale tracking of patients and also the precision of the model will rise as the corpus becomes more voluminous. This model can also be fused with different novel technologies like Artificial Neural Networks or Deep Learning methods.
Machine Learning algorithms like Supervised Learning approaches are highly valuable in resolving pragmatic issues. In this study, the primary aim is to establish how the suicidal intention of an individual can be detected by scrutinizing the tweet he/she disseminated through the utilization of the SVM model. The corpus of authentic tweets is utilised to achieve this objective.
 J. T. Fiquer, P. S. Boggio, and C. Gorenstein, “Talking bodies: Nonverbal behavior in the assessment of depression severity,” Journal of affective disorders, vol. 150, no. 3, pp. 1114–1119, 2013.  N. Cummins, S. Scherer, J. Krajewski, S. Schnieder, J. Epps, and T. F. Quatieri, “A review of depression and suicide risk assessment using speech analysis,” Speech Communication, vol. 71, pp. 10–49, 2015.  J. F. Cohn, T. S. Kruez, I. Matthews, Y. Yang, M. H. Nguyen, M. T. Padilla, F. Zhou, and F. De la Torre, “Detecting depression from facial actions and vocal prosody,” in Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on. IEEE, 2009, pp. 1–7.  L.-S. A. Low, N. C. Maddage, M. Lech, L. Sheeber, and N. Allen, “Influence of acoustic low-level descriptors in the detection of clinical depression in adolescents,” in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 5154–5157.  J. C. Mundt, P. J. Snyder, M. S. Cannizzaro, K. Chappie, and D. S. Geralts, “Voice acoustic measures of depression severity and treatment response collected via interactive voice response (ivr) technology,” Journal of neurolinguistics, vol. 20, no. 1, pp. 50–64, 2007.  S. Alghowinem, “From joyous to clinically depressed: Mood detection using multimodal analysis of a person’s appearance and speech,” in Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on. IEEE, 2013, pp. 648–654.  Y. Yang, C. Fairbairn, and J. F. Cohn, “Detecting depression severity from vocal prosody,” IEEE Transactions on Affective Computing, vol. 4, no. 2, pp. 142–150, 2013.  T. R. Almaev and M. F. Valstar, “Local gabor binary patterns from three orthogonal planes for automatic facial expression recognition,” in 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction. IEEE, 2013, pp. 356–361.  J. M. Girard, J. F. Cohn, M. H. Mahoor, S. M. Mavadati, Z. Hammal, and D. P. Rosenwald, “Nonverbal social withdrawal in depression: Evidence from manual and automatic analyses,” Image and vision computing, vol. 32, no. 10, pp. 641–647, 2014.
Copyright © 2023 Shivam Kadam, Sohum Kulkarni, Saakshi Padamwar, Prof. Shubhangi Bhagat. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.