Received by the editorial office
Bots Recognition in Social Networks Using the Random Forest Algorithm

M.G. Khachatrian 1,*, P.G. Klyucharev 1

1 Bauman Moscow State Technical University, Moscow, Russia

Keywords: Random Forest, Twitter, cross-validation, F1-metric, stratification

Online social networks are an essential communication tool for millions of people in the real world. However, they also serve as an arena of information warfare. One tool of information warfare is bots: software designed to simulate the behaviour of real users in online social networks.

The objective of this paper is to develop a model for recognizing bots in online social networks. The model was built with the Random Forest machine-learning algorithm. Since machine-learning algorithms require as much data as possible, the Twitter online social network, which is regularly used in studies on bot recognition, was chosen for the problem. To train and test the Random Forest algorithm, a dataset of Twitter accounts was used that contained more than 3,000 users and more than 6,000 bots. During training and testing, the hyper-parameters of the algorithm that yield the highest value of the F1-metric were determined. Python, which is frequently used for machine-learning problems, was chosen as the programming language.

To compare the developed model with the models of other authors, testing was performed on two Twitter account datasets, each consisting of half bots and half real users. On these datasets, F1-metric values of 0.973 and 0.923 were obtained. These values are quite high compared with those reported in the papers of other authors. The resulting model can thus recognize bots in the Twitter online social network with high accuracy.
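Two of the keywords above, the F1-metric and stratification, can be illustrated with a minimal sketch in plain Python. It is not the authors' code. The function names f1_score and stratified_folds, the label convention (1 = bot, 0 = real user), the fold count and the toy label list are assumptions made for the illustration. Only the rough 1:2 ratio of users to bots is taken from the abstract.

```python
import random
from collections import defaultdict


def f1_score(y_true, y_pred, positive=1):
    """F1 = 2 * precision * recall / (precision + recall) for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def stratified_folds(labels, k, seed=0):
    """Split sample indices into k folds that keep the overall class ratio."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class out round-robin so every fold gets its share.
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds


# Hypothetical toy labels: 30 real users (0) and 60 bots (1), i.e. about 1:2.
labels = [0] * 30 + [1] * 60
folds = stratified_folds(labels, k=5)
bots_per_fold = [sum(labels[i] for i in fold) for fold in folds]

# Hypothetical predictions: 2 true positives, 1 false positive, 1 false negative.
example_f1 = f1_score([1, 1, 1, 0, 0], [1, 1, 0, 1, 0])
```

In this toy setting every fold receives the same number of bots, so a classifier evaluated fold by fold sees the same class balance each time. A library implementation of Random Forest training and hyper-parameter search would normally replace the hand-written helpers.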