Comparative Analysis of Text Data Streams in Social Networks


  • Maximilian Khotilin Samara State Aerospace University, Samara, 443086, Russia
  • Aleksandr Blagov Samara State Aerospace University, Samara, 443086, Russia


Big Data, Internet, Twitter, Real-time Data, Social Networks,


This article presents a research in the area of processing and analysis of text data streams in social networks. This work was conducted by means of methods and instruments of Big Data. The article details the process of working with data from social media, which starts with data collection, followed by an analysis, which subsequent ends with conclusions and hypotheses. Based on the examples of statements of the most popular sciences among users of social network Twitter, the distributions of these references taken from all over the world were developed, investigated and compared.


A., El-Hoiydi, and J.D., Decotignie. “WiseMAC: an ultra-low power MAC protocol for the downlink of infrastructure wireless

sensor networks,” in Ninth International Symposium on Computers

and Communications, Proceedings. ISCC, 2004, vol. 1, pp. 244–

W., Tan, M.B., Blake, I., Saleh, and S., Dustdar. “Social-networksourced big data analytics”, IEEE Internet Computing.2013. No. 5. pp. 62-69.

K., Semertzidis, E., Pitoura, P., Tsaparas, “How people describe themselves on Twitter”, Proceedings of the ACM SIGMOD Workshop on Databases and Social Networks. ACM, 2013.Pp. 25-30.

A., Blagov, I., Rytcarev, K., Strelkov, M., Khotilin, “Big Data

Instruments for Social Media Analysis”, Proceedings of the 5th

International Workshop on Computer Science and Engineering,

Pp. 179-184.

H., Wang, D., Can, A., Kazemzadeh, F., Bar, and S., Narayanan,

“A system for real-time twitter sentiment analysis of 2012 US presidential election cycle”, Proceedings of the ACL System

Demonstrations. Association for Computational Linguistics, 2012,.


H., Saif, Y., He, H., Alani. “Alleviating data sparsity for twitter

sentiment analysis”. CEUR Workshop Proceedings (CEUR-WS.

org), 2012.

R., Groot. “Data mining for tweet sentiment classification”, 2012.

D., Borthakur, J., Gray, J., Sarma, K., Muthukkaruppan, N.,

Spiegelberg, H., Kuang, and R., Schimdt. “Apache Hadoop goes

realtime at Facebook”, Proceedings of the ACM SIGMOD

International Conference on Management of Data., 2011, 1071-1080.

P. T., Goetz, B., O'Neill, Storm, “blueprints: Patterns for

distributed real-time computation”, Packt Publishing Ltd, 2014.

S., Wanderman-Milne, N., Li. “Runtime Code Generation in

Cloudera Impala”, IEEE Data Eng. Bull 2014, Т. 37, No. 1. 31-37.

J., Dean, S., Ghemawat. “MapReduce: simplified data processing on large clusters” Communications of the ACM. 2008. Т. 51. No. 1. pp. 107-113.

V. V., Rykov, B. Yu Itkin, “Mathematical statistics and design of

experiment”, Moscow: Russian State University of Oil and Gas

named by I.M. Gubkin, 2008.

Kolmogorov-Smirnov test [Electronic resource], Academician.

Mathematical encyclopedia, URL: (accessed:





How to Cite

Khotilin, M., & Blagov, A. (2017). Comparative Analysis of Text Data Streams in Social Networks. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 9(1-3), 21–25. Retrieved from