Sentiment Analysis and Opinion Mining within Social Networks using Konstanz Information Miner
Keywords:
Konstanz Information Miner (KNIME), Opinion Mining, Sentiment Analysis,Abstract
Evaluations, opinions, and sentiments have become very obvious due to rapid emerging interest in ecommerce which is also a significant source of expression of opinions and analysis of sentiment. In this study, a general introduction on sentiment analysis, steps of sentiment analysis, sentiments analysis applications, sentiment analysis research challenges, techniques used for sentiment analysis, etc., were discussed in detail. With these details given, it is hoped that researchers will engage in opinion mining and sentiment analysis research to attain more successes correlated to these issues. The research is based on data input from web services and social networks, including an application that performs such actions. The main aspects of this study are to statistically test and evaluate the major social network websites: In this case Twitter, because it is has rich data source and easy within social networks tools. In this study, firstly a good understanding of sentiment analysis and opinion mining research based on recent trends in the field is provided. Secondly, various aspects of sentiment analysis are explained. Thirdly, various steps of sentiment analysis are introduced. Fourthly, various sentiment analysis, research challenges are discussed. Finally, various techniques used for sentiment analysis are explained and Konstanz Information Miner (KNIME) that can be used as sentiment analysis tool is introduced. For future work, recent machine learning techniques including big data platforms may be proposed for efficient solutions for opinion mining and sentiment analysisReferences
M. R. Saleh, M.T. Martín-Valdivia, A. Montejo-Ráez, L.A. UreñaLópez, Experiments with SVM to classify opinions in different domains, Expert Systems with Applications 38 (2011) 14799–14804.
W. Medhat et al. Sentiment analysis algorithms and applications: A survey, Ain Shams Eng J (2014), http:// dx.doi.org/10.1016/j.asej.2014.04.011
A. Montoyo, P. Martínez-Barco, A. Balahur, Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments, Decision Support Systems 53 (2012) 675–679.
Y.M. Li, T.-Y. Li, Deriving market intelligence from microblogs, Decision Support Systems 55 (2013) 206–217
D. Kang, Y. Park, Review-based measurement of customer satisfaction in mobile service: Sentiment analysis and VIKOR approach. Expert Systems with Applications (2013), http://dx.doi.org/10.1016/j.eswa.2013.07.101
J. Bollen, H. Mao, X. Zeng, Twitter mood predicts the stock market, Journal of Computational Science 2 (2012) 1-8
O. Popescu, and C. Strapparava, Time corpora: Epochs, opinions and changes, Knowledge-Based Systems 69 (2014): 3-13.
B. Pang, L. Lee, Opinion mining and sentiment analysis, Foundations and Trends in Information Retrieval 2 (2008) 1–135
H. Tang, S. Tan, X. Cheng, A survey on sentiment detection of reviews, Expert Systems with Applications 36 (2009) 10760–10773.
D.E. O'Leary, Blog mining-review and extensions: “From each according to his opinion”, Decision Support Systems 51 (2011) 821–830.
A. Montoyo, P. Martínez-Barco, A. Balahur, Subjectivity and
sentiment analysis: An overview of the current state of the area and envisaged developments, Decision Support Systems 53 (2012) 675–679.
M. Tsytsarau, T. Palpanas, Survey on mining subjective data on the web, Data Min Knowl Disc (2012) 24:478–514, DOI
1007/s10618-011-0238-6
E. Cambria, B. Schuller, Y. Xia, C. Havasi, New Avenues in Opinion Mining and Sentiment Analysis, Knowledge-Based Approaches to Concept-Level Sentiment Analysis, IEEE Intelligent Systems, 2013.
R. Feldman, Techniques and Applications for Sentiment Analysis, Review Articles, Communications of the ACM, Vol. 56 No. 4, Pages 82-89, April 2013.
B. Liu, Sentiment analysis and opinion mining, Morgan and Claypool publishers, May 2012.
W. Medhat et al. Sentiment analysis algorithms and applications: A survey, Ain Shams Eng J (2014), http://
dx.doi.org/10.1016/j.asej.2014.04.011
J. Bollen, H. Mao, X. Zeng, Twitter mood predicts the stock market, Journal of Computational Science 2 (2012) 1-8
Y.-M. Li, Y.-L. Shiu, A diffusion mechanism for social advertising over microblogs, Decision Support Systems 54 (2012) 9–22
J. Du et al., Box office prediction based on microblog. Expert Systems with Applications (2013),
http://dx.doi.org/10.1016/j.eswa.2013.08.065
T. Nasukawa, J. Yi, Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the KCAP-03, 2nd Intl. Conf. on Knowledge Capture (2003).
K. Dave, S. Lawrence, D. M. Pennock, Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of International Conference on World Wide Web (WWW- 2003) (2003).
V. Hatzivassiloglou, J. L. Klavans, M. L. Holcombe, R. Barzilay, M.-Y. Kan, K. R. McKeown, Simfinder: A flexible clustering tool or summarization. In Proceedings of the Workshop on Summarization in NAACL-01. (2001).
J. Wiebe, R. F. Bruce, T. P. O'Hara, Development and use of a goldstandard data set for subjectivity classifications. In Proceedings of the Association for Computational Linguistics (ACL-1999). (1999).
A. Montoyo, P. Martínez-Barco, A. Balahur, Subjectivity and
sentiment analysis: An overview of the current state of the area and envisaged developments, Decision Support Systems 53 (2012) 675–679
S. Poria, A. Gelbukh, A. Hussain, N. Howard, D. Das, S.
Bandyopadhyay, Enhanced SenticNet with Affective Labels for ConceptBased Opinion Mining, Knowledge-Based Approaches to
Concept-Level Sentiment Analysis, IEEE Intelligent Systems, (2013)
M. Godsay, The process of sentiment analysis: a study. International Journal of Computer Applications 127, 7 (2015), 26-30
S. Kumar, F. Morstatter, H. Liu, Twitter Data Analytics, August 19, 2013, Springer.
F.H. Khan et al., TOM: Twitter opinion mining framework using hybrid classification scheme, Decision Support Systems (2013), http://dx.doi.org/10.1016/j.dss.2013.09.004.
E. Kontopoulos, C. Berberidis, T. Dergiades, N. Bassiliades,
Ontology-based sentiment analysis of twitter posts, Expert systems with Applications 40 (2013) 4065–4074.
H. Bao, Q. Li, S. S. Liao, S. Song, H. Gao, A new temporal and social PMF-based method to predict users' interests in micro-blogging, Decision Support Systems 55 (2013) 698–709.
W. Li, H. Xu, Text-based emotion classification using emotion cause extraction. Expert Systems with Applications (2013),
http://dx.doi.org/10.1016/j.eswa.2013.08.073
W. Medhat et al. Sentiment analysis algorithms and applications: A survey, Ain Shams Eng J (2014), http://
dx.doi.org/10.1016/j.asej.2014.04.011
S. Wang, D. Li, X. Song, Y. Wei, H. Li, A feature selection method based on improved Fisher’s discriminant ratio for text sentiment classification, Expert Systems with Applications 38 (2011) 8696–8702.
G. Vinodhini, R. M. Chandrasekaran, Opinion mining using principal component analysis based ensemble model for e-commerce application, CSI Transactions on ICT (2014): 1-11.
A. Abbasi, H. Chen, A. Salem, Sentiment Analysis in Multiple Languages: Feature Selection for Opinion Classification in Web Forums, ACM Transactions on Information Systems, Vol. 26, No. 3, Article 12, Publication date: June 2008.
B. Pang, L. Lee, S. Vaithyanathan, Thumbs up? Sentiment
classification using machine learning techniques, Proceedings of the ACL-02 conference on empirical methods in natural language
processing (Vol. 10, pp. 79–86). Association for Computational
Linguistics, 2002.
B. Pang, L. Lee, A sentiment education: Sentiment analysis using subjectivity summarization based on minimum cuts, in: Proceedings of the 42nd annual meeting on Association for Computational Linguistics (p. 271), 2004, July.
J. Liu, Yunbo Cao, Chin-Yew Lin, Yalou Huang, and Ming Zhou. Low-quality product review detection in opinion summarization. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL-2007). 2007.
B. O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In Proceedings of the International AAAI Conference on Weblogs and Social Media (ICWSM 2010). 2010.
A. Tumasjan, Timm O. Sprenger, Philipp G. Sandner, and Isabell M. Welpe. Predicting elections with twitter: What 140 characters reveal about political sentiment. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM-2010). 2010.
Lu´ıs Cabral and Ali Hortac¸su. The dynamics of seller reputation: Theory and evidence from eBay. Working paper, downloaded version revised in March, 2006. URL
http://pages.stern.nyu.edu/lcabral/workingpapers/CabralHortacsu_M
ar06.pdf.
J. Tatemura, Virtual reviewers for collaborative exploration of movie reviews. In Proceedings of Intelligent User Interfaces (IUI), pages 272–275, 2000.
Ellen Spertus. Smokey: Automatic recognition of hostile messages. In Proceedings of Innovative Applications of Artificial Intelligence (IAAI), pages 1058–1065, 1997.
X. Jin, Y. Li, T. Mah, J. Tong, Sensitive webpage classification for
content advertising. In Proceedings of the International Workshop on
data Mining and Audience Intelligence for Advertising, 2007.
Ellen Riloff, Janyce Wiebe, and William Phillips. Exploiting subjectivity classification to improve information extraction. In Proceedings of AAAI, pages 1106–1111, 2005
L. V. Lita, A. H. Schlaikjer, W. Hong, E. Nyberg, Qualitative dimensions in question answering: Extending the definitional QA task. In Proceedings of AAAI, pages 1616–1617, 005. Student abstract.
M. Laver, K. Benoit, J. Garry, Extracting policy positions from political texts using words as data. American Political Science Review, 97(2):311–331, 2003.
T. Mullen, R. Malouf, A preliminary investigation into sentiment analysis of informal political discourse. In AAAI Symposium on Computational Approaches to Analyzing Weblogs (AAAICAAW), pages 159–162, 2006.
N. Kwon, S. Shulman, E. Hovy, Multidimensional text analysis for eRulemaking. In Proceedings of Digital Government Research (dg.o), 2006.
J. G. Conrad, F. Schilder, Opinion mining in legal blogs. In
Proceedings of the International Conference on Artificial Intelligence and Law (ICAIL), pages 231–236, New York, NY, USA, 2007. ACM
E. Rogers, Diffusion of Innovations. Free Press, New York, 1962. ISBN 0743222091. Fifth edition dated 2003.
C. Yubo, J. Xie, Online consumer review: Word-of-mouth as a new element of marketing communication mix. Management Science, 2008. 54(3): p. 477-491.
W. Theresa, J. Wiebe, R. Hwa, Just how mad are you? Finding strong and weak opinion clauses. In Proceedings of National Conference on Artificial Intelligence (AAAI-2004). 2004.
M. Hu, B. Liu, Mining and summarizing customer reviews. In Proceedings of ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (KDD-2004). 2004.[55] Z. Lei, B. Liu, Aspect and entity extraction for opinion mining, Data
mining and knowledge discovery for big data. Springer Berlin
Heidelberg, 1-40, 2014.
N. Jindal, B. Liu, Opinion spam and analysis. In Proceedings of the Conference on Web Search and Web Data Mining (WSDM-2008). 2008
N. Preslav, et al., SemEval-2016 task 4: Sentiment analysis in Twitter, In Proceedings of the 10th international workshop on semantic evaluation (SemEval 2016), San Diego, US (forthcoming). 2016.
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: sentiment classification using machine learning techniques. In ACL Conference on Empirical Methods in Natural Language Processing, pages 79–86, 2002.
Y. R. Tausczik and J. W. Pennebaker. The psychological meaning of words: Liwc and computerized text analysis methods. Journal of Language and Social Psychology, 29(1):24–54, 2010.
J. Golbeck, Negativity and anti-social attention seeking among narcissists on Twitter: A linguistic analysis, First Monday (2016).
] J. Park, V. Barash, C. Fink, and M. Cha. Emoticon style:
Interpreting differences in emoticons across cultures. In International
AAI Conference on Weblogs and Social Media (ICWSM), 2013.
P. S. Dodds and C. M. Danforth. Measuring the happiness of largescale written expression: songs, blogs, and presidents. Journal of Happiness Studies, 11(4):441– 456, 2009.
E. Cambria, R. Speer, C. Havasi, and A. Hussain. Senticnet: A publicly available semantic resource for opinion mining. In AAAI Fall Symposium Series, 2010.
A. Mihanović, H. Gabelica, Ž. Krstić, Big Data and Sentiment Analysis using KNIME: Online Reviews vs. Social Media, In 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), 1464–1468, 2014.
Downloads
Published
How to Cite
Issue
Section
License
TRANSFER OF COPYRIGHT AGREEMENT
The manuscript is herewith submitted for publication in the Journal of Telecommunication, Electronic and Computer Engineering (JTEC). It has not been published before, and it is not under consideration for publication in any other journals. It contains no material that is scandalous, obscene, libelous or otherwise contrary to law. When the manuscript is accepted for publication, I, as the author, hereby agree to transfer to JTEC, all rights including those pertaining to electronic forms and transmissions, under existing copyright laws, except for the following, which the author(s) specifically retain(s):
- All proprietary right other than copyright, such as patent rights
- The right to make further copies of all or part of the published article for my use in classroom teaching
- The right to reuse all or part of this manuscript in a compilation of my own works or in a textbook of which I am the author; and
- The right to make copies of the published work for internal distribution within the institution that employs me
I agree that copies made under these circumstances will continue to carry the copyright notice that appears in the original published work. I agree to inform my co-authors, if any, of the above terms. I certify that I have obtained written permission for the use of text, tables, and/or illustrations from any copyrighted source(s), and I agree to supply such written permission(s) to JTEC upon request.