Towards Prediction of Election Outcomes Using Social Media

Full Text (PDF, 648KB), PP.20-28

Views: 0 Downloads: 0


Vinay K. Jain 1,* Shishir Kumar 1

1. Department of CSE, Jaypee University of Engineering and Technology, Guna, 473236, India

* Corresponding author.


Received: 4 Mar. 2017 / Revised: 11 May 2017 / Accepted: 8 Jun. 2017 / Published: 8 Dec. 2017

Index Terms

Sentiment analysis, Opinion mining, Twitter, Text classification, Feature Selection, Latent Dirichlet allocation(LDA)


Exploiting social media data by extracting key information from it is one of the great challenges in data mining and knowledge discovery. Every election campaign has an online presence of voters which uses these social media platform to express their sentiments and opinions towards political parties, leaders and important topics. This paper presents a novel data collection technique for prediction of election outcomes and a topic modeling method for extracting topics. Data collection technique used RSS (Rich Site Summary) feeds of news articles and trending keywords from Twitter simultaneously and constructed an intelligent prediction model based primarily on the volume of tweets and sentiment of users. This paper effort to improve electoral predictions using social media data based dynamic keyword methodology.
Different techniques for electoral prediction based on social media data has been investigated based on existing literature and isolate the factors which improve our methodology. Meaningful inferences such as the popularity of leaders and parties during different intervals, trending issues, and important factors are extracted from the data set. The election outcomes are compared with traditional methods used by survey agencies for exit polls and validation of results showed that social media data can predict with better accuracy. The research has identified that data collection technique and timing play an important role in yielding better accuracy in predicting outcomes and extracting meaningful inferences.

Cite This Paper

Vinay K. Jain, Shishir Kumar, "Towards Prediction of Election Outcomes Using Social Media", International Journal of Intelligent Systems and Applications(IJISA), Vol.9, No.12, pp.20-28, 2017. DOI:10.5815/ijisa.2017.12.03


[1]Vinay K Jain, Shishir Kumar, "Big Data Analytic Using Cloud Computing", in 2nd IEEE International Conference on Advances in Computing and Communication Engineering (ICACCE 2015), Dehradun, pp.667-672, 2015.
[2]B. Pang and L. Lee, “Opinion mining and sentiment analysis”, Foundations and Trends in Information Retrieval, vol.2, pp.1–135, 2008.
[3]Vinay K Jain, Shishir Kumar, “An Effective Approach to Track Levels of Influenza-A (H1N1) Pandemic in India Using Twitter”, Procedia Computer Science, Vol.70, pp.801-807, 2015.
[4]Vinay K Jain, Shishir Kumar, “A Novel Approach to Track Public Emotions Related to Epidemics In Multilingual Data”, in 2nd International Conference and Youth School Information Technology and Nanotechnology (ITNT 2016),Russia,pp.883-889,May 2016.
[5]Vinay K Jain, Shishir Kumar, "Extraction of Emotions from Multilingual Text Using Intelligent Text Processing and Computational Linguistics" Journal of Computational Science, 2017.(doi: 10.1016/j.jocs.2017.01.010).
[6]M.Anjaria,, R.M.Guddeti, “Influence Factor Based Opinion Mining of Twitter Data Using Supervised Learning”, in Proceeding of Communication Systems and Networks [COMSNETS], 2014 Sixth International Conference ,Banglore, India,pp.1-6,2014.
[7]M.Arias ., A. Arratia, and R. Xuriguera “Forecasting with Twitter Data” ACM Transactions on Intelligent Systems and Technology, Vol. 5, Article no.8,2012.
[8]N.Li,,D. Wu,“Using text mining and sentiment analysis for online forums hotspot detection and forecast”, Decision Support Systems, vol.48, pp.354–368,2010.
[9]H.Wang, C.Dogan,A. Kazemzadeh,F. Bar & S.Narayanan, “A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle”, in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, pp.115–120,2012.
[10]Delhi Assembly Election, “Delhi Legislative Assembly election 2015”, available at: http://en. wikipedia org /wiki /Delhi_Legislative_Assembly_election,_2015[accessed 15-March-2016].
[11]F.Matthew and S.Dutta “Obama’s win means future elections must be fought online”, available at:,2008.[accessed 23-Feb-2015]
[12]F.Wigand,L. Dianne, “ Tweets and retweets: Twitter takes wing in government”, Information Polity, vol.16, pp.215-224,2011.
[13]W.Wang,D. Rothschild ,S.Goel,A. Gelmana“ Forecasting elections with non-representative polls”, International Journal of Forecasting, vol.31,pp.980-991,2015.
[14]K.Grosse, M.P. González, C.I. Chesñevar,A.G. Maguitman, A.G., “Integrating argumentation and sentiment analysis for mining opinions from Twitter, AI Communications, Vol.28, pp.387-401,2015.
[15]S.P. Robertson, M. Douglas, B. Maruyama, “Political discourse on social networking sites: Sentiment, in-group/out-group orientation and rationality”, Information Polity, Vol. 18, No. 2,pp. 107-126,2013.
[16]L.A.Hughes and L.Palen “Twitter adoption and use in mass convergence and emergency events”, International Journal of Emergency Management, pp. 248-260, 2009.
[17]A. Jungherr,P. Jrgens and H. Schoen, “Why the Pirate Party Won the German Election of 2009”, Social Science Computer Review, Vol.30,pp.229-234,2012.
[18]F.A.Nooralahzadeh, “2012 Presidential Elections on Twitter--An Analysis of How the US and French Election were reflected in Tweets”, in 19th International Conference on Control Systems and Computer Science [CSCS], Bucharest, pp.240-248,2013.
[19]B. O. Connor, R. Balasubramanyan, B. R.Routledge and N. A.Smith , “From tweets to polls: Linking text sentiment to public opinion time series” in: Proceedings of the International AAAI Conference on Weblogs and Social Media, Washington, DC, May 2010.
[20]A. Tumasjan, T.O. Sprenger, P.Sandner, and I.Welpe, “Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment” in International AAAI Conference on Weblogs and Social Media 2010.
[21]D. J.Hopkins and G King, “A Method of Automated Nonparametric Content Analysis for Social Science” American Journal of Political Science, vol.54, pp.229–247,2010.
[22]N. A.Diakopoulos, D. A.Shamma , “Characterizing debate performance via aggregated twitter sentiment” in: CHI '10 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems ,pp. 1195-1198,2010.
[23]M.Choy , L. F. M.Cheong , M. N.Liak, K. P.Shung , “A sentiment analysis of Singapore Presidential Election 2011 using Twitter data with census correction” Research Collection School Of Information Systems ,2012.
[24]D.Gayo-Avello , “Don't turn social media into another'literarydigest'poll” in: Communications of the ACM vol.54, pp 121-128,2011.
[25]A.Bermingham and A. F. Smeaton , “On using Twitter to monitor political sentiment and predict election results”in International Joint Conference for Natural Language Processing (IJCNLP), Chiang Mai, Thailand,pp.56-63,2011.
[26]P. T. A. Metaxas, “How (not) to predict elections: in IEEE third international conference on social computing (SocialCom) ,pp. 165-171,2011.
[27]M. D. Conover, B.Goncalves, J.Ratkiewicz, A.Flammini, and F.Menczer , “Predicting the political alignment of twitter users”in IEEE third international conference on social computing (socialcom) ,pp. 192-199,2011.
[28]A. D. Maynard and A Funk, “Automatic detection of political opinions in Tweets: in ESWC 2011 Workshops, Heraklion, Greece,pp 88-99,2011
[29]E. T. Sang, “Predicting the 2011 dutch senate election results with twitter”in Workshop on Semantic Analysis in Social Media, 2012,pp. 53-60.
[30]A. O. Larsson and M Hallvard“Studying political microblogging: Twitter users in the 2010 Swedish election campaign” New Media & Society, vol.14,729-747,2012.
[31]M. C. Choy , “US Presidential Election 2012 Prediction using Census Corrected Twitter Model” Computers and Society,2012.
[32]M Skoric , N Poor, P Achananuparp, E P Lim, , and J Jiang, “Tweets and votes: A study of the 2011 singapore general election”in System Science (HICSS), 45th Hawaii International Conference on System Science IEEE,pp. 2583-2591,2012.
[33]E.Tjong , K Sang, J Bos, Predicting the 2011 Dutch Senate Election Results with Twitter” in EACL 2012Workshop on Semantic Analysis in Social Networks, Avignon, France,2012.
[34]F .B. Marquez, D Gayo-Avello, M Mendoza and B Poblete “Opinion Dynamics of Elections in Twitter “in LA-WEB '12 Proceedings of the 2012 Eighth Latin American Web Congress,pp. 32-39,2012.
[35]J. M.Soler, F Cuartero, M Roblizo, “Twitter as a Tool for Predicting Elections Results” in ASONAM, IEEE/ACM International Conference, 2012, pp.1194 – 1200,2012.
[36]H. Wang, C. Dogan, A. Kazemzadeh ,F. Bar and S. Narayanan, “A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle” in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, pp. 115–120,2012.
[37]D. Contractor, T. A.Faruquie “Understanding Election Candidate Approval Ratings Using Social Media Data, in WWW 2013 Companion, Rio de Janeiro, Brazil,2013.
[38]M. P. Cameron, Can Social Media Predict Election Results? Evidence from New Zealand, Working Paper in Economics 13/08, University of Waikato, 2013.
[39]A. C. Ceron, “Every tweet counts? How sentiment analysis of social media can improve our knowledge of citizens’ political preferences with an application to Italy and France” New Media & Society, vol. 16, 340-358, 2013.
[40]M. S. Gaurav, “Leveraging candidate popularity on Twitter to predict election outcome” in Proceedings of the 7th Workshop on Social Network Mining and Analysis.(ACM) 2013.
[41]F .T. Wong, “Media pundits and the US presidential election: Quantifying political leanings from tweets” in Proceedings of the International Conference on Weblogs and Social Media, 2013.
[42]E. Sanders , “Relating Political Party Mentions on Twitter with Polls and Election Results” in Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval,pp. 68-71,2013.
[43]C. B. Fink, “Twitter, Public Opinion, and the 2011 Nigerian Presidential Election” in SocialCom, International Conference on Social Computing IEEE,pp.311-320,2013.
[44]H. D. Kim, M. Castellanos , M. Hsu , C. X. Zhai, T. Rietz , D. Diermeier , “Mining Causal Topics in Text Data: Iterative Topic Modeling with Time Series Feedback”, in CIKM’13, San Francisco, CA, USA,2013.
[45]A. Bakliwal, J. Foster, J. V.Puil, R. O’Brien, L .Tounsi and M. Hughes, “Sentiment Analysis of Political Tweets: Towards an Accurate Classifier” in: LASM, Workshop on Language in Social Media ,Atlanta, Georgia,pp. 49–58,2013.
[46]N. A.Thapen, M MGhanem, “Towards Passive Political Opinion Polling using Twitter” in BCS SGAI Workshop on Social Media Analysis , Cambridge, UK.2013.
[47]C.Vaccari, A Valeriani , P.Barberá, R.Bonneau “Social media and political communication: a survey of Twitter users during the 2013 Italian general election” Rivistaitaliana di scienzapolitica, vol.43,pp. 325-355,2013.
[48]M.Vergeer, L Hermans, “Campaigning on Twitter: Microblogging and Online Social Networking as Campaign Tools in the 2010 General Elections in the Netherlands” Journal of Computer-Mediated Communication ,vol.18 ,pp. 399–419,2013.
[49]F. M. Wong, “Quantifying Political Leaning from Tweets and Retweets” in ICWSM,2013.
[50]M. Song , C M Kim, Y K Jeong, “Analyzing the Political Landscape of 2012 Korean Presidential Election in Twitter”,IEEE Intelligent Systems ,vol.1,pp.1541-1672,2013.
[51]A. R.Makazhanov, “Predicting political preference of Twitter users” Social Network Analysis and Mining, vol.1, pp.1-15,2014.
[52]G.Ifrim, B. Shi, I.Brigadir , “Event Detection in Twitter using Aggressive Filtering and Hierarchical Tweet Clustering”,in SNOW 2014 Data Challenge, Seoul, Korea, 2014.
[53]P.Mehndiratta , S Sachdeva, P Sachdeva, Y Sehgal “Elections Again, Twitter May Help!!! A Large Scale Study for Predicting Election Results Using Twitter” in Third International Conference, BDA 2014, New Delhi, India,pp. 133-144,2014.
[54]P.Barbera and G.Rivero, “Understanding the Political Representativeness of Twitter Users”, Social Science Computer Review, vol.1, pp.56-67, 2014.
[55]C.Vaccari, A Valeriani, P Barberá, R Bonneau, J T Jost, J Nagler and J A Tucker , “Political Expression and Action on Social Media: Exploring the Relationship Between Lower- and Higher-Threshold Political Activities Among Twitter Users in Italy” Journal of Computer-Mediated Communication,vol.20,pp. 221–239.2015.
[56]Elvyna Tunggawan,Yustinus Eko Soelistio, “And the Winner is ...: Bayesian Twitter-based Prediction on 2016 U.S. Presidential Election” ,International Conference on Computer, Control, Informatics and its Applications.pp.33-37,2016.
[57]B. Narendra, Mr. K. Uday Sai, Mr. G. Rajesh, Mr. K. Hemanth, Mr. M. V. Chaitanya Teja, Mr. K. Deva Kumar “Sentiment Analysis on Movie Reviews: A Comparative Study of Machine Learning Algorithms and Open Source Technologies” I.J. Intelligent Systems and Applications, vol.8, pp.66-70,2016 .
[58]Pooja Wadhwa, M.P.S Bhatia, “Discovering Hidden Networks in On-line Social Networks”, I.J. Intelligent Systems and Applications, vol.5, pp.44-54, 2014.
[59]David M. Blei, Latent Dirichlet Allocation, Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.
[60]Mayy M. Al-Tahrawi, “Arabic Text Categorization Using Logistic Regression” I.J. Intelligent Systems and Applications, vol.6,pp. 71-78,2015.
[61]Rasim M. Alguliyev,Ramiz M. Aliguliyev and Irada Y. Alakbarova, “Extraction of Hidden Social Networks from Wiki-Environment Involved in Information Conflict”, I.J. Intelligent Systems and Applications, vol. 2,pp. 20-27,2016.
[62]Y Bao, C Quan,L Wang,F. Ren , “The Role of Pre-processing in Twitter Sentiment Analysis”, in ICIC,Taiyuan, China,pp.615-624,2014.
[63]Indrajit Mukherjee,Jasni M Zain,P. K. Mahanti, “An Automated Real-Time System for Opinion Mining using a Hybrid Approach”,I.J. Intelligent Systems and Applications, vol.7,p. 55-64,2016.
[64]Ibrahim S. I. Abuhaiba,Hassan M. Dawoud, “Combining Different Approaches to Improve Arabic Text Documents Classification” I.J. Intelligent Systems and Applications, vol.4,pp. 39-52,2017.