A Framework for Development of Recommender System for Financial Data Analysis

Full Text (PDF, 893KB), PP.18-27

Views: 0 Downloads: 0


Pradeep Kumar M. Kanaujia 1,* Manjusha Pandey 1 Siddharth Swarup Rautaray 1

1. School of Computer Engineering, KIIT University, Bhubaneswar - 751024, Orissa, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijieeb.2017.05.03

Received: 10 Apr. 2017 / Revised: 15 May 2017 / Accepted: 11 Jun. 2017 / Published: 8 Sep. 2017

Index Terms

Big Data, Recommender system, Apache Hadoop, Apache Mahout


The huge amount of data is being created every day by various organisations and users all over the world. Structured, semi-structured and unstructured data is being created at a very rapid speed from heterogeneous sources like reviews, ratings, feedbacks, shopping details, etc., it is termed as Big Data. This data generated from different users share many common patterns which can be filtered and analysed to give some recommendation regarding the product, goods or services in which a user is interested. Recommendation systems are the software tools used to give suggestions to users on the basis of their requirements. Today no system is available for suggesting a person on how to use their money for saving, where to invest and how to manage expenditures. Few consulting systems are available which provide investment and saving tips but they are not much effective and are much complex. The presented paper proposed a collaborative filtering based recommender system for financial analysis based on Saving, Expenditure and Investment using Apache Hadoop and Apache Mahout. Many savings and investment consulting systems are available but no system provides effective and efficient recommendation regarding management and beneficial utilisation of salary. The advantage of proposed recommender system is that it provides better suggestion to a person for saving, expenditure and investment of their salary which in turns maximises their wealth. Due to enormous amount of data involved, Apache Hadoop framework is used for distributed processing. Collaborative filtering and Apache Mahout is used for analysing the data and implementation of the recommender system.

Cite This Paper

Pradeep Kumar M. Kanaujia, Manjusha Pandey, Siddharth Swarup Rautaray, "A Framework for Development of Recommender System for Financial Data Analysis", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.9, No.5, pp. 18-27, 2017. DOI:10.5815/ijieeb.2017.05.03


[1]What is big data? In: Big Data Now: 2012 Edition, 1st edn., p. 3. O'Reilly Media, Inc, 1005 Gravenstein Highway North, Sebastopol, CA 95472 (2012)
[2]Gartner IT Glossary Big Data. http://www.gartner.com/it-glossary/big-data/
[3]M. A. u. d. Khan, M. F. Uddin and N. Gupta, "Seven V's of Big Data understanding Big Data to extract value," American Society for Engineering Education (ASEE Zone 1), 2014 Zone 1 Conference of the, Bridgeport, CT, 2014, pp. 1-5.
[4]van der Aalst, W.M.: Process cubes: Slicing, dicing, rolling up and drilling down event data for process mining. In: Asia-Pacific Conference on Business Process Management, pp. 1-22 (2013). Springer
[5]Gray, J.: Data management: Past, present, and future. IEEE 13 (1996)
[6]Chandarana, P., Vijayalakshmi, “Big data analytics frameworks”, 2014 IEEE International Conference On Circuits, Systems, Communication and Information Technology Applications (CSCITA), pp. 430-434, 2014.
[7]Thomas Erl, W.K., Buhler, P., Big Data Fundamentals Concepts, Drivers and Techniques, 1st ed., pp. 29, USA: Prentice Hall, 2015.
[8]Thorat, Poonam B., R. M. Goudar, and Sunita Barve. "Survey on collaborative filtering, content-based filtering and hybrid recommendation system," International Journal of Computer Applications 110.4, 2015.
[9]Francesco Ricci and Lior Rokach and Bracha Shapira, “Introduction to Recommender Systems Handbook, Springer, pp. 135, 2011.
[10]J. B. Schafer, D. Frankowski, et al.,“Collaborative filtering recommender systems”, The Adaptive Web, pp. 291–324, 2007.
[11]G. Linden, B. Smith and J. York, “Amazon.com recommendations: item-to-item collaborative filtering,” in IEEE Internet Computing, vol. 7, no. 1, pp. 76-80, Jan/Feb 2003.
[12]X. Luo, Y. Xia, Q. Zhu, “Applying the learning rate adaptation to the matrix factorization based collaborative filtering”, Knowledge Based Systems 37, pp. 154–164, 2013.
[13]I. Markovsky, “Low-Rank Approximation: Algorithms, Implementation, Applications,” Springer, 2012.
[14]B. Sarwar, G. Karypis, J. Konstan, J. Riedl, “Application of dimensionality reduction in recommender system – a case study,” ACM WebKDD Workshop, 2000b, pp. 264–272.
[15]Takács, G.; Pilászy, I.; Németh, B.; Tikk, D., "Scalable Collaborative Filtering Approaches for Large Recommender Systems," Journal of Machine Learning Research, vol. 10, pp. 623–656.
[16]Elahi, Mehdi, et al., “A survey of active learning in collaborative filtering recommender systems,” Computer Science Review, Elsevier, 2016.
[17]Sanghack Lee and Jihoon Yang and SungYong Park, “Discovery of Hidden Similarity on Collaborative Filtering to Overcome Sparsity Problem,” Discovery Science, 2007.
[18]Zhou, Jia, and Tiejian Luo. "A novel approach to solve the sparsity problem in collaborative filtering." Networking, Sensing and Control (ICNSC), 2010 International Conference on. IEEE, 2010.
[19]Van Meteren, Robin, and Maarten Van Someren. "Using content-based filtering for recommendation." Proceedings of the Machine Learning in the New Information Age: MLnet/ECML2000 Workshop. 2000.
[20]M. Balabanovic, Y. Shoham, “Content-based, collaborative recommendation”, Communications of the ACM, pp. 66–72, 1997.
[21]Gomez-Uribe, Carlos A., and Neil Hunt, "The Netflix recommender system: Algorithms, business value, and innovation," ACM Transactions on Management Information Systems (TMIS), 2016.
[22]Rubens, Neil, et al., "Active Learning in Recommender Systems," Recommender Systems Handbook. Springer, US, 2016.
[23]Adomavicius, G.; Tuzhilin, A., "Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions", IEEE Transactions on Knowledge and Data Engineering, vol.17, no.6, pp.734-749, June 2005.
[24]Sarwar, B., George Karypis, Joseph Konstan, and John Riedl, "Item-based collaborative filtering recommendation algorithms", In Proceedings of the 10th international conference on World Wide Web, ACM, pp. 285-295, 2001.
[25]L. T. Ponnam, S. Deepak Punyasamudram, S. N. Nallagulla and S. Yellamati, "Movie recommender system using item based collaborative filtering technique", 2016 International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS), Pudukkottai, pp. 1-5, 2016.
[26]W. Weijie, Y. Jing and H. Liang, "An Improved Collaborative Filtering Based on Item Similarity Modified and Common Ratings", 2012 International Conference on Cyberworlds, Darmstadt, pp. 231-235, 2012.
[27]S. Wei, N. Ye, S. Zhang, X. Huang and J. Zhu, "Item-Based Collaborative Filtering Recommendation Algorithm Combining Item Category with Interestingness Measure", 2012 International Conference on Computer Science and Service System, Nanjing, pp. 2038-2041, 2012.
[28]deRoos, D., Zikopoulos, P.C., Brown, B., Coss, R., Melnyk, R.B., “Hadoop For Dummies,” pp. 13, John Wiley & Sons, Inc., 2014.
[29]J. B. Schafer, D. Frankowski, et al.,“Collaborative filtering recommender systems”, The Adaptive Web, pp. 291–324, 2007.
[30]Patel, A.B., Birla, M., Nair, U., “Addressing big data problem using hadoop and mapreduce,” 2012 Nirma University International Conference on Engineering (NUiCONE), IEEE, pp. 1-5, 2012.
[31]Dean, Jeffrey, and Sanjay Ghemawat, "MapReduce: simplified data processing on large clusters," Communications of the ACM, pp. 107-113, 2008.
[32]McAuley, Julian, et al. "Image-based recommendations on styles and substitutes." Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2015.
[33]Alam, M.I., Pandey, M. and Rautaray, S.S., “A Proposal of Resource Allocation Management for Cloud Computing”, International Journal of Cloud Computing and Services Science (IJ-CLOSER), 3(2), pp.79-86, 2014.
[34]Dey, Monali, and Siddharth Swarup Rautaray, "Disease Predication of Cardio-Vascular Diseases, Diabetes and Malignancy in Lungs Based on Data Mining Classification Techniques.", International Journal of Computer Science International Journal of Computer Science and Engine and Engineering Open Access, 2014.
[35]Zaied, Abdel Nasser H., Gawaher Soliman Hussein, and Mohamed M. Hassan. "The role of knowledge management in enhancing organizational performance." International Journal of Information Engineering and Electronic Business 4.5 (2012): 27.
[36]Olaiya, Folorunsho, and Adesesan Barnabas Adeyemo. "Application of data mining techniques in weather prediction and climate change studies." International Journal of Information Engineering and Electronic Business 4.1 (2012): 51.