A study on forecasting bigmart sales using optimized machine learning techniques

(1) * N.M Saravana Kumar Mail (Department of Artificial Intelligence and Data Science, M. Kumarasamy College of Engineering, Karur, India)
(2) K Hariprasath Mail (Department of Information Technology, Vivekanandha College of Engineering for Women, Namakkal, India)
(3) N Kaviyavarshini Mail (Department of Information Technology, Vivekanandha College of Engineering for Women, Namakkal, India)
(4) A Kavinya Mail (Department of Information Technology, Vivekanandha College of Engineering for Women, Namakkal, India)
*corresponding author

Abstract


Data mining is an in-depth study of enormous amounts of data present in an organization or institution’s repository. Business experts mostly utilize data analytics approaches to confirm their opinion. It will rapidly boost the global interest of the organization. In this scenario, the information and conclusion are gathered from Data analysis by data analytics. The experts also use it to validate, diagnose, or authenticate speculate layouts suppositions and completion of the analysis. In this paper, the prediction is based on grocery data sets by inspecting and analyzing the big mart sales data set. Among several predictive algorithms, data mining algorithms are considered for prediction. It includes Decision Tree, Naïve Bayes, Adaboost with Particle Swarm optimization, and Random forest. The proposed method of this research is a novel Naïve Bayes with a PSO algorithm. This algorithm optimizes the model iteratively. Exploration of the data must be done before prediction. The root means squared error (RMSE) is used as evaluation metrics for comparing the data mining algorithms.  The proposed algorithm performs well and gives a lower RMSE value. So, the proposed algorithm fits the best model when compared with the existing algorithms. This paper describes the prediction of high-quality data analysis data and determines the efficiency of data mining algorithms.

Keywords


Exploration; Naive bayes with PSO; Predictive modeling; Decision Tree; Data analysis

   

DOI

https://doi.org/10.31763/sitech.v1i2.167
      

Article metrics

10.31763/sitech.v1i2.167 Abstract views : 3118 | PDF views : 1002

   

Cite

   

Full Text

Download

References


Y. Meier, J. Xu, O. Atan, and M. Van Der Schaar, “Personalized grade prediction: a data mining approach,” in 2015 IEEE International Conference on Data Mining, Nov. 2015, pp. 907–912, doi: 10.1109/ICDM.2015.54.

K. Punam, R. Pamula, and P. K. Jain, “A two-level statistical model for big mart sales prediction,” in 2018 International Conference on Computing, Power and Communication Technologies (GUCON), Sep. 2018, pp. 617–620, doi: 10.1109/GUCON.2018.8675060.

M. N, P. Chatradi, A. C. V, S. M. Kalavala, and N. K. S, “Improvizing big market sales prediction,” J. Xi’an Univ. Archit. Technol., vol. XII, no. IV, pp. 4307–4313, 2020, doi: 10.37896/JXAT12.04/1172.

K. H. Sadia, A. Sharma, A. Paul, Sarmistha Padhi, and S. Sanyal, “Stock market prediction using machine learning algorithms,” Int. J. Eng. Adv. Technol., vol. 8, no. 4, pp. 25–31, 2019, [Online]. Available: https://www.ijeat.org/wp-content/uploads/papers/v8i4/D6321048419.pdf.

K. Chen, Y. Li, and X. Xu, “Rotating target classification base on micro-Doppler features using a modified adaptive boosting algorithm,” in 2015 International Conference on Computers, Communications, and Systems (ICCCS), Nov. 2015, pp. 236–240, doi: 10.1109/CCOMS.2015.7562907.

B. X. Chen, R. Sahdev, and J. K. Tsotsos, “Person following robot using selected online ada-boosting with stereo camera,” in 2017 14th Conference on Computer and Robot Vision (CRV), May 2017, pp. 48–55, doi: 10.1109/CRV.2017.55.

N. N. Sakhare and S. S. Imambi, “Performance analysis of regression based machine learning techniques for prediction of stock market movement,” Int. J. Recent Technol. Eng., vol. 7, no. 6S4, pp. 206–213, 2019, [Online]. Available: https://www.ijedr.org/papers/IJEDR1804010.pdf .

A. Chandel, A. Dubey, S. Dhawale, and M. Ghuge, “Sales prediction system using machine learning,” Int. J. Sci. Res. Eng. Dev., vol. 2, no. 2, pp. 667–670, 2019, [Online]. Available: http://www.ijsred.com/volume2/issue2/IJSRED-V2I2P83.pdf.

H. Kadam, R. Shevade, D. Ketkar, and S. Rajguru, “A forecast for big mart sales based on random forests and multiple linear regression,” Int. J. Eng. Dev. Res., vol. 6, no. 4, pp. 41–42, 2018, [Online]. Available: https://www.ijedr.org/papers/IJEDR1804010.pdf.

B. Gokulakrishnan, P. Priyanthan, T. Ragavan, N. Prasath, and As. Perera, “Opinion mining and sentiment analysis on a Twitter data stream,” in International Conference on Advances in ICT for Emerging Regions (ICTer2012), Dec. 2012, pp. 182–188, doi: 10.1109/ICTer.2012.6423033.

T. Jain, A. . Dua, and V. Sharma, “Quantitative analysis of apriori and eclat algorithm for association rule mining,” Int. J. Eng. Comput. Sci., vol. 4, no. 10, pp. 14649–14652, 2015, [Online]. Available: http://103.53.42.157/index.php/ijecs/article/view/2973/2752.

P. P. Shinde, K. S. Oza, and R. K. Kamat, “Big data predictive analysis: Using R analytical tool,” in 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Feb. 2017, pp. 839–842, doi: 10.1109/I-SMAC.2017.8058297.

P.-Y. Zhou, K. C. C. Chan, and C. X. Ou, “Corporate communication network and stock price movements: insights from data mining,” IEEE Trans. Comput. Soc. Syst., vol. 5, no. 2, pp. 391–402, Jun. 2018, doi: 10.1109/TCSS.2018.2812703.

F.-J. Yang, “An extended idea about decision trees,” in 2019 International Conference on Computational Science and Computational Intelligence (CSCI), Dec. 2019, pp. 349–354, doi: 10.1109/CSCI49370.2019.00068.

S. Patil and U. Kulkarni, “Accuracy prediction for distributed decision tree using machine learning approach,” in 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), Apr. 2019, pp. 1365–1371, doi: 10.1109/ICOEI.2019.8862580.

A. Narkhede, M. Awari, S. Gawali, and A. Mhaisgawali, “Big mart sales prediction using machine learning techniques,” Int. J. Sci. Res. Eng. Dev., vol. 3, no. 4, pp. 693–697, 2020. Available: Google Scholar.

D. Shah, H. Isah, and F. Zulkernine, “Stock market analysis: a review and taxonomy of prediction techniques,” Int. J. Financ. Stud., vol. 7, no. 2, p. 26, May 2019, doi: 10.3390/ijfs7020026.

Y. Huang and L. Li, “Naive bayes classification algorithm based on small sample set,” in 2011 IEEE International Conference on Cloud Computing and Intelligence Systems, Sep. 2011, pp. 34–39, doi: 10.1109/CCIS.2011.6045027.

A. M. Rahat, A. Kahir, and A. K. M. Masum, “Comparison of Naive Bayes and SVM Algorithm based on Sentiment Analysis Using Review Dataset,” in 2019 8th International Conference System Modeling and Advancement in Research Trends (SMART), Nov. 2019, pp. 266–270, doi: 10.1109/SMART46866.2019.9117512.

H. Pan, Y. Zhu, and L. Z. Xia, “Hierarchical PSO-adaboost based classifiers for fast and robust face detection,” Int. J. Information, Control Comput. Sci., vol. 4, no. 11, 2011, doi: 10.5281/zenodo.1327696.

H. Faris, I. Aljarah, and B. Al-Shboul, “A hybrid approach based on particle swarm optimization and random forests for e-mail spam filtering,” in Nguyen NT., Iliadis L., Manolopoulos Y., Trawiński B. (eds) Computational Collective Intelligence. ICCCI 2016. Lecture Notes in Computer Science, Cham: Springer, 2016, pp. 498–508, doi: 10.1007/978-3-319-45243-2_46.

A. Tripathi, S. Yadav, and R. Rajan, “Naive Bayes Classification Model for the Student Performance Prediction,” in 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT), Jul. 2019, pp. 1548–1553, doi: 10.1109/ICICICT46008.2019.8993237.

U. N. Dulhare, “Prediction system for heart disease using Naive Bayes and particle swarm optimization,” Biomed. Res., vol. 29, no. 12, 2018, doi: 10.4066/biomedicalresearch.29-18-620.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2020 N.M Saravana Kumar, K Hariprasath, N Kaviyavarshini, A Kavinya

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

___________________________________________________________
Science in Information Technology Letters
ISSN 2722-4139
Published by Association for Scientific Computing Electrical and Engineering (ASCEE)
W : http://pubs2.ascee.org/index.php/sitech
E : sitech@ascee.org, andri@ascee.org, andri.pranolo.id@ieee.org

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0

View My Stats