Naive Bayes in Focus: A Thorough Examination of its Algorithmic Foundations and Use Cases

Raj Kumar; Bigit Krishna Goswami; Soham Motiram Mhatre; Sneha Agrawal1

1

Publication Date: 2024/06/07

Abstract: The Naive Bayes (NB) algorithm, a widely adopted probabilistic classification technique, holds significant importance across various domains such as natural language processing, spam detection, and sentiment analysis. This study thoroughly investigates the foundational principles of NB, Bayesian inference, and its practical implementations. Emphasizing its simplicity and efficiency, NB relies on the "naive" assumption of feature independence as its core principle. The study examines the implications of this assumption on model performance and offers strategies for addressing real-world deviations. Comparisons are drawn with four research papers that delve into different facets of Naive Bayes. The first paper, "Hidden Naive Bayes," explores methods for uncovering concealed dependencies within data and introduces a novel algorithm for this purpose. The second paper, "Learning the Naive Bayes Classifier with Optimization Models," investigates optimization techniques to enhance the performance of the Naive Bayes classifier. In contrast, the third paper, "Naive Bayes for Regression," explores the utilization of Naive Bayes in regression analysis. Lastly, the fourth paper, "Naive Bayes Classifiers," discusses various variants of NB tailored for different data types and presents comparative analyses across diverse scenarios.

Keywords: Naïve Bayes, Probabilistic Classification, Bayesian Inference, Independence Assumption, Real-World Case Studies.

DOI: https://doi.org/10.38124/ijisrt/IJISRT24MAY1438

PDF: https://ijirst.demo4.arinfotech.co/assets/upload/files/IJISRT24MAY1438.pdf

REFERENCES

  1. Rish, I. (2001, August). An empirical study of the naive Bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence (Vol. 3, No. 22, pp. 41-46).
  2. Webb, G. I., Keogh, E., & Miikkulainen, R. (2010). Naïve Bayes. Encyclopedia of machine learning15(1), 713-714.
  3. Yang, F. J. (2018, December). An implementation of naive bayes classifier. In 2018 International conference on computational science and computational intelligence (CSCI) (pp. 301-306). IEEE.
  4. Jiang, L., Zhang, H., & Cai, Z. (2008). A novel bayes model: Hidden naive bayes. IEEE Transactions on knowledge and data engineering21(10), 1361-1371.
  5. Jiang, L., Wang, D., Cai, Z., & Yan, X. (2007). Survey of improving naive bayes for classification. In Advanced Data Mining and Applications: Third International Conference, ADMA 2007 Harbin, China, August 6-8, 2007. Proceedings 3 (pp. 134-145). Springer Berlin Heidelberg.
  6. Rish, I. (2001, August). An empirical study of the naive Bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence (Vol. 3, No. 22, pp. 41-46).
  7. John, G. H., & Langley, P. (2013). Estimating continuous distributions in Bayesian classifiers. arXiv preprint arXiv:1302.4964.
  8. Bayes, T. (1968). Naive bayes classifier. Article Sources and Contributors, 1-9.
  9. Zhang, H., & Li, D. (2007, November). Naïve Bayes text classifier. In 2007 IEEE international conference on granular computing (GRC 2007) (pp. 708-708). IEEE.
  10. Chen, S., Webb, G. I., Liu, L., & Ma, X. (2020). A novel selective naïve Bayes algorithm. Knowledge-Based Systems192, 105