An Enhancement of Deep Feature Synthesis Algorithm Using Mean, Median, and Mode Imputation

Josefa Ysabelle J. Maliwat; Princess A. Ylade; Richard C. Regala; Dan Michael A. Cortez; Antolin J. Alipio; Khatalyn E. Mata; Mark Christopher R. Blanco1

1

Publication Date: 2022/05/18

Abstract: The Deep Feature Synthesis (DFS) algorithm automates feature engineering and is capable of extracting and applying complicated featuresto a variety of processes. Due to the novelty of DFS as a method for feature engineering, critical ways for dealing with missing values and unwanted data in a dataset have yet to be established. This paper discusses the usage of mean, median, and mode imputation to preprocess data before analyzing it.However, it is only limited to displaying the differences between nonimputed and imputed datasets. This strategy enables users to obtain more precise results by eliminating biased estimations. This study demonstrates that there is a distinct difference between the two datasets. This paper is concluded by proving that imputing datasets will cause distinctness in the results compared to the results of the datasets with missing and unwanted values.

Keywords: Deep Feature Synthesis, Auto Feature Engineering, Imputation

DOI: https://doi.org/10.5281/zenodo.6558729

PDF: https://ijirst.demo4.arinfotech.co/assets/upload/files/IJISRT22APR1549.pdf

REFERENCES

No References Available