Comparison of Data Normalization for Wine Classification Using K-NN
Algorithm
Dublin Core
Title
Comparison of Data Normalization for Wine Classification Using K-NN
Algorithm
Algorithm
Subject
Normalization, K-fold cross validation, K-NN
Description
The range of values that are not balanced on each attribute can affect the quality of data mining results. For this reason, it is necessary to
pre-process the data. This preprocessing is expected to increase the accuracy of the results from the wine dataset classification. The
preprocessing method used is data transformation with normalization. There are three ways to do data transformation with normalization,
namely min-max normalization, z-score normalization, and decimal scaling. Data that has been processed from each normalization method will
be compared to see the results of the best classification accuracy using the K-NN algorithm. The K used in the comparisons were 1, 3, 5, 7, 9,
11. Before classifying the normalized wine dataset, it was divided into test data and training data with k-fold cross validation. The division of
the data using k is equal to 10. The results of the classification test with the K-NN algorithm show that the best accuracy lies in the wine dataset
which has been normalized using the min-max normalization method with K = 1 of 65.92%. The average obtained is 59.68%.
pre-process the data. This preprocessing is expected to increase the accuracy of the results from the wine dataset classification. The
preprocessing method used is data transformation with normalization. There are three ways to do data transformation with normalization,
namely min-max normalization, z-score normalization, and decimal scaling. Data that has been processed from each normalization method will
be compared to see the results of the best classification accuracy using the K-NN algorithm. The K used in the comparisons were 1, 3, 5, 7, 9,
11. Before classifying the normalized wine dataset, it was divided into test data and training data with k-fold cross validation. The division of
the data using k is equal to 10. The results of the classification test with the K-NN algorithm show that the best accuracy lies in the wine dataset
which has been normalized using the min-max normalization method with K = 1 of 65.92%. The average obtained is 59.68%.
Creator
Rohitash Chandra 1,*, Kaylash Chaudhary 2, Akshay Kumar
Date
2022
Contributor
PERI IRAWAN
Format
PDF
Language
ENGLISH
Type
TEXT
Files
Collection
Citation
Rohitash Chandra 1,*, Kaylash Chaudhary 2, Akshay Kumar, “Comparison of Data Normalization for Wine Classification Using K-NN
Algorithm,” Repository Horizon University Indonesia, accessed June 7, 2025, https://repository.horizon.ac.id/items/show/9316.
Algorithm,” Repository Horizon University Indonesia, accessed June 7, 2025, https://repository.horizon.ac.id/items/show/9316.