Journal of Physics: Conference Series | 2021

Improved Random Forest Algorithm Performance For Big Data

 
 

Abstract


In this paper, the effectiveness of using random forest algorithm in the big data is studied. The reason for choosing this algorithm is because of its effective results in many previous studies, so it was chosen. The random forest algorithm was applied to the big data Internet of Things (IoT) dataset, with size 150,000 instances. After applying the algorithm, it gave an Accuracy score of 99.976%, and this indicates the effectiveness of the random forest algorithm in the big data. This result made the researcher search for one of the ways that increases the value of Accuracy, knowing that it is an excellent result. And use a filter to remove frequent values, and this helps reduce data volume and keep related data. After applying the filter with the random forest algorithm, the Accuracy value appeared at 99.998%, which indicates that it improved the random forest algorithm’s performance in the big data.

Volume 1897
Pages None
DOI 10.1088/1742-6596/1897/1/012071
Language English
Journal Journal of Physics: Conference Series

Full Text