MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE

In line with rapid business process digitalization in the tax directorate, the size of data stored in the institution has grown exponentially. The issue of how to generate value out of the valuable data assets remains unsolved. Correspondingly, this research provides machine learning-based predictiv...

Full description

Saved in:
Bibliographic Details
Main Author: David Febriminanto, Raden
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/67765
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:67765
spelling id-itb.:677652022-08-26T08:04:40ZMACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE David Febriminanto, Raden Indonesia Theses Machine learning, Predictive Analytics, Tax, Data mining, Decision tree, Random forest, Logistic regression INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/67765 In line with rapid business process digitalization in the tax directorate, the size of data stored in the institution has grown exponentially. The issue of how to generate value out of the valuable data assets remains unsolved. Correspondingly, this research provides machine learning-based predictive analytics as a solution on how to use the taxpayers’ trigger data as a decision support system to discover and realize unexplored tax potential. More specifically, this research presents predictive analytic models that can accurately predict which potential taxpayers are likely to pay their due. We develop three machine learning models, namely logistic regression, random forest, and decision tree. We analyzed a total of 5,562 tax revenue potential data samples with eight predictors such as trigger data nominal value, distance tax office, type of taxpayer, media of tax report, type of tax, type of report status, and registered year of the taxpayer, and area coverage. Our study shows that the random forest model provides the best prediction performance. The results of the weight of each attribute indicated that the status of the tax report is the top tier of variable importance in predicting tax revenue potential. The analytics can help tax officers in determining potential taxpayers with the highest likelihood of paying their due. Given the size of the data records, this approach can provide tax administrations with a powerful tool to increase work efficiency, combat tax evasion, and provide better customer service. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description In line with rapid business process digitalization in the tax directorate, the size of data stored in the institution has grown exponentially. The issue of how to generate value out of the valuable data assets remains unsolved. Correspondingly, this research provides machine learning-based predictive analytics as a solution on how to use the taxpayers’ trigger data as a decision support system to discover and realize unexplored tax potential. More specifically, this research presents predictive analytic models that can accurately predict which potential taxpayers are likely to pay their due. We develop three machine learning models, namely logistic regression, random forest, and decision tree. We analyzed a total of 5,562 tax revenue potential data samples with eight predictors such as trigger data nominal value, distance tax office, type of taxpayer, media of tax report, type of tax, type of report status, and registered year of the taxpayer, and area coverage. Our study shows that the random forest model provides the best prediction performance. The results of the weight of each attribute indicated that the status of the tax report is the top tier of variable importance in predicting tax revenue potential. The analytics can help tax officers in determining potential taxpayers with the highest likelihood of paying their due. Given the size of the data records, this approach can provide tax administrations with a powerful tool to increase work efficiency, combat tax evasion, and provide better customer service.
format Theses
author David Febriminanto, Raden
spellingShingle David Febriminanto, Raden
MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
author_facet David Febriminanto, Raden
author_sort David Febriminanto, Raden
title MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
title_short MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
title_full MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
title_fullStr MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
title_full_unstemmed MACHINE LEARNING ANALYTICS FOR PREDICTING TAX REVENUE POTENTIAL: AN EMPIRICAL CASE OF ANONYMOUS TAX OFFICE
title_sort machine learning analytics for predicting tax revenue potential: an empirical case of anonymous tax office
url https://digilib.itb.ac.id/gdl/view/67765
_version_ 1822933443043917824