DETERMINING HOME-BASED WORK TRIP BASED ON TWITTER DATA USING MACHINE LEARNING

This research aims to predict the number of Home-Based Work (HBW) trips at zonal level using Twitter data and Machine Learning approaches. The conclusion of this research shows that using Twitter data alone is not effective, and the integration of Twitter data with with Home-Interview (HI) survey...

全面介紹

Saved in:
書目詳細資料
主要作者: Sora Rayat, Rempu
格式: Dissertations
語言:Indonesia
主題:
在線閱讀:https://digilib.itb.ac.id/gdl/view/87177
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Institut Teknologi Bandung
語言: Indonesia
實物特徵
總結:This research aims to predict the number of Home-Based Work (HBW) trips at zonal level using Twitter data and Machine Learning approaches. The conclusion of this research shows that using Twitter data alone is not effective, and the integration of Twitter data with with Home-Interview (HI) survey data shows better model performance, namely being able to increase the accuracy of the model predicting worker trip-rates per zone. The PM approach is used to predict the value of explanatory variables in the prediction model, where the target variable is worker trip-rate per zone. Predicting the amount of HBW production per zone with an unbalanced amount of data in urban zones uses Oridinary Least Square (OLS). In this research, Twitter data from 2018 to 2021 was used to obtain information on residence location, workplace location, employment status and type of user's employment, education level, income level, ownership of 2-wheeled vehicles (motorbikes), 4-wheeled vehicles (cars), distance from residence location to work location, and number of daily HBW trips. As support, 2018 HI data is used to provide more comprehensive socio-economic information and trip patterns. The data integration process involved matching individual origin zones in both Twitter data HI survey data, employment status, occupation type, education level, income level, vehicle ownership (two-wheelers: motor bike and four-wheelers: car), the distance between home and work location, and the number of daily HBW trips. The prediction model based on Twitter data integrated with 2018 HI data shows superior performance compared to using only Twitter data. The OLS method provides coefficients for each explanatory variable in the model, which cannot be obtained when using the ANN method. The model in question was then used to estimate the number of HBW trip production for each zone in the research area based on Twitter data for 2018, 2019, 2020 and 2021. The case study was conducted in Serang City, Indonesia.