Streamsight: a toolkit for offline evaluation of recommender systems

There have been numerous Recommender System (RS) toolkits for offline evaluation that have been released over the years. However, little emphasis has been placed on observing the temporal aspects in the framework of these toolkits. We noticed that current toolkits tend to prioritize complex algorith...

全面介紹

Saved in:
書目詳細資料
主要作者: Ng, Tze Kean
其他作者: Sun Aixin
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/181114
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:There have been numerous Recommender System (RS) toolkits for offline evaluation that have been released over the years. However, little emphasis has been placed on observing the temporal aspects in the framework of these toolkits. We noticed that current toolkits tend to prioritize complex algorithm implementations and the variety of metrics that are used to evaluate these algorithms. Instead, we would like to take a step back to consider another angle of approaching the implementation of toolkits for RS. That is, to consider appropriate approaches in handling the temporal aspects of the data pertaining to the data split scheme and how it can be observed during the evaluation of RS. This report introduces Streamsight, an open-source Python RS toolkit developed and made available on Python Package Index (PyPI). Streamsight provides a framework which considers the existing gaps discussed and implements the proposed solutions in this report. Streamsight provides the entire framework to develop and test RS, mainly targeted towards implementing a global sliding window as a proposed data split scheme and evaluation method for RS which considers a temporal aspect. With the observance of the temporal element, we aim to bring offline evaluation closer to the actual dynamic data communication and flow in the online setting. In this library, we provide the programmer with the APIs that abstract the underlying implementation for easy and standardized use of the implementation. The project and API documentation can be found in Github and PyPI.