Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction

This project investigates the application of Large Language Models (LLMs) for urban data analysis, with a focus on traffic data. The primary objective of this project includes exploring capabilities of various LLMs such as ChatGPT, Claude, and Llama in urban data analysis, developing methodologies f...

Full description

Saved in:
Bibliographic Details
Main Author: Goh, Jeremy Chun Hao
Other Authors: Long Cheng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181199
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181199
record_format dspace
spelling sg-ntu-dr.10356-1811992024-11-18T02:24:37Z Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction Goh, Jeremy Chun Hao Long Cheng College of Computing and Data Science c.long@ntu.edu.sg Computer and Information Science Engineering LLMs Urban data analysis This project investigates the application of Large Language Models (LLMs) for urban data analysis, with a focus on traffic data. The primary objective of this project includes exploring capabilities of various LLMs such as ChatGPT, Claude, and Llama in urban data analysis, developing methodologies for traffic data prediction qualitatively and quantitatively, and comparing its performance against traditional data analysis methods. Tailored prompts were designed to facilitate the experiment and leveraged the capabilities of Poe.com, a platform which allowed users to create their own agent with customized knowledge base. The study revealed two main techniques for leveraging LLMs in urban data analytics: Standard Prompting and a One-time Setup method by creating a personalised assistant. While standard prompting techniques requires a new prompt for each analysis, the technique of developing a prompt as a personalised assistant eliminates the need for repeated prompt crafting, saving time and bridging the gap for prompt engineering knowledge in users. Key findings for qualitative data prediction have shown that ChatGPT-4o excels in data interpretation while Claude 3.5 – Sonnet can provide actionable insights and realistic forecasts. Llama, however, has faced challenges with achieving a moderate level of accuracy in data interpretation. Additional findings during evaluation of accuracy in quantitative predictions also revealed significant limitations when forecasting data such as total number of cars, which changes under highly volatile environments due to changing policies. However, data such as public transport ridership reflected a more stable and predictable pattern for public transport usage. Overall, this project demonstrates that LLMs can significantly enhance urban data analytic workflows, gaining quicker insights into traffic patterns compared to traditional methods, although some limitations remain, such as the need for domain expertise in interpreting results. Bachelor's degree 2024-11-18T02:24:37Z 2024-11-18T02:24:37Z 2024 Final Year Project (FYP) Goh, J. C. H. (2024). Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181199 https://hdl.handle.net/10356/181199 en SCSE23-0952 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Engineering
LLMs
Urban data analysis
spellingShingle Computer and Information Science
Engineering
LLMs
Urban data analysis
Goh, Jeremy Chun Hao
Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
description This project investigates the application of Large Language Models (LLMs) for urban data analysis, with a focus on traffic data. The primary objective of this project includes exploring capabilities of various LLMs such as ChatGPT, Claude, and Llama in urban data analysis, developing methodologies for traffic data prediction qualitatively and quantitatively, and comparing its performance against traditional data analysis methods. Tailored prompts were designed to facilitate the experiment and leveraged the capabilities of Poe.com, a platform which allowed users to create their own agent with customized knowledge base. The study revealed two main techniques for leveraging LLMs in urban data analytics: Standard Prompting and a One-time Setup method by creating a personalised assistant. While standard prompting techniques requires a new prompt for each analysis, the technique of developing a prompt as a personalised assistant eliminates the need for repeated prompt crafting, saving time and bridging the gap for prompt engineering knowledge in users. Key findings for qualitative data prediction have shown that ChatGPT-4o excels in data interpretation while Claude 3.5 – Sonnet can provide actionable insights and realistic forecasts. Llama, however, has faced challenges with achieving a moderate level of accuracy in data interpretation. Additional findings during evaluation of accuracy in quantitative predictions also revealed significant limitations when forecasting data such as total number of cars, which changes under highly volatile environments due to changing policies. However, data such as public transport ridership reflected a more stable and predictable pattern for public transport usage. Overall, this project demonstrates that LLMs can significantly enhance urban data analytic workflows, gaining quicker insights into traffic patterns compared to traditional methods, although some limitations remain, such as the need for domain expertise in interpreting results.
author2 Long Cheng
author_facet Long Cheng
Goh, Jeremy Chun Hao
format Final Year Project
author Goh, Jeremy Chun Hao
author_sort Goh, Jeremy Chun Hao
title Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
title_short Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
title_full Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
title_fullStr Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
title_full_unstemmed Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
title_sort large language models for urban data analysis: exploration of various methods and llms for traffic data prediction
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181199
_version_ 1816859025284792320