Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction
This project investigates the application of Large Language Models (LLMs) for urban data analysis, with a focus on traffic data. The primary objectives of this project include exploring the capabilities of various LLMs, such as ChatGPT, Claude, and Llama, in urban data analysis, developing methodologies f...
Main Author: | Goh, Jeremy Chun Hao |
---|---|
Other Authors: | Long Cheng |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University, 2024 |
Subjects: | Computer and Information Science; Engineering; LLMs; Urban data analysis |
Online Access: | https://hdl.handle.net/10356/181199 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-181199 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-181199; 2024-11-18T02:24:37Z; Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction; Goh, Jeremy Chun Hao; Long Cheng; College of Computing and Data Science; c.long@ntu.edu.sg; Computer and Information Science; Engineering; LLMs; Urban data analysis; Bachelor's degree; 2024-11-18T02:24:37Z; 2024-11-18T02:24:37Z; 2024; Final Year Project (FYP); Goh, J. C. H. (2024). Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181199; https://hdl.handle.net/10356/181199; en; SCSE23-0952; application/pdf; Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science; Engineering; LLMs; Urban data analysis |
description |
This project investigates the application of Large Language Models (LLMs) for urban data analysis, with a focus on traffic data. The primary objectives of this project include exploring the capabilities of various LLMs, such as ChatGPT, Claude, and Llama, in urban data analysis; developing methodologies for qualitative and quantitative traffic data prediction; and comparing their performance against traditional data analysis methods. Tailored prompts were designed to facilitate the experiments, which leveraged Poe.com, a platform that allows users to create their own agents with a customized knowledge base.
The study identified two main techniques for leveraging LLMs in urban data analytics: Standard Prompting and a One-time Setup method that creates a personalised assistant. While standard prompting requires a new prompt for each analysis, developing a prompt as a personalised assistant eliminates the need for repeated prompt crafting, saving time and bridging the gap in users' prompt engineering knowledge.
Key findings for qualitative data prediction show that ChatGPT-4o excels in data interpretation, while Claude 3.5 Sonnet can provide actionable insights and realistic forecasts. Llama, however, had difficulty achieving a moderate level of accuracy in data interpretation. Evaluation of the accuracy of quantitative predictions also revealed significant limitations when forecasting data such as the total number of cars, which is highly volatile because of changing policies. By contrast, data such as public transport ridership followed a more stable and predictable pattern.
Overall, this project demonstrates that LLMs can significantly enhance urban data analytics workflows, yielding quicker insights into traffic patterns than traditional methods, although some limitations remain, such as the need for domain expertise in interpreting results. |
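The abstract above mentions evaluating the accuracy of quantitative predictions and comparing LLM output against traditional methods, but this record does not state which metric or baseline the project used. As a minimal sketch only, the comparison could look like the following, assuming mean absolute percentage error (MAPE) as the metric and a naive last-value forecast as the traditional baseline. The function names `mape` and `naive_forecast` are hypothetical, and all numbers are placeholders, not results from the project.

```python
from typing import List, Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error, expressed in percent."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("series must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def naive_forecast(history: Sequence[float], horizon: int) -> List[float]:
    """Assumed 'traditional' baseline: repeat the last observed value."""
    if len(history) == 0:
        raise ValueError("history must be non-empty")
    return [history[-1]] * horizon


# Placeholder values for illustration only; NOT data from the project.
history = [100.0, 110.0, 120.0]   # past observations given to both methods
actual = [125.0, 130.0]           # held-out values to score against
llm_predicted = [126.0, 128.0]    # stand-in for numbers parsed from an LLM reply

baseline_predicted = naive_forecast(history, len(actual))
llm_error = mape(actual, llm_predicted)
baseline_error = mape(actual, baseline_predicted)
print(f"LLM MAPE: {llm_error:.2f}%, baseline MAPE: {baseline_error:.2f}%")
```

A lower MAPE indicates a closer forecast. A series that shifts abruptly with policy changes, as the abstract describes for the total number of cars, would be expected to produce larger errors under either method than a steadier series such as public transport ridership.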
author2 |
Long Cheng |
format |
Final Year Project |
author |
Goh, Jeremy Chun Hao |
title |
Large language models for urban data analysis: exploration of various methods and LLMs for traffic data prediction |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181199 |
_version_ |
1816859025284792320 |