Deep learning techniques for math word problems
This paper presents a comprehensive investigation into Curriculum Learning (CL) applied to Math Word Problem (MWP) solving, examining its efficacy across a spectrum of scales and difficulty levels. Our study encompasses extensive experiments conducted on both small-scale and large-scale language...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/178278 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | This paper presents a comprehensive investigation into Curriculum Learning (CL)
applied to Math Word Problem (MWP) solving, examining its efficacy across a spectrum
of scales and difficulty levels. Our study encompasses extensive experiments conducted
on both small-scale and large-scale language models across three distinct MWP datasets,
featuring problems of varying difficulty ranges. Through rigorous evaluation, we find
that curriculum learning yield better performance than traditional training in MWP
solving tasks. We also find the potential of anti-curriculum learning in solving hard
mathematical questions. Additionally, we offer an in-depth analysis of the mechanisms
and effects of curriculum learning and its variations in MWP solving |
---|