$Generalizing math word problem solvers via solution diversification$

Generalizing math word problem solvers via solution diversification

Current math word problem (MWP) solvers are usually Seq2Seq models trained by the (one-problem; one-solution) pairs, each of which is made of a problem description and a solution showing reasoning flow to get the correct answer. However, one MWP problem naturally has multiple solution equations. The...

Full description

Saved in:

Bibliographic Details
Main Authors:	LIANG, Zhenwen, ZHANG, Jipeng, WANG, Lei, WANG, Yan, SHAO, Jie, ZHANG, Xiangliang
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Question Answering multiple solution equations Argumentation problem description training framework Artificial Intelligence and Robotics Mathematics Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/8089 https://ink.library.smu.edu.sg/context/sis_research/article/9092/viewcontent/26548_pvoa.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-9092
record_format	dspace
spelling	sg-smu-ink.sis_research-90922023-09-07T07:27:11Z Generalizing math word problem solvers via solution diversification LIANG, Zhenwen ZHANG, Jipeng WANG, Lei WANG, Yan SHAO, Jie ZHANG, Xiangliang Current math word problem (MWP) solvers are usually Seq2Seq models trained by the (one-problem; one-solution) pairs, each of which is made of a problem description and a solution showing reasoning flow to get the correct answer. However, one MWP problem naturally has multiple solution equations. The training of an MWP solver with (one-problem; one-solution) pairs excludes other correct solutions, and thus limits the generalizability of the MWP solver. One feasible solution to this limitation is to augment multiple solutions to a given problem. However, it is difficult to collect diverse and accurate augment solutions through human efforts. In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator. The buffer includes solutions generated by an MWP solver to encourage the training data diversity. The discriminator controls the quality of buffered solutions to participate in training. Our framework is flexibly applicable to a wide setting of fully, semi-weakly and weakly supervised training for all Seq2Seq MWP solvers. We conduct extensive experiments on a benchmark dataset Math23k and a new dataset named Weak12k, and show that our framework improves the performance of various MWP solvers under different settings by generating correct and diverse solutions. 2023-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8089 info:doi/10.1609/aaai.v37i11.26548 https://ink.library.smu.edu.sg/context/sis_research/article/9092/viewcontent/26548_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Question Answering multiple solution equations Argumentation problem description training framework Artificial Intelligence and Robotics Mathematics Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Question Answering multiple solution equations Argumentation problem description training framework Artificial Intelligence and Robotics Mathematics Theory and Algorithms
spellingShingle	Question Answering multiple solution equations Argumentation problem description training framework Artificial Intelligence and Robotics Mathematics Theory and Algorithms LIANG, Zhenwen ZHANG, Jipeng WANG, Lei WANG, Yan SHAO, Jie ZHANG, Xiangliang Generalizing math word problem solvers via solution diversification
description	Current math word problem (MWP) solvers are usually Seq2Seq models trained by the (one-problem; one-solution) pairs, each of which is made of a problem description and a solution showing reasoning flow to get the correct answer. However, one MWP problem naturally has multiple solution equations. The training of an MWP solver with (one-problem; one-solution) pairs excludes other correct solutions, and thus limits the generalizability of the MWP solver. One feasible solution to this limitation is to augment multiple solutions to a given problem. However, it is difficult to collect diverse and accurate augment solutions through human efforts. In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator. The buffer includes solutions generated by an MWP solver to encourage the training data diversity. The discriminator controls the quality of buffered solutions to participate in training. Our framework is flexibly applicable to a wide setting of fully, semi-weakly and weakly supervised training for all Seq2Seq MWP solvers. We conduct extensive experiments on a benchmark dataset Math23k and a new dataset named Weak12k, and show that our framework improves the performance of various MWP solvers under different settings by generating correct and diverse solutions.
format	text
author	LIANG, Zhenwen ZHANG, Jipeng WANG, Lei WANG, Yan SHAO, Jie ZHANG, Xiangliang
author_facet	LIANG, Zhenwen ZHANG, Jipeng WANG, Lei WANG, Yan SHAO, Jie ZHANG, Xiangliang
author_sort	LIANG, Zhenwen
title	Generalizing math word problem solvers via solution diversification
title_short	Generalizing math word problem solvers via solution diversification
title_full	Generalizing math word problem solvers via solution diversification
title_fullStr	Generalizing math word problem solvers via solution diversification
title_full_unstemmed	Generalizing math word problem solvers via solution diversification
title_sort	generalizing math word problem solvers via solution diversification
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/sis_research/8089 https://ink.library.smu.edu.sg/context/sis_research/article/9092/viewcontent/26548_pvoa.pdf
_version_	1779157151110922240

Generalizing math word problem solvers via solution diversification

Similar Items