Chain of preference optimization: Improving chain-of-thought reasoning in LLMs
The recent development of chain-of-thought (CoT) decoding has enabled large language models (LLMs) to generate explicit logical reasoning paths for complex problem-solving. However, research indicates that these paths are not always deliberate and optimal. The tree-of-thought (ToT) method employs tr...
Main Authors: ZHANG, Xuan; DU, Chao; PANG, Tianyu; LIU, Qian; GAO, Wei; LIN, Min
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2024
Online Access: https://ink.library.smu.edu.sg/sis_research/9881
https://ink.library.smu.edu.sg/context/sis_research/article/10881/viewcontent/2406.09136v2.pdf
Institution: Singapore Management University
Similar Items
- Cue-CoT: Chain-of-thought prompting for responding to in-depth dialogue questions with LLMs
  by: WANG, Hongru, et al.
  Published: (2023)
- Plan-and-solve prompting: Improving zero-shot chain-of-thought reasoning by large language models
  by: WANG, Lei, et al.
  Published: (2023)
- Multimodal misinformation detection by learning from synthetic data with multimodal LLMs
  by: ZENG, Fengzhu, et al.
  Published: (2024)
- T-SciQ: Teaching multimodal Chain-of-Thought reasoning via large language model signals for science question answering
  by: WANG, Lei, et al.
  Published: (2024)
- Exploiting Reasoning Chains for Multi-hop Science Question Answering
  by: XU, Weiwen, et al.
  Published: (2021)