ALI-Agent: Assessing LLMs' alignment with human values via agent-based evaluation
Large Language Models (LLMs) can elicit unintended and even harmful content when misaligned with human values, posing severe risks to users and society. To mitigate these risks, current evaluation benchmarks predominantly employ expert-designed contextual scenarios to assess how well LLMs align with...
Main Authors: ZHENG, Jingnan; WANG, Han; NGUYEN, Tai D.; ZHANG, An; SUN, Jun; CHUA, Tat-Seng
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2024
Online Access: https://ink.library.smu.edu.sg/sis_research/9834
https://ink.library.smu.edu.sg/context/sis_research/article/10834/viewcontent/8621_ALI_Agent_Assessing_LLMs_.pdf
Institution: Singapore Management University
Similar Items
- CoSec: On-the-Fly security hardening of code LLMs via supervised co-decoding
  by: LI, Dong, et al.
  Published: (2024)
- Agents in Medical Informatics
  by: SHANKARARAMAN, Venky, et al.
  Published: (2000)
- Explaining regressions via alignment slicing and mending
  by: WANG, Haijun, et al.
  Published: (2019)
- Automated sourcing agent (I)
  by: Nguyen Thi Kim Ngan
  Published: (2020)
- Software agent for economic sustainability assessment of recycling technology
  by: Sheng, Yunzhou.
  Published: (2011)