Improving environment novelty quantification for effective unsupervised environment design

Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student’s ability...

Full description

Saved in:
Bibliographic Details
Main Authors: TEOH, Jayden, LI, Wenjun, VARAKANTHAM, Pradeep
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9921
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10921
record_format dspace
spelling sg-smu-ink.sis_research-109212025-01-02T08:03:58Z Improving environment novelty quantification for effective unsupervised environment design TEOH, Jayden LI, Wenjun VARAKANTHAM, Pradeep Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student’s ability to handle unseen scenarios. Existing UED methods mainly rely on regret, a metric that measures the difference between the agent’s optimal and actual performance, to guide curriculum design. Regret-driven methods generate curricula that progressively increase environment complexity for the student but overlook environment novelty–a critical element for enhancing an agent’s generalizability. Measuring environment novelty is especially challenging due to the underspecified nature of environment parameters in UED, and existing approaches face significant limitations. To address this, this paper introduces the Coverage-based Evaluation of Novelty In Environment (CENIE) framework. CENIE proposes a scalable, domainagnostic, and curriculum-aware approach to quantifying environment novelty by leveraging the student’s state-action space coverage from previous curriculum experiences. We then propose an implementation of CENIE that models this coverage and measures environment novelty using Gaussian Mixture Models. By integrating both regret and novelty as complementary objectives for curriculum design, CENIE facilitates effective exploration across the state-action space while progressively increasing curriculum complexity. Empirical evaluations demonstrate that augmenting existing regret-based UED algorithms with CENIE achieves stateof-the-art performance across multiple benchmarks, underscoring the effectiveness of novelty-driven autocurricula for robust generalization. 2024-12-20T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/9921 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Unsupervised environment design Regret-driven metric Environment novelty Databases and Information Systems Educational Assessment, Evaluation, and Research
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Unsupervised environment design
Regret-driven metric
Environment novelty
Databases and Information Systems
Educational Assessment, Evaluation, and Research
spellingShingle Unsupervised environment design
Regret-driven metric
Environment novelty
Databases and Information Systems
Educational Assessment, Evaluation, and Research
TEOH, Jayden
LI, Wenjun
VARAKANTHAM, Pradeep
Improving environment novelty quantification for effective unsupervised environment design
description Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent. The teacher generates new training environments with high learning potential, curating an adaptive curriculum that strengthens the student’s ability to handle unseen scenarios. Existing UED methods mainly rely on regret, a metric that measures the difference between the agent’s optimal and actual performance, to guide curriculum design. Regret-driven methods generate curricula that progressively increase environment complexity for the student but overlook environment novelty–a critical element for enhancing an agent’s generalizability. Measuring environment novelty is especially challenging due to the underspecified nature of environment parameters in UED, and existing approaches face significant limitations. To address this, this paper introduces the Coverage-based Evaluation of Novelty In Environment (CENIE) framework. CENIE proposes a scalable, domainagnostic, and curriculum-aware approach to quantifying environment novelty by leveraging the student’s state-action space coverage from previous curriculum experiences. We then propose an implementation of CENIE that models this coverage and measures environment novelty using Gaussian Mixture Models. By integrating both regret and novelty as complementary objectives for curriculum design, CENIE facilitates effective exploration across the state-action space while progressively increasing curriculum complexity. Empirical evaluations demonstrate that augmenting existing regret-based UED algorithms with CENIE achieves stateof-the-art performance across multiple benchmarks, underscoring the effectiveness of novelty-driven autocurricula for robust generalization.
format text
author TEOH, Jayden
LI, Wenjun
VARAKANTHAM, Pradeep
author_facet TEOH, Jayden
LI, Wenjun
VARAKANTHAM, Pradeep
author_sort TEOH, Jayden
title Improving environment novelty quantification for effective unsupervised environment design
title_short Improving environment novelty quantification for effective unsupervised environment design
title_full Improving environment novelty quantification for effective unsupervised environment design
title_fullStr Improving environment novelty quantification for effective unsupervised environment design
title_full_unstemmed Improving environment novelty quantification for effective unsupervised environment design
title_sort improving environment novelty quantification for effective unsupervised environment design
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9921
_version_ 1821237286074843136