Dynamic Algorithm Configuration (DAC) addresses the challenge of dynamically setting hyperparameters of an algorithm for a diverse set of instances rather than focusing solely on individual tasks. Agents trained with Deep Reinforcement Learning (RL) offer a pathway to solve such settings. However, the limited generalization performance of these agents has significantly hindered their application to DAC. Our hypothesis is that a bias in the training instances limits the agent's generalization capabilities. We take a step towards mitigating this by selecting a representative subset of training instances to overcome overrepresentation and then retraining the agent on this subset to improve its generalization performance. When constructing the meta-features for the subset selection, we particularly account for the dynamic nature of the RL agent by computing time-series features on trajectories of actions and rewards generated by the agent's interaction with the environment. Through empirical evaluations on the Sigmoid and CMA-ES benchmarks from DACBench, the standard benchmark library for DAC, we discuss the potential of our selection technique compared to training on the entire instance set. Our results highlight the efficacy of instance selection in refining DAC policies for diverse instance spaces.
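As a rough sketch of this pipeline (illustrative only: the function names, the simple summary features, and the clustering-based selection below are assumptions, not the repository's actual implementation, which builds a similarity graph of instances):

```python
# Minimal sketch: featurize reward trajectories per instance, then pick a
# representative subset to retrain on. All names here are hypothetical.
import numpy as np
from sklearn.cluster import KMeans

def trajectory_features(rewards: np.ndarray) -> np.ndarray:
    """Simple time-series summaries of one reward trajectory (stand-ins for catch22 features)."""
    diffs = np.diff(rewards)
    return np.array([rewards.mean(), rewards.std(), diffs.mean(), diffs.std(), rewards[-1] - rewards[0]])

def select_representative_instances(trajectories: dict[str, np.ndarray], k: int) -> list[str]:
    """Pick k instances whose feature vectors lie closest to cluster centers."""
    ids = list(trajectories)
    X = np.stack([trajectory_features(trajectories[i]) for i in ids])
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    selected = []
    for c in range(k):
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(X[members] - km.cluster_centers_[c], axis=1)
        selected.append(ids[members[np.argmin(dists)]])
    return selected

# Usage: one reward trajectory per instance, collected from the trained agent's rollouts.
rollouts = {f"instance_{i}": np.random.rand(50) for i in range(20)}
subset = select_representative_instances(rollouts, k=5)
print(subset)  # retrain the DAC agent on this representative subset
```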
git clone https://github.com/automl/Instance-DAC.git
cd Instance-DAC
conda create -n instance_dac python=3.11
conda activate instance_dac
# Install for usage
pip install .
# Install for development
make install-dev
pip install -r requirements.txt
git clone git@github.com:automl/DACBench.git
cd DACBench
git checkout instance_dac
pip install -e .[modcma]
# SELECTOR
git clone https://github.com/gjorgjinac/InstanceDACSelector.git lib/InstanceDACSelector
pip install -r lib/InstanceDACSelector/requirements.txt
Check scripts/sigmoid_experiment.sh and scripts/cmaes_experiment.sh for the commands and details on how to run the experiments.
In order to build the similarity graph of instances, we need to calculate meta-features. In this case, we calculate time-series features on the rollout history, e.g., the reward trajectory per episode. For this, the R requirements need to be installed (R version 4.0.2 (2020-06-22) with the Rcatch22_0.1.13 package). The code to calculate the features is here; the file paths need to be adjusted accordingly.
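The repository computes these features in R via Rcatch22; purely to illustrate what a per-trajectory feature vector looks like, the sketch below computes the same catch22 feature set in Python with the pycatch22 package (using pycatch22 here, as well as the variable names, is an assumption and not how the repository does it):

```python
# Illustrative only: the repository uses R's Rcatch22 for this step.
# pycatch22 exposes the same 22 canonical time-series features in Python.
import numpy as np
import pycatch22

# One rollout's reward trajectory (placeholder data).
reward_trajectory = np.random.rand(100).tolist()

# catch22_all returns {"names": [...], "values": [...]} for the 22 features.
result = pycatch22.catch22_all(reward_trajectory)
meta_features = dict(zip(result["names"], result["values"]))
print(meta_features)
```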
With the script instance_dac/collect_data.py you can gather all the log files and create a CSV file. The table will be saved under data/runs/<env_name>/<training_set> and can contain performance data on different test sets.
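As a quick way to inspect such a table, the snippet below loads it with pandas and averages performance per test set (the file name performance.csv and the column names test_set and return are assumptions; adjust them to the actual output of collect_data.py):

```python
# Sketch of inspecting the collected results; file and column names are assumptions.
from pathlib import Path
import pandas as pd

# Substitute the actual environment name and training-set folder.
run_table = Path("data/runs") / "<env_name>" / "<training_set>" / "performance.csv"
df = pd.read_csv(run_table)

# Example: mean performance per test set, assuming columns "test_set" and "return".
summary = df.groupby("test_set")["return"].mean()
print(summary)
```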