Sphere Function with Various Algorithms

This example shows how to run various QD algorithms in pyribs with the sphere linear projection benchmark that originated in Fontaine 2020. Specifically, we consider a version of the benchmark with 100-dimensional solutions and a 2-dimensional measure space. Below, we show the mean and standard deviation over 20 trials when running each algorithm on this benchmark for 10,000 iterations.

Algorithm	QD Score	Coverage
map_elites	416,111.64 ± 2,681.37	50.72 ± 0.40%
line_map_elites	491,013.92 ± 1,118.74	60.44 ± 0.17%
cvt_map_elites	416,350.11 ± 3,942.42	50.67 ± 0.49%
line_cvt_map_elites	490,335.42 ± 4,818.54	60.39 ± 0.58%
me_map_elites	533,477.98 ± 15,041.68	65.28 ± 2.05%
cma_me_imp	456,678.01 ± 2,882.10	55.84 ± 0.41%
cma_me_imp_mu	498,464.35 ± 2,214.86	61.11 ± 0.34%
cma_me_basic	398,192.00 ± 10,174.36	47.04 ± 1.30%
cma_me_rd	458,403.60 ± 2,739.42	56.21 ± 0.42%
cma_me_rd_mu	512,692.56 ± 4,070.15	63.78 ± 0.61%
cma_me_opt	57,517.77 ± 3,042.75	5.97 ± 0.33%
cma_me_mixed	458,827.27 ± 2,359.05	56.19 ± 0.35%
og_map_elites	387,800.00 ± 2,760.75	46.83 ± 0.41%
omg_mega	753,680.67 ± 3.68	100.00 ± 0.00%
cma_mega	753,834.86 ± 2.15	100.00 ± 0.00%
cma_mega_adam	753,880.12 ± 1.92	100.00 ± 0.00%
cma_mae	633,368.55 ± 1,504.43	81.02 ± 0.27%
cma_maega	753,834.72 ± 4.45	100.00 ± 0.00%
ns_cma	149,414.31 ± 10,034.15	18.51 ± 1.26%
nslc	510,085.20 ± 2,693.40	63.11 ± 0.39%
nslc_cma_imp	539,365.47 ± 2,916.88	68.20 ± 0.46%
dds_kde	344,447.87 ± 40,153.65	72.72 ± 3.21%
dds_kde_sklearn	325,402.21 ± 36,203.61	75.88 ± 3.15%
dds_cnf¹	325,247.96 ± ??	68.48 ± ??%
dms	700,776.58 ± 8,525.05	95.99 ± 1.43%

¹Due to computational cost, dds_cnf was only run for 1 trial (it takes ~19 hours). See #707 for more info.
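To make the two columns in the table concrete, here is a minimal sketch of how they can be computed. It assumes the common definitions: QD score is the sum of the objectives of all elites in the result archive, and coverage is the fraction of archive cells that hold an elite. `qd_metrics` is a hypothetical helper written for illustration, not a pyribs function.

```python
def qd_metrics(elite_objectives, num_cells):
    """Return (qd_score, coverage) for an archive.

    Assumed definitions (not taken from the pyribs source):
    - qd_score: sum of the objectives of all elites in the archive.
    - coverage: fraction of cells that currently hold an elite.

    Args:
        elite_objectives: Objectives of the stored elites, one per filled cell.
        num_cells: Total number of cells in the archive.
    """
    if num_cells <= 0:
        raise ValueError("num_cells must be positive")
    qd_score = float(sum(elite_objectives))
    coverage = len(elite_objectives) / num_cells
    return qd_score, coverage


# Toy archive with 4 cells, 3 of which are filled.
score, coverage = qd_metrics([100.0, 75.0, 50.0], num_cells=4)
print(f"QD score: {score:.2f}, coverage: {coverage:.2%}")
```

Under these assumptions, an archive with 10,000 cells and objectives in [0, 100] has a QD score of at most 1,000,000, which gives a sense of scale for the numbers above.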

sphere.py

This is the primary file, showing how to set up the sphere benchmark and run QD algorithms on it.
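Before the full listing, the following standalone sketch may help with reading the objective normalization used in the file. It mirrors the constants in the file's `sphere` function (bound 5.12, shift 0.4 * 5.12) but handles a single solution in pure Python and leaves out the measures and gradients. The name `normalized_sphere` is hypothetical and does not appear in the file.

```python
# Optimum of the shifted sphere function: every coordinate equals 2.048.
SPHERE_SHIFT = 5.12 * 0.4


def normalized_sphere(solution):
    """Normalized sphere objective for one solution (a list of floats).

    100 corresponds to the optimum; 0 corresponds to the point where every
    coordinate is -5.12, the corner of the box farthest from the optimum.
    """
    dim = len(solution)
    best_obj = 0.0
    worst_obj = (-5.12 - SPHERE_SHIFT) ** 2 * dim
    raw_obj = sum((x - SPHERE_SHIFT) ** 2 for x in solution)
    return (raw_obj - worst_obj) / (best_obj - worst_obj) * 100


dim = 100
print(normalized_sphere([SPHERE_SHIFT] * dim))  # optimum: 100.0
print(normalized_sphere([-5.12] * dim))  # farthest corner: approximately 0
print(normalized_sphere([0.0] * dim))  # origin: approximately 91.84
```

The value at the origin follows from 1 - (2.048 / 7.168)^2 = 45/49, so a solution at the origin already scores about 91.84 out of 100.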

"""Runs various QD algorithms on the sphere linear projection benchmark.

Install the following dependencies before running this example:
    pip install ribs[visualize] torch "zuko>=1.0.0" tqdm fire loguru tabulate

The sphere function in this example is adapted from Section 4 of Fontaine 2020
(https://arxiv.org/abs/1912.02400). Namely, each solution value is clipped to the range
[-5.12, 5.12], and the optimum is moved from [0,..] to [0.4 * 5.12 = 2.048,..].
Furthermore, the objectives are normalized to the range [0, 100] where 100 is the
maximum and corresponds to 0 on the original sphere function.

There are two measures in this example. The first is the sum of the first n/2 clipped
values of the solution, and the second is the sum of the last n/2 clipped values of the
solution. Having each measure depend equally on several values in the solution space
makes the problem more difficult (refer to Fontaine 2020 for more info).

We support a number of algorithms in this script. The parameters for each algorithm are
stored in CONFIG. The parameters roughly reproduce the results from the CMA-MAE paper
(Fontaine 2023, https://arxiv.org/abs/2205.10752), i.e., they use the following
settings:
- Archives have 10,000 cells, either as a 100x100 grid archive or a 10,000-cell CVT
  archive.
- Each algorithm generates 540 solutions every iteration, typically as one emitter
  generating 540 solutions or 15 emitters generating 36 solutions each.
- By default, each algorithm runs for 10,000 iterations.
- By default, the script uses the 100-dimensional version of the sphere problem.

Below we list the algorithms available.

MAP-Elites and MAP-Elites (line):
- `map_elites`: GridArchive with GaussianEmitter.
- `line_map_elites`: GridArchive with IsoLineEmitter.
- `cvt_map_elites`: CVTArchive with GaussianEmitter.
- `line_cvt_map_elites`: CVTArchive with IsoLineEmitter.

Multi-Emitter MAP-Elites:
- `me_map_elites`: MAP-Elites with Bandit Scheduler.

CMA-ME:
- `cma_me_imp`: GridArchive with EvolutionStrategyEmitter using
  TwoStageImprovementRanker; this is the suggested version of CMA-ME.
- `cma_me_imp_mu`: GridArchive with EvolutionStrategyEmitter using
  TwoStageImprovementRanker and mu selection rule.
- `cma_me_basic`: GridArchive with EvolutionStrategyEmitter using
  TwoStageImprovementRanker, mu selection rule, and basic restart rule. This is the
  version of CMA-ME that was used as a baseline in Fontaine 2023.
- `cma_me_rd`: GridArchive with EvolutionStrategyEmitter using
  TwoStageRandomDirectionRanker.
- `cma_me_rd_mu`: GridArchive with EvolutionStrategyEmitter using
  TwoStageRandomDirectionRanker and mu selection rule.
- `cma_me_opt`: GridArchive with EvolutionStrategyEmitter using ObjectiveRanker with mu
  selection rule.
- `cma_me_mixed`: GridArchive with EvolutionStrategyEmitter, where half (7) of the
  emitters use TwoStageRandomDirectionRanker and half (8) use TwoStageImprovementRanker.

DQD algorithms:
- `og_map_elites`: GridArchive with GradientOperatorEmitter; does not use measure
  gradients.
- `omg_mega`: GridArchive with GradientOperatorEmitter; uses measure gradients.
- `cma_mega`: GridArchive with GradientArborescenceEmitter.
- `cma_mega_adam`: GridArchive with GradientArborescenceEmitter using Adam Optimizer.

CMA-MAE and CMA-MAEGA:
- `cma_mae`: GridArchive (learning_rate = 0.01) with EvolutionStrategyEmitter using
  ImprovementRanker.
- `cma_maega`: GridArchive (learning_rate = 0.01) with GradientArborescenceEmitter using
  ImprovementRanker.

Novelty Search:
- `ns_cma`: Novelty Search with CMA-ES; implemented using a ProximityArchive with
  EvolutionStrategyEmitter. Results are stored in a passive GridArchive. Note that the
  objective will not be optimized in this case.
- `nslc`: Novelty Search with Local Competition (NSLC); uses a ProximityArchive with
  EvolutionStrategyEmitter and NSLCRanker to rank solutions by novelty and local
  competition.
- `nslc_cma_imp`: EvolutionStrategyEmitter with a ProximityArchive with local
  competition turned on. Thus, the archive returns two-stage improvement information
  that is fed to the EvolutionStrategyEmitter just like in CMA-ME.

DDS:
- `dds_kde`: Density Descent Search (Lee 2024; https://arxiv.org/abs/2312.11331) with a
  KDE as the density estimator. Uses DensityArchive and EvolutionStrategyEmitter with
  DensityRanker.
- `dds_kde_sklearn`: Density Descent Search using scikit-learn's KernelDensity as the
  density estimator.
- `dds_cnf`: Density Descent Search using a Continuous Normalizing Flow (CNF) as the
  density estimator.

DMS:
- `dms`: Discount Model Search (Tjanaka 2026, https://discount-models.github.io/), with
  the MLP discount model proposed in that paper. Note that the results presented in
  Tjanaka 2026 were with a version of the sphere domain that normalized the objectives
  to [0, 1], whereas this script uses objectives in [0, 100]. To convert results,
  multiply the QD Score from that paper by 100.

By default, outputs are saved in a directory called
`logs/sphere/{algorithm}_{dim}/YYYY-MM-DD_HH-MM-SS_seed-{seed}`, where
YYYY-MM-DD_HH-MM-SS is a timestamp. The directory contains the following outputs:
- The archive is saved as a CSV named `archive.csv`.
- Snapshots of the heatmap are saved as `heatmap_{iteration}.png`.
- Metrics from the run are saved in `metrics.json`.
- Plots of the metrics are saved as PNGs with the name `{metric_name}.png`.
- The log messages from the run are saved in `out.log`.

To generate a video of the heatmap from the heatmap images, use a tool like ffmpeg. For
example, the following will generate a 6 FPS (Frames Per Second) video showing the
heatmap for an example run of cma_me_imp with 100 dims.

    ffmpeg -r 6 -i "logs/sphere/cma_me_imp_100/2026-04-21_04-51-31_seed-None/heatmap_%*.png" \
        logs/sphere/cma_me_imp_100/2026-04-21_04-51-31_seed-None/heatmap_video.mp4

Usage (see sphere_main function for all args or run `python sphere.py --help`):
    python sphere.py ALGORITHM

Example:
    python sphere.py map_elites

    # To make numpy and sklearn run single-threaded, set env variables for BLAS
    # and OpenMP:
    OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 python sphere.py map_elites 100

Help:
    python sphere.py --help
"""

from __future__ import annotations

import copy
import json
import time
from datetime import datetime
from pathlib import Path

import fire
import matplotlib.pyplot as plt
import numpy as np
import torch
import tqdm
from loguru import logger as log

from ribs.archives import (
    ArchiveBase,
    CVTArchive,
    DensityArchive,
    DiscountArchive,
    GridArchive,
    ProximityArchive,
)
from ribs.discount_models import MLP, DiscountModelManager
from ribs.emitters import (
    EvolutionStrategyEmitter,
    GaussianEmitter,
    GradientArborescenceEmitter,
    GradientOperatorEmitter,
    IsoLineEmitter,
)
from ribs.schedulers import BanditScheduler, Scheduler
from ribs.visualize import cvt_archive_heatmap, grid_archive_heatmap

CONFIG = {
    ## MAP-Elites and MAP-Elites (line) ##
    "map_elites": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GaussianEmitter,
                "kwargs": {
                    "sigma": 0.5,
                    "batch_size": 540,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "line_map_elites": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": IsoLineEmitter,
                "kwargs": {
                    "iso_sigma": 0.5,
                    "line_sigma": 0.2,
                    "batch_size": 540,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cvt_map_elites": {
        "is_dqd": False,
        "archive": {
            "class": CVTArchive,
            "kwargs": {
                "centroids": 10000,
                "nearest_neighbors": "scipy_kd_tree",
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GaussianEmitter,
                "kwargs": {
                    "sigma": 0.5,
                    "batch_size": 540,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "line_cvt_map_elites": {
        "is_dqd": False,
        "archive": {
            "class": CVTArchive,
            "kwargs": {
                "centroids": 10000,
                "nearest_neighbors": "scipy_kd_tree",
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": IsoLineEmitter,
                "kwargs": {
                    "iso_sigma": 0.5,
                    "line_sigma": 0.2,
                    "batch_size": 540,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## Multi-Emitter MAP-Elites (ME-MAP-Elites) ##
    "me_map_elites": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "obj",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            },
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2rd",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            },
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            },
            {
                "class": IsoLineEmitter,
                "kwargs": {
                    "iso_sigma": 0.01,
                    "line_sigma": 0.1,
                    "batch_size": 36,
                },
                "num_emitters": 15,
            },
        ],
        "scheduler": {
            "class": BanditScheduler,
            "kwargs": {
                "num_active": 15,
                "reselect": "terminated",
            },
        },
    },
    ## CMA-ME ##
    "cma_me_imp": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "selection_rule": "filter",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_imp_mu": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "selection_rule": "mu",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_basic": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_rd": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2rd",
                    "selection_rule": "filter",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_rd_mu": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2rd",
                    "selection_rule": "mu",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_opt": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "obj",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_me_mixed": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2rd",
                    "batch_size": 36,
                },
                "num_emitters": 7,
            },
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "batch_size": 36,
                },
                "num_emitters": 8,
            },
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## DQD algorithms ##
    "og_map_elites": {
        "is_dqd": True,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GradientOperatorEmitter,
                "kwargs": {
                    "sigma": 0.5,
                    "sigma_g": 0.5,
                    "measure_gradients": False,
                    "normalize_grad": False,
                    # Divide by 2 since half of the solutions are used in ask_dqd(),
                    # and the other half are used in ask().
                    "batch_size": 540 // 2,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "omg_mega": {
        "is_dqd": True,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GradientOperatorEmitter,
                "kwargs": {
                    "sigma": 0.0,
                    "sigma_g": 10.0,
                    "measure_gradients": True,
                    "normalize_grad": True,
                    # Divide by 2 since half of the solutions are used in ask_dqd(),
                    # and the other half are used in ask().
                    "batch_size": 540 // 2,
                },
                "num_emitters": 1,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_mega": {
        "is_dqd": True,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GradientArborescenceEmitter,
                "kwargs": {
                    "sigma0": 10.0,
                    "lr": 1.0,
                    "grad_opt": "gradient_ascent",
                    "selection_rule": "mu",
                    # Subtract 1 since one solution is used in ask_dqd() and the
                    # rest are used in ask().
                    "batch_size": 36 - 1,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_mega_adam": {
        "is_dqd": True,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "result_archive": None,
        "emitters": [
            {
                "class": GradientArborescenceEmitter,
                "kwargs": {
                    "sigma0": 10.0,
                    "lr": 0.002,
                    "grad_opt": "adam",
                    "selection_rule": "mu",
                    # Subtract 1 since one solution is used in ask_dqd() and the
                    # rest are used in ask().
                    "batch_size": 36 - 1,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## CMA-MAE and CMA-MAEGA ##
    "cma_mae": {
        "is_dqd": False,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
                "threshold_min": 0,
                "learning_rate": 0.01,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "imp",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "cma_maega": {
        "is_dqd": True,
        "archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
                "threshold_min": 0,
                "learning_rate": 0.01,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": GradientArborescenceEmitter,
                "kwargs": {
                    "sigma0": 10.0,
                    "lr": 1.0,
                    "ranker": "imp",
                    "grad_opt": "gradient_ascent",
                    "restart_rule": "basic",
                    # Subtract 1 since one solution is used in ask_dqd() and the
                    # rest are used in ask().
                    "batch_size": 36 - 1,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## Novelty Search ##
    "ns_cma": {
        # Hyperparameters from DDS paper: https://arxiv.org/abs/2312.11331
        "is_dqd": False,
        "archive": {
            "class": ProximityArchive,
            "kwargs": {
                "k_neighbors": 15,
                "novelty_threshold": 0.037 * 512,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "nov",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "nslc": {
        "is_dqd": False,
        "archive": {
            "class": ProximityArchive,
            "kwargs": {
                "k_neighbors": 15,
                # Note: This is untuned.
                "novelty_threshold": 0.037 * 100,
                "local_competition": True,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "nslc",
                    "selection_rule": "filter",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "nslc_cma_imp": {
        "is_dqd": False,
        "archive": {
            "class": ProximityArchive,
            "kwargs": {
                "k_neighbors": 15,
                # Note: This is untuned.
                "novelty_threshold": 0.037 * 100,
                "local_competition": True,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "2imp",
                    "selection_rule": "filter",
                    "restart_rule": "no_improvement",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## DDS ##
    "dds_kde": {
        # Hyperparameters from DDS paper: https://arxiv.org/abs/2312.11331
        "is_dqd": False,
        # In DDS, the DensityArchive does not store any solutions, so emitters
        # must use the result archive instead.
        "pass_result_archive_to_emitters": True,
        "archive": {
            "class": DensityArchive,
            "kwargs": {
                "buffer_size": 10000,
                "density_method": "kde",
                "bandwidth": 25.6,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 1.5,
                    "ranker": "density",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "dds_kde_sklearn": {
        # Hyperparameters from DDS paper: https://arxiv.org/abs/2312.11331
        "is_dqd": False,
        # In DDS, the DensityArchive does not store any solutions, so emitters
        # must use the result archive instead.
        "pass_result_archive_to_emitters": True,
        "archive": {
            "class": DensityArchive,
            "kwargs": {
                # `density_method` and `sklearn_kwargs` are the only differences
                # from the `dds_kde` config above. `kde_sklearn` tends to be slower
                # but it has more options available.
                "buffer_size": 10000,
                "density_method": "kde_sklearn",
                "bandwidth": 25.6,
                "sklearn_kwargs": {
                    "kernel": "gaussian",
                },
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 1.5,
                    "ranker": "density",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    "dds_cnf": {
        # Hyperparameters from DDS paper: https://arxiv.org/abs/2312.11331
        "is_dqd": False,
        # In DDS, the DensityArchive does not store any solutions, so emitters
        # must use the result archive instead.
        "pass_result_archive_to_emitters": True,
        "archive": {
            "class": DensityArchive,
            "kwargs": {
                "buffer_size": 10000,
                "density_method": "cnf",
                "cnf_train_steps": 5,
                "cnf_batch_size": 32,
                "cnf_kwargs": {
                    "hidden_features": (64, 64),
                },
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 1.5,
                    "ranker": "density",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
    ## DMS ##
    "dms": {
        # Hyperparameters from DMS paper: https://discount-models.github.io/
        "is_dqd": False,
        # In DMS, the DiscountArchive does not store any solutions, so emitters
        # must use the result archive instead.
        "pass_result_archive_to_emitters": True,
        "model": {
            "class": MLP,
            "kwargs": {
                # The `None` is filled in with measure_dim in `create_scheduler`.
                "layer_specs": [[None, 128], [128, 128], [128, 1]],
                "activation": torch.nn.ReLU,
            },
        },
        "optimizer": {
            "class": torch.optim.Adam,
            "kwargs": {
                "lr": 0.001,
                "betas": [0.9, 0.999],
            },
        },
        "discount_model_manager": {
            "class": DiscountModelManager,
            "kwargs": {
                "train_epochs": 5,
                "train_cutoff_loss": 0.05,
                "train_batch_size": 32,
                "normalize_measures": "negative_one_one",
                # Normalizing the discounts speeds up the algorithm because the discount
                # model requires less training to match the targets, since they are
                # between 0-1 instead of 0-100 -- neural networks generally work better
                # with values around -1 to 1.
                "normalize_discount": "zero_one",
            },
        },
        "archive": {
            "class": DiscountArchive,
            "kwargs": {
                "learning_rate": 0.1,
                "threshold_min": 0,
                "init_train_points": 1000,
                "empty_points": 100,
            },
        },
        "result_archive": {
            "class": GridArchive,
            "kwargs": {
                "dims": (100, 100),
            },
        },
        "emitters": [
            {
                "class": EvolutionStrategyEmitter,
                "kwargs": {
                    "sigma0": 0.5,
                    "ranker": "imp",
                    "selection_rule": "mu",
                    "restart_rule": "basic",
                    "batch_size": 36,
                },
                "num_emitters": 15,
            }
        ],
        "scheduler": {
            "class": Scheduler,
            "kwargs": {},
        },
    },
}


def sphere(
    solutions: np.ndarray,
) -> tuple[np.ndarray, np.ndarray, np.ndarray, np.ndarray]:
    """Sphere function evaluation and measures for a batch of solutions.

    Args:
        solutions: (batch_size, solution_dim) batch of solutions.

    Returns:
        objectives: (batch_size,) batch of objectives.
        objective_grads: (batch_size, solution_dim) batch of objective gradients.
        measures: (batch_size, 2) batch of measures.
        measure_grads: (batch_size, 2, solution_dim) batch of measure gradients.
    """
    dim = solutions.shape[1]

    # Shift the Sphere function so that the optimal value is at x_i = 2.048.
    sphere_shift = 5.12 * 0.4

    # Normalize the objective to the range [0, 100] where 100 is optimal.
    best_obj = 0.0
    worst_obj = (-5.12 - sphere_shift) ** 2 * dim
    raw_obj = np.sum(np.square(solutions - sphere_shift), axis=1)
    objectives = (raw_obj - worst_obj) / (best_obj - worst_obj) * 100

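    # Compute gradient of the objective. Note that this omits the constant positive
    # factor 100 / worst_obj introduced by the normalization above.
    objective_grads = -2 * (solutions - sphere_shift)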

    # Calculate measures.
    clipped = solutions.copy()
    clip_mask = (clipped < -5.12) | (clipped > 5.12)
    clipped[clip_mask] = 5.12 / clipped[clip_mask]
    measures = np.concatenate(
        (
            np.sum(clipped[:, : dim // 2], axis=1, keepdims=True),
            np.sum(clipped[:, dim // 2 :], axis=1, keepdims=True),
        ),
        axis=1,
    )

    # Compute gradient of the measures.
    derivatives = np.ones(solutions.shape)
    derivatives[clip_mask] = -5.12 / np.square(solutions[clip_mask])

    mask_0 = np.concatenate((np.ones(dim // 2), np.zeros(dim - dim // 2)))
    mask_1 = np.concatenate((np.zeros(dim // 2), np.ones(dim - dim // 2)))

    d_measure0 = derivatives * mask_0
    d_measure1 = derivatives * mask_1

    measure_grads = np.stack((d_measure0, d_measure1), axis=1)

    return (
        objectives,
        objective_grads,
        measures,
        measure_grads,
    )


def create_scheduler(
    config: dict, algorithm: str, seed: int | None = None
) -> Scheduler:
    """Creates a scheduler based on the algorithm.

    Args:
        config: Configuration dictionary with parameters for the various components.
        algorithm: Name of the algorithm.
        seed: Main seed for the various components.

    Returns:
        ribs.schedulers.Scheduler: A ribs scheduler for running the algorithm.
    """
    # Properties of the Sphere problem.
    solution_dim = config["dim"]
    max_bound = solution_dim / 2 * 5.12
    bounds = [(-max_bound, max_bound), (-max_bound, max_bound)]
    obj_low = 0.0  # There is actually no lower bound, but this is a good value.
    obj_high = 100.0
    initial_sol = np.zeros(solution_dim)

    # Create result archive.
    if config["result_archive"] is None:
        result_archive = None
    else:
        result_archive = config["result_archive"]["class"](
            solution_dim=solution_dim,
            # Note that using ranges here means we assume the result archive is a
            # GridArchive or CVTArchive. This will need to be modified for other result
            # archives.
            ranges=bounds,
            seed=seed,
            **config["result_archive"]["kwargs"],
        )

    # Create archive.
    archive_class = config["archive"]["class"]
    if archive_class == ProximityArchive:
        # Takes `measure_dim` instead of `ranges`.
        archive = archive_class(
            solution_dim=solution_dim,
            measure_dim=len(bounds),
            seed=seed,
            **config["archive"]["kwargs"],
        )
    elif archive_class == DensityArchive:
        device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

        archive = archive_class(
            measure_dim=len(bounds),
            seed=seed,
            # The device is simply ignored if we are running dds_kde.
            cnf_device=device,
            **config["archive"]["kwargs"],
        )
    elif archive_class == DiscountArchive:
        device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

        # Fill in measure_dim in the config.
        config["model"]["kwargs"]["layer_specs"][0][0] = len(bounds)

        model = config["model"]["class"](**config["model"]["kwargs"])
        model.to(device)

        optimizer = config["optimizer"]["class"](
            params=model.parameters(),
            **config["optimizer"]["kwargs"],
        )
        discount_model_manager = config["discount_model_manager"]["class"](
            model=model,
            optimizer=optimizer,
            device=device,
            measures_low=[b[0] for b in bounds],
            measures_high=[b[1] for b in bounds],
            discount_low=obj_low,
            discount_high=obj_high,
            **config["discount_model_manager"]["kwargs"],
        )
        archive = archive_class(
            solution_dim=solution_dim,
            measure_dim=len(bounds),
            discount_model_manager=discount_model_manager,
            result_archive=result_archive,
            seed=seed,
            **config["archive"]["kwargs"],
        )
    else:
        archive = archive_class(
            solution_dim=solution_dim,
            ranges=bounds,
            seed=seed,
            **config["archive"]["kwargs"],
        )

    # Usually, emitters take in the archive. However, it may sometimes be necessary to
    # take in the result_archive instead, such as in DMS, where the DiscountArchive
    # does not store any solutions.
    archive_for_emitter = (
        result_archive if config.get("pass_result_archive_to_emitters") else archive
    )

    # Create emitters. Each emitter needs a different seed so that they do not all do
    # the same thing, hence we create a SeedSequence here and spawn one child per
    # emitter. The SeedSequence may be seeded with None or with a user-provided seed.
    seed_sequence = np.random.SeedSequence(seed)
    emitters = []
    for e in config["emitters"]:
        emitter_class = e["class"]
        emitters += [
            emitter_class(
                archive_for_emitter,
                x0=initial_sol,
                **e["kwargs"],
                seed=s,
            )
            for s in seed_sequence.spawn(e["num_emitters"])
        ]

    # Create Scheduler
    scheduler = config["scheduler"]["class"](
        archive,
        emitters,
        result_archive,
        **config["scheduler"]["kwargs"],
    )

    log.info(
        "Created {} for {} using solution dim {} and {} emitters.",
        scheduler.__class__.__name__,
        algorithm,
        solution_dim,
        len(emitters),
    )
    return scheduler


def save_heatmap(archive: ArchiveBase, heatmap_path: str | Path) -> None:
    """Saves a heatmap of the archive to the given path.

    Args:
        archive: The archive to save.
        heatmap_path: Image path for the heatmap.
    """
    if isinstance(archive, GridArchive):
        plt.figure(figsize=(8, 6))
        grid_archive_heatmap(archive, vmin=0, vmax=100)
        plt.tight_layout()
        plt.savefig(heatmap_path)
    elif isinstance(archive, CVTArchive):
        plt.figure(figsize=(16, 12))
        cvt_archive_heatmap(archive, vmin=0, vmax=100)
        plt.tight_layout()
        plt.savefig(heatmap_path)
    else:
        raise NotImplementedError(
            "This script currently does not plot heatmaps for this archive."
        )
    plt.close(plt.gcf())


def sphere_main(
    algorithm: str,
    *,
    dim: int = 100,
    itrs: int = 10000,
    grid_dims: tuple[int, int] | None = None,
    learning_rate: float | None = None,
    es: str | None = None,
    outdir: str | None = None,
    log_freq: int = 250,
    seed: int | None = None,
    verbose: bool = True,
    save_archive: bool = True,
) -> dict[str, float]:
    """Demo on the Sphere function.

    Args:
        algorithm: Name of the algorithm.
        dim: Dimensionality of the sphere function.
        itrs: Iterations to run.
        grid_dims: Grid dimensions for GridArchive.
        learning_rate: The archive learning rate.
        es: If passed, this will set the ES for all EvolutionStrategyEmitter instances.
        outdir: Directory to save output. If not provided, it will be automatically set
            to `logs/sphere/{algorithm}_{dim}/YYYY-MM-DD_HH-MM-SS_seed-{seed}`, with
            `_{es}` appended to `{algorithm}_{dim}` if `es` is passed.
        log_freq: Number of iterations to wait before recording metrics and saving
            heatmap.
        seed: Seed for the algorithm. By default, there is no seed.
        verbose: Whether to write outputs to the terminal, such as the progress bar and
            log messages. Otherwise, log messages are just written to a file.
        save_archive: Whether to save the archive as a CSV file. This can require a lot
            of time and is often not very useful.

    Returns:
        Dict with the final "QD Score" and "Coverage" of the result archive.
    """
    config = copy.deepcopy(CONFIG[algorithm])

    # Add params that are not in the config.
    config["dim"] = dim
    config["itrs"] = itrs

    # Add params that are in the config by default but may be passed in.
    if grid_dims is not None:
        if config["archive"]["class"] == GridArchive:
            config["archive"]["kwargs"]["dims"] = grid_dims
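        # Some algorithms have no separate result archive (result_archive is None).
        result_archive_config = config["result_archive"]
        if (
            result_archive_config is not None
            and result_archive_config["class"] == GridArchive
        ):
            result_archive_config["kwargs"]["dims"] = grid_dims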
    if learning_rate is not None:
        config["archive"]["kwargs"]["learning_rate"] = learning_rate
    if es is not None:
        # Set ES for all EvolutionStrategyEmitter.
        for e in config["emitters"]:
            if e["class"] == EvolutionStrategyEmitter:
                e["kwargs"]["es"] = es

    name = f"{algorithm}_{config['dim']}"
    if es is not None:
        name += f"_{es}"

    # Initialize output directory.
    outdir = (
        (
            Path("logs")
            / Path(__file__).stem
            / name
            / datetime.now().strftime(f"%Y-%m-%d_%H-%M-%S_seed-{seed}")
        )
        if outdir is None
        else Path(outdir)
    )
    outdir.mkdir(parents=True, exist_ok=False)

    # Initialize loggers --
    # https://loguru.readthedocs.io/en/stable/resources/recipes.html#interoperability-with-tqdm-iterations
    log.remove()
    if verbose:
        log.add(lambda msg: tqdm.tqdm.write(msg, end=""), colorize=True)
    log.add(outdir / "out.log")  # Save logs in outdir.
    log.info("Saving outputs to: {}", outdir)

    scheduler = create_scheduler(config, algorithm, seed=seed)
    result_archive = scheduler.result_archive
    is_dqd = config["is_dqd"]
    has_discount_model = config["archive"]["class"] == DiscountArchive
    itrs = config["itrs"]
    metrics = {
        "QD Score": {
            "x": [0],
            "y": [result_archive.stats.qd_score],
        },
        "Coverage": {
            "x": [0],
            "y": [result_archive.stats.coverage],
        },
    }

    non_logging_time = 0.0
    save_heatmap(result_archive, outdir / f"heatmap_{0:05d}.png")

    if has_discount_model:
        scheduler.archive.init_discount_model()

    for itr in tqdm.trange(1, itrs + 1) if verbose else range(1, itrs + 1):
        itr_start = time.time()

        if is_dqd:
            solutions = scheduler.ask_dqd()
            (objectives, objective_grads, measures, measure_grads) = sphere(solutions)
            objective_grads = np.expand_dims(objective_grads, axis=1)
            jacobians = np.concatenate((objective_grads, measure_grads), axis=1)
            scheduler.tell_dqd(objectives, measures, jacobians)

        solutions = scheduler.ask()
        objectives, _, measures, _ = sphere(solutions)
        scheduler.tell(objectives, measures)
        non_logging_time += time.time() - itr_start

        if has_discount_model:
            scheduler.archive.train_discount_model()

        # Metrics.
        metrics["QD Score"]["x"].append(itr)
        metrics["QD Score"]["y"].append(result_archive.stats.qd_score)
        metrics["Coverage"]["x"].append(itr)
        metrics["Coverage"]["y"].append(result_archive.stats.coverage)

        # Logging.
        if itr % log_freq == 0 or itr == itrs:
            log.info(
                "Itr {} | Coverage: {:.3%} QD Score: {:.3f}",
                itr,
                metrics["Coverage"]["y"][-1],
                metrics["QD Score"]["y"][-1],
            )
            save_heatmap(result_archive, outdir / f"heatmap_{itr:05d}.png")

    # Save archive as a CSV.
    if save_archive:
        result_archive.data(return_type="pandas").to_csv(outdir / "archive.csv")

    # Plot metrics.
    log.info("Algorithm Time (Excludes Logging and Setup): {}s", non_logging_time)
    for metric, values in metrics.items():
        plt.plot(values["x"], values["y"])
        plt.title(metric)
        plt.xlabel("Iteration")
        plt.savefig(str(outdir / f"{metric.lower().replace(' ', '_')}.png"))
        plt.clf()

    # Convert metrics to Python scalars by calling .item(), since each stats value is a
    # 0-D array by default, and JSON cannot serialize 0-D arrays.
    for metric in metrics:  # pylint: disable = consider-using-dict-items
        metrics[metric]["y"] = [
            m if isinstance(m, (int, float)) else m.item() for m in metrics[metric]["y"]
        ]

    # Save metrics to JSON.
    with (outdir / "metrics.json").open("w") as file:
        json.dump(metrics, file, indent=2)

    # Return a summary of metrics. Note that Fire automatically prints these to stdout.
    return {
        "QD Score": metrics["QD Score"]["y"][-1],
        "Coverage": metrics["Coverage"]["y"][-1],
    }


if __name__ == "__main__":
    fire.Fire(sphere_main)
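
As a sanity check on the normalization and clipping described in the comments of `sphere` above, the following minimal sketch re-implements only the objective and measure computations in standalone NumPy and evaluates them at three hand-picked points. The helper name `sphere_objective_and_measures` and the choice of a 4-dimensional solution space are assumptions made for illustration; they are not part of sphere.py.

```python
import numpy as np


def sphere_objective_and_measures(solutions):
    """Standalone copy of the objective/measure math in sphere() (no gradients)."""
    dim = solutions.shape[1]
    sphere_shift = 5.12 * 0.4  # The optimum is at x_i = 2.048.

    # Normalize so that the optimum maps to 100 and x_i = -5.12 maps to 0.
    worst_obj = (-5.12 - sphere_shift) ** 2 * dim
    raw_obj = np.sum(np.square(solutions - sphere_shift), axis=1)
    objectives = (raw_obj - worst_obj) / (0.0 - worst_obj) * 100

    # Values outside [-5.12, 5.12] are replaced with 5.12 / x_i.
    clipped = solutions.copy()
    clip_mask = (clipped < -5.12) | (clipped > 5.12)
    clipped[clip_mask] = 5.12 / clipped[clip_mask]

    # Each measure sums one half of the (clipped) solution.
    measures = np.stack(
        (clipped[:, : dim // 2].sum(axis=1), clipped[:, dim // 2 :].sum(axis=1)),
        axis=1,
    )
    return objectives, measures


dim = 4  # Small hypothetical dimension so the numbers are easy to check by hand.
points = np.array(
    [
        np.full(dim, 5.12 * 0.4),  # Optimum: objective of 100.
        np.full(dim, -5.12),  # Worst point inside the bounds: objective of 0.
        np.full(dim, 10.24),  # Outside the bounds: negative objective.
    ]
)
objectives, measures = sphere_objective_and_measures(points)
print(objectives)
print(measures)
```

The third point shows that objectives drop below 0 once solutions leave [-5.12, 5.12], which is consistent with the comment on `obj_low` in `create_scheduler` that there is no true lower bound. Its measures are each 2 * (5.12 / 10.24) = 1.0 because every coordinate is clipped.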

sphere_multirun.py¶

This script calls sphere.py to run each algorithm for multiple trials and produce the benchmark results at the top of this page.

r"""Runs multiple trials of algorithms in sphere.py and computes statistics.

Install the following dependencies before running this example:
    pip install ribs[visualize] torch tqdm fire loguru tabulate

Usage:
    # To run all algorithms at once. Use OPENBLAS_NUM_THREADS and OMP_NUM_THREADS to
    # make sure each run only takes up a single thread. Otherwise, they would use many
    # threads and create a lot of contention.
    OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 python sphere_multirun.py \
        --algos=ALL --trials=20 --max-workers=40

    # To run all algorithms except dds_cnf, which takes a very long time to run.
    OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 python sphere_multirun.py \
        --algos=ALMOST_ALL --trials=20 --max-workers=40

    # To run just a single algorithm.
    OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 python sphere_multirun.py \
        --algos=cma_mae --trials=20 --max-workers=40

    # To run two algorithms for 5000 itrs.
    OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 python sphere_multirun.py \
        --algos=cma_mae,dms --trials=20 --max-workers=40 --itrs=5000
"""

import collections
import concurrent.futures
from datetime import datetime
from pathlib import Path

import fire
import numpy as np
import pandas as pd
import sphere
import tqdm
from loguru import logger as log


def aggregate_results(results: dict[str, list], outdir: Path) -> None:
    """Aggregates and saves the results from the runs."""
    log.info("Recording results")

    df = pd.DataFrame(results).sort_values("Algorithm")
    df.to_csv(outdir / "results.csv")

    metrics = df[["Algorithm", "QD Score", "Coverage"]].groupby("Algorithm")
    metrics_mean = metrics.mean()
    metrics_std = metrics.std()

    count = df[["Algorithm", "Trial"]].groupby("Algorithm").count()

    summary_df = pd.DataFrame(index=metrics_mean.index)
    summary_df["QD Score"] = [
        f"{mean:,.2f} ± {std:,.2f}"
        for mean, std in zip(
            metrics_mean["QD Score"], metrics_std["QD Score"], strict=True
        )
    ]
    summary_df["Coverage"] = [
        f"{mean * 100:,.2f} ± {std:,.2%}"
        for mean, std in zip(
            metrics_mean["Coverage"], metrics_std["Coverage"], strict=True
        )
    ]
    summary_df["Trials"] = count["Trial"]

    summary_df.to_csv(outdir / "results_summary.csv")
    summary_df.to_markdown(outdir / "results_summary.md", stralign="right")


def main(
    algos: str | list[str],
    trials: int,
    itrs: int = 10000,
    outdir: str | None = None,
    seed: int | None = None,
    max_workers: int | None = None,
) -> None:
    """Runs multiple trials of algorithms in sphere.py and computes statistics.

    Args:
        algos: Algorithms to evaluate. On the command line, this can be passed as a
            single algorithm name, e.g., "cma_mae". It can also be a comma-separated
            list, e.g., "cma_mae,dms". Finally, it can be "ALL", to indicate running all
            algorithms in sphere.py, or "ALMOST_ALL", to run all except dds_cnf.
        trials: Number of trials to run each algorithm.
        itrs: Iterations to run the algorithm.
        outdir: Directory to save output. If not provided, it will be automatically set
            to `logs/sphere_multirun/YYYY-MM-DD_HH-MM-SS_seed-{seed}`.
        seed: Base seed for the trials. By default, there is no seed.
        max_workers: Maximum number of workers when using ProcessPoolExecutor to run the
            trials.
    """
    # Initialize output directory for the overall run.
    outdir = (
        (
            Path("logs")
            / Path(__file__).stem
            / datetime.now().strftime(f"%Y-%m-%d_%H-%M-%S_seed-{seed}")
        )
        if outdir is None
        else Path(outdir)
    )
    outdir.mkdir(parents=True, exist_ok=False)

    log.remove()
    log.add(lambda msg: tqdm.tqdm.write(msg, end=""), colorize=True)
    log.add(outdir / "out.log")  # Save logs in outdir.
    log.info("Saving overall outputs to: {}", outdir)

    rng = np.random.default_rng(seed)
    results = collections.defaultdict(list)

    if isinstance(algos, str):
        if algos == "ALL":  # Run all available algos.
            algo_list = list(sphere.CONFIG)
        elif algos == "ALMOST_ALL":
            # Exclude dds_cnf since it takes a very long time to run.
            algo_list = list(sphere.CONFIG)
            algo_list.remove("dds_cnf")
        else:
            algo_list = [algos]
    else:
        algo_list = algos

    log.info("Evaluating {} algorithm(s): {}", len(algo_list), algo_list)

    with concurrent.futures.ProcessPoolExecutor(max_workers=max_workers) as executor:
        # Submit all jobs simultaneously.
        future_to_info = {}
        for algo in algo_list:
            seeds = rng.choice(10_000, size=trials)
            for trial, trial_seed in enumerate(seeds):
                f = executor.submit(
                    sphere.sphere_main,
                    algorithm=algo,
                    outdir=outdir / algo / f"{trial:02d}_seed-{trial_seed}",
                    seed=trial_seed,
                    itrs=itrs,
                    verbose=False,
                    save_archive=False,
                )
                future_to_info[f] = (algo, trial, trial_seed)

        # Collect all results.
        for i, f in enumerate(
            concurrent.futures.as_completed(future_to_info.keys()), start=1
        ):
            algo, trial, trial_seed = future_to_info[f]
            try:
                res = f.result()
                results["Algorithm"].append(algo)
                results["Trial"].append(trial)
                results["Seed"].append(trial_seed)
                for metric, val in res.items():
                    results[metric].append(val)
                log.info(
                    "{}/{} (SUCCESS): {} trial {}, seed {}",
                    i,
                    len(future_to_info),
                    algo,
                    trial,
                    trial_seed,
                )
            except Exception as e:  # pylint: disable = broad-exception-caught
                # Any uncaught exception is costly because it kills the whole run.
                log.info(
                    "{}/{} (FAILURE): {} trial {}, seed {}\n{}",
                    i,
                    len(future_to_info),
                    algo,
                    trial,
                    trial_seed,
                    e,
                )

    aggregate_results(results, outdir)
    log.info("Done")


if __name__ == "__main__":
    fire.Fire(main)
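
The "mean ± std" strings that `aggregate_results` writes to the summary files can be reproduced with the standard library alone. The sketch below is a minimal illustration with made-up trial values; the helper `summarize` and the numbers are hypothetical and are not benchmark results. It relies on the fact that pandas computes a sample standard deviation (`ddof=1`) by default, which is also what `statistics.stdev` computes.

```python
import statistics


def summarize(qd_scores, coverages):
    """Formats mean ± std using the same format specs as aggregate_results."""
    qd_mean, qd_std = statistics.mean(qd_scores), statistics.stdev(qd_scores)
    cov_mean, cov_std = statistics.mean(coverages), statistics.stdev(coverages)
    return {
        "QD Score": f"{qd_mean:,.2f} ± {qd_std:,.2f}",
        # Coverage is a fraction in [0, 1]. The mean is scaled by 100 by hand, while
        # the `%` format spec scales the std by 100 and appends the percent sign.
        "Coverage": f"{cov_mean * 100:,.2f} ± {cov_std:,.2%}",
    }


# Three hypothetical trials of one algorithm.
summary = summarize([1000.0, 1002.0, 1004.0], [0.50, 0.52, 0.54])
print(summary["QD Score"])
print(summary["Coverage"])
```

With a single trial, the sample standard deviation is undefined: `statistics.stdev` raises `StatisticsError`, and pandas reports NaN instead.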