.. _synthesis-objects:

Synthesis object design
***********************

The following describes how synthesis objects are structured. This is probably
most useful if you are creating a new synthesis method that you would like to
include in or be compliant with ``plenoptic``, rather than using existing ones.

The synthesis methods included in ``plenoptic`` generate one or more novel
images based on the output of a model. These images can be used to better
understand the model or as stimuli for an experiment comparing the model against
another system. Beyond this rather vague description, however, there is a good
deal of variability. We use inheritance in order to try and keep synthesis
methods as similar as possible, to facilitate user interaction with them (and
testing), but we want to avoid forcing too much similarity.

In the following description:

** *must* connotes a requirement; any synthesis object not meeting this property
  will not be merged and is not considered "plenoptic-compliant".
* *should* connotes a suggestion; a compelling reason is required if the
  property is not met.
* *may* connotes an option; these properties may make things easier (for
  developers or users), but are completely optional.

All synthesis methods
=====================

To that end, all synthesis methods must inherit the
:class:`plenoptic.synthesize.synthesis.Synthesis` class. This requires the synthesis method
to have a ``synthesize()`` method, and provides helper functions for ``save()``,
``load()``, and ``to()``, which must be used when implementing them.
Furthermore:

* the initialization method (``__init__()``) must accept any images as its first
  input(s). If only a single image is accepted, it must be named ``image``. If
  more than one, their names must be of the form ``image_X``, replacing ``X``
  with a more descriptive string. These must all have type ``torch.Tensor`` and
  they must be validated with :func:`plenoptic.tools.validate.validate_input`.
  This should be stored in an attribute with the same name as the argument.
* the initialization method's next argument(s) must be any models or metrics
  that the synthesis will be based on. Similarly, if a single model / metric is
  accepted, they must be named ``model`` / ``metric``. If more than one, their
  names should be of the form ``X_model`` / ``X_metric``, replacing ``X`` with a
  more descriptive string. These must be validated with
  :func:`plenoptic.tools.validate.validate_model` /
  :func:`plenoptic.tools.validate.validate_metric`. This should be stored in an
  attribute with the same name as the argument.
* any other arguments to the initialization method may follow.
* the object must be able to work on GPU and CPU. Users must be able to use
  the GPU either by initializing the synthesis object with tensors or models
  already on the GPU or by calling ``.to()``. The easiest way to do this is to
  use ``torch.rand_like()`` and analogous methods, and explicitly calling
  ``.to()`` on any other newly-created tensors.
* ideally, the same for different float and complex data types (e.g., support
  both ``torch.float32`` and ``torch.float64``), though this is not a strict
  requirement if there's a good reason.
* if ``synthesize()`` operates in an iterative fashion, it must accept a
  ``max_iter: int`` argument to specify how long to run synthesis for and a
  ``stop_criterion: float`` argument to allow for early termination if some
  convergence is reached. *What* exactly is being checked for convergence (e.g.,
  change in loss, change in pixel values) may vary, but it must be clarified
  in the docstring. A ``stop_iters_to_check: int`` argument may also be
  included, which specifies how many iterations ago to check. If it is not
  included, the number of iterations must be clarified in docstring.
* additionally, if synthesis is iterative, `tqdm.auto.tqdm
  <https://tqdm.github.io/docs/shortcuts/#tqdmauto>`_ must be used as a progress
  bar, initialized with ``pbar = tqdm(range(max_iter))``, which should present
  information using ``pbar.set_postfix()`` (such the loss or whatever else is
  checked for convergence, as discussed above).
* ``synthesize()`` must not return anything. The outputs of synthesis must
  be stored as attributes of the object. The number of large attributes should
  be minimized in order to reduce overall size in memory.
* the synthesis output must be stored as an attribute with the same name as
  the class (e.g., ``Metamer.metamer``).
* any attribute or method that the user does not need should be hidden (i.e.,
  start with ``_``).
* consider using the ``@property`` decorator to make important attributes
  write-only or differentiate between the public and private views. For example,
  the optimized attribute of the :class:`plenoptic.synthesize.geodesic.Geodesic`
  class is named ``_geodesic``, but the ``geodesic`` attribute returns this
  tensor concatenated with two (unchanging) endpoints, as this is what the user
  will most often want to interact with.

The above are the only requirements that all synthesis methods must meet.

Helper / display functions
==========================

It may also be useful to include some functions for investigating the status or
output(s) of synthesis. As a general rule, if a function will be called during
synthesis (e.g., to compute a loss value), it should be a method of the object.
If it is only called afterwards (e.g., to display the synthesis outputs in a
useful way), it should be included as a function in the same file (see
:func:`plenoptic.synthesize.metamer.display_metamer` for an example).

Functions that show images or videos should be called ``display_X``, whereas
those that show numbers as a scatter plot, line plot, etc. should be called
``plot_X``. These must be axes-level matplotlib functions: they must accept
an axis as an optional argument named ``ax``, which will contain the plot. If no
``ax`` is supplied, ``matplotlib.pyplot.gca()`` must be used to create / grab
the axis. If a multi-axis figure is called for (e.g., to display the synthesis
output and plot the loss), a function named ``plot_synthesis_status()`` should
be created. This must have an optional ``fig`` argument, creating a figure if
none is supplied. See :func:`plenoptic.synthesize.metamer.plot_synthesis_status`
for an example. If possible, this plot should be able to be animated to show
progress over time. See
:func:`plenoptic.synthesize.metamer.plot_synthesis_status` for an example.

See our :doc:`/tutorials/advanced/Display` notebook for description and examples of the
included plotting and display code.

Optimized synthesis
===================

Many synthesis methods will use an optimizer to generate their outputs. If the
method makes use of a ``torch.optim.Optimizer`` object, it must inherit
:class:`plenoptic.synthesize.synthesis.OptimizedSynthesis` class (this is a
subclass of:class:`plenoptic.synthesize.synthesis.Synthesis`, so the above all
still applies).

Currently, the following are required (if not all of these are applicable to new
methods, we may modify ``OptimizedSynthesis``):

* the points about iterative synthesis described above all hold:
  ``synthesize()`` must accept ``max_iter``, ``stop_criterion``, may accept
  ``stop_iters_to_check``, and must use `tqdm.auto.tqdm
  <https://tqdm.github.io/docs/shortcuts/#tqdmauto>`_.
* the object must have an ``objective_function()`` method, which returns a
  measure of "how bad" the current synthesis output is. Optimization is
  minimizing this value.
* the object must have a ``_check_convergence()`` method, which is used (along
  with ``stop_criterion`` and, optionally, ``stop_iters_to_check``) to determine
  if synthesis has converged.
* the object must have an ``_initialize()`` method, which initializes the
  synthesis output (e.g., with an appropriately-shaped sample of noise) and is
  called during the object's initilization.
* the initialization method may accept some argument to affect this
  initialization, which should be named ``initial_X`` (replacing ``X`` as
  appropriate). For example, this could be another image to use for
  initialization (``initial_image``) or some property of noise used to generate
  an initial image (``initial_noise``).
* the initialization method must accept ``range_penalty_lambda: float`` and
  ``allowed_range: Tuple[float, float]`` arguments, which should be used with
  :func:`plenoptic.tools.optim.penalize_range` to constrain the range of
  synthesis output.
* the ``synthesize()`` method must accept an optional ``optimizer:
  torch.optim.Optimizer`` argument, which defaults to ``None``.
  ``OptimizedSynthesis._initialize_optimizer()`` is a helper function that
  should be called to set this up: it creates a default optimizer if the user
  does not specify one and double-checks that the optimizer parameter is the
  correct object if the user did.
* during synthesis, the object should update the ``_losses``,
  ``_gradient_norm``, and ``_pixel_change_norm`` attributes on each iteration.
* the object may have a ``_closure()`` method, which performs the gradient
  calculation. This (when passed to ``optimizer.step()`` during the synthesis
  loop in ``synthesize()``) enables optimization algorithms that perform several
  evaluations of the gradient before taking a step (e.g., second-order methods).
  See ``OptimizedSynthesis._closure()`` for the simplest version of this.
* the ``synthesize()`` method should accept a ``store_progress`` argument, which
  optionally stores additional information over iteration, such as the synthesis
  output-in-progress. ``OptimizedSynthesis`` has a setter method for this
  attribute, which will enable things are correct. This argument can be an
  integer (in which case, the attributes are updated every ``store_progress``
  iterations), ``True`` (same behavior as ``1``), or ``False`` (no updating of
  attributes). This should probably be done in a method named ``_store()``.
* the ``synthesize()`` method should be callable multiple times with the same
  object, in which case progress is resumed. On all subsequent calls,
  ``optimizer`` must be ``None`` (this is checked by
  ``OptimizedSynthesis._initialize_optimizer()``) and ``store_progress``,
  ``stop_criterion``, and ``stop_iters_to_check`` must have the same values.

How to order methods
====================

Python doesn't care how you order any of the methods or properties of a class,
but doing so in a consistent manner will make reading the code easier, so try to
follow these guidelines:

* The caller should (almost always) be above the callee and related concepts
  should be close together.
* ``__init__()`` should be first, followed by any methods called within it. This
  will probably include ``_initialize()``, for those classes that have it.
* After all those initialization-related methods, ``synthesize()`` should come
  next. Again, this should be followed by most of the the methods called within
  it, ordered roughly by importance. Thus, the first methods should probably be
  ``objective_function()`` and ``_optimizer_step()``, followed by
  ``_check_convergence()``. What shouldn't be included in this section are
  helper methods that aren't scientifically interesting (e.g.,
  ``_initialize_optimizer()``, ``_store()``).
* Next, any other content-related methods, such as helper methods that perform
  useful computations that are not called by ``__init__()`` or ``synthesize()``
  (e.g., :class:`plenoptic.synthesize.geodesic.Geodesic.calculate_jerkiness`).
* Next, the helper functions we ignored from earlier, such as
  ``_initialize_optimizer()`` and ``_store()``.
* Next, ``save()``, ``load()``, ``to()``.
* Finally, all the properties.