Run this notebook yourself!

Download the executed notebook: Metamer.ipynb!

Run it in your browser: Metamer.ipynb!

Metamers#

Metamers are an old concept in the study of perception, dating back to the color-matching experiments in the 18th century that first provided support for the existence of three cone types (though it would be another two hundred years before anatomical evidence was found). These color-matching evidences demonstrated that, by combining three colored lights in different proportions, you could generate a color that humans perceived as identical to any other color, even though their physical spectra were different. Perceptual metamers, then, refer to two images that are physically different but perceived as identical.

For the purposes of plenoptic, wherever we say “metamers”, we mean “model metamers”: images that are physically different but have identical representation for a given model, i.e., that the model “perceives” as identical. Like all synthesis methods, it is model-specific, and one potential experiment is to determine if model metamers can serve as human percpetual metamers, which provides support for the model as an accurate representation of the human visual system.

In the Lab for Computational Vision, this goes back to Portilla and Simoncelli, 2001, where the authors created a parametric model of textures and synthesized novel images as a way of demonstrating the cases where the model succeeded and failed. In that paper, the model did purport to have anything to do with human vision, and they did not refer to their images as “metamers”, that term did not appear until Freeman and Simoncelli, 2011, where the authors pool the Portilla and Simoncelli texture statistics in windows laid out in a log-polar fashion to generate putative human perceptual metamers.

This notebook demonstrates how to use the Metamer class to generate model metamers.

import imageio
import matplotlib.pyplot as plt
import numpy as np
import torch

import plenoptic as po

# this notebook runs just about as fast with GPU and CPU
DEVICE = torch.device("cpu")

# so that relative sizes of axes created by po.plot.imshow and others look right
plt.rcParams["figure.dpi"] = 72
# Animation-related settings
plt.rcParams["animation.html"] = "html5"
# use single-threaded ffmpeg for animation writer
plt.rcParams["animation.writer"] = "ffmpeg"
plt.rcParams["animation.ffmpeg_args"] = ["-threads", "1"]


%load_ext autoreload
%autoreload 2

Basic usage#

As with all our synthesis methods, we start by grabbing a target image and initializing our model.

img = po.data.curie().to(DEVICE)
po.plot.imshow(img);

../../_images/f7b6ddda06219dfa6c372eb3825d599269b9ee310682acbdb3e3221a048c1af0.png

For the model, we’ll use a simple On-Off model of visual neurons

model = po.models.OnOff((7, 7))
model.to(DEVICE)
model.eval()
po.remove_grad(model)

When this model is called on the image, it returns a 4d tensor. This representation is what the Metamer class will try to match.

print(model(img))

tensor([[[[0.6911, 0.6918, 0.6930,  ..., 0.6935, 0.6935, 0.6937],
          [0.6919, 0.6925, 0.6934,  ..., 0.6934, 0.6936, 0.6938],
          [0.6935, 0.6936, 0.6937,  ..., 0.6933, 0.6939, 0.6941],
          ...,
          [0.6928, 0.6928, 0.6933,  ..., 0.6929, 0.6938, 0.6941],
          [0.6942, 0.6941, 0.6939,  ..., 0.6930, 0.6935, 0.6937],
          [0.6947, 0.6946, 0.6941,  ..., 0.6929, 0.6932, 0.6933]],

         [[0.6952, 0.6945, 0.6933,  ..., 0.6928, 0.6928, 0.6926],
          [0.6944, 0.6938, 0.6929,  ..., 0.6929, 0.6927, 0.6925],
          [0.6928, 0.6927, 0.6925,  ..., 0.6930, 0.6924, 0.6922],
          ...,
          [0.6935, 0.6935, 0.6930,  ..., 0.6934, 0.6925, 0.6922],
          [0.6920, 0.6922, 0.6924,  ..., 0.6933, 0.6928, 0.6926],
          [0.6916, 0.6917, 0.6922,  ..., 0.6934, 0.6931, 0.6930]]]])

In order to visualize this, we can use the helper function plot_representation (see Display notebook for more details here). In this case, the representation looks like two images, and so we plot it as such:

po.plot.plot_representation(data=model(img), figsize=(11, 5));

../../_images/9f6741a501d584901e14c24098ea3c108bd5a634d4c600da0a20b459636a6606.png

At the simplest, to use Metamer, simply initialize it with the target image and the model, then call synthesize. By setting store_progress=True, we update a variety of attributes (all of which start with saved_) on each iteration so we can later examine, for example, the synthesized image over time.

met = po.Metamer(img, model)
met.synthesize(store_progress=True, max_iter=50)

We then call the synthesis_status function to see how things are doing. The image on the left shows the metamer at this moment, while the center plot shows the loss over time, with the red dot pointing out the current loss, and the rightmost plot shows the representation error. If a model has a plot_representation method, this plot can be more informative, but this plot can always be created.

# model response error plot has two subplots, so we increase its relative width
po.plot.synthesis_status(met, width_ratios={"metamer_representation_error": 2});

../../_images/9da512289733d53985b56660f63f0f22fcbeb3090a1abbbb7eb7a2ed1003508e.png

synthesis_status is a helper function to show all of this at once, but the individual components can be created separately:

fig, axes = plt.subplots(1, 3, figsize=(25, 5), gridspec_kw={"width_ratios": [1, 1, 2]})
po.plot.synthesis_imshow(met, ax=axes[0])
po.plot.synthesis_loss(met, ax=axes[1])
po.plot.metamer_representation_error(met, ax=axes[2]);

../../_images/ad0b100cf7c4dd07e6c737e60fc137959deb95546eebe91dbedc854c1d4c31a1.png

The loss is decreasing, but clearly there’s much more to go. So let’s continue.

You can resume synthesis as long as you pass the same argument to store_progress on each run.

Everything that stores the progress of the optimization (losses, saved_metamer) will persist between calls and so potentially get very large.

met.synthesize(store_progress=True, max_iter=100)

/home/jenkins/agent/workspace/CCN_neurorse_plenoptic_main/lib/python3.12/site-packages/plenoptic/_synthesize/metamer.py:374: UserWarning: Loss has converged, stopping synthesis
  warnings.warn("Loss has converged, stopping synthesis")

Let’s examine the status again. But instead of looking at the latest iteration, let’s look at 10 from the end:

po.plot.synthesis_status(
    met, iteration=-10, width_ratios={"metamer_representation_error": 2}
);

../../_images/4e578d257619792bd6f960d5e21aabce68150df9c4e946ae5491023214c61db0.png

Since we have the ability to select which iteration to plot (as long as we’ve been storing the information), we can create an animation showing the synthesis over time. This FuncAnimation object can either be viewed in the notebook (note that this requires the matplotlib configuration options in the first cell of this notebook) or saved as some video format (e.g., anim.save('test.mp4').

anim = po.plot.synthesis_animate(met, width_ratios={"metamer_representation_error": 2})
anim

Generally speaking, synthesis will run until you hit max_iter iterations. However, synthesis can also stop if it looks like the loss has stopped changing. This behavior is controlled with the loss_thresh and loss_change_iter arguments: if the loss has changed by less than loss_thresh over the past loss_change_iter iterations, we stop synthesis.

Moving between devices#

Metamer has a to method for moving the object between devices or dtypes. Call it as you would call any torch.Tensor.to and it will move over the necessary attributes.

Saving and loading#

Finally, you probably want to save the results of your synthesis. As mentioned above, you can save the synthesis animation, and all of the plots return regular matplotlib Figures and can be manipulated as expected. The synthesized image itself is a tensor and can be detached, converted to a numpy array, and saved (either as an image or array) as you’d expect. to_numpy is a convenience function we provide for operations like this, which detaches the tensor, sends it to the CPU, and converts it to a numpy array with appropriate dtype. Note that it doesn’t squeeze the tensor, so you may want to do that yourself.

met_image = po.to_numpy(met.metamer).squeeze()
# convert from array to int8 for saving as an image
print(f"Metamer range: ({met_image.min()}, {met_image.max()})")
met_image = po.convert_float_to_int(np.clip(met_image, 0, 1))
imageio.imwrite("test.png", met_image)

Metamer range: (-0.002226863522082567, 1.000531792640686)

The metamer lies slightly outside the range [0, 1], so we clip before saving as an image. Metamer’s objective function has a quadratic penalty on the synthesized image’s range, and the weight on this penalty can be adjusted by changing the value of penalty_lambda at initialization.

You can also save the entire Metamer object with its save method. This can be fairly large (depending on how many iterations you ran it for and how frequently you stored progress), but stores all information:

met.save("test.pt")

You can then load it back in using the method load. Note that you need to first instantiate the Metamer object and then call load — it must be instantiated with the same image, model, and loss function in order to load it in!

met_copy = po.Metamer(img, model)
# it's modified in place, so this method doesn't return anything
met_copy.load("test.pt")
(met_copy.saved_metamer == met.saved_metamer).all()

tensor(True)

Because the model itself can be quite large, we do not save it along with the Metamer object. This is why you must initialize it before loading from disk.

Reproducibility#

You can set the seed before you call synthesize for reproducibility by using set_seed. This will set both the pytorch and numpy seeds, but note that we can’t guarantee complete reproducibility: see Reproducibility and Compatibility for some caveats.

Also note that pytorch does not guarantee identical results between CPU and GPU, even with the same seed.

More Advanced Options#

The solution found by the end of the Basic usage section is only one possible metamer. In general, optimization in a high-dimensional space with non-linear models is inherently challenging and so we can’t guarantee you’ll find a model metamer, but we do provide some tools / extra functionality to help.

Initialization#

By default, we initialize the metamer attribute with with uniformly-distributed random noise between 0 and 1. If you wish to use some other image for initialization, you can initialize it yourself (it must be the same shape as target_signal) and pass to the optional setup method before calling synthesize.

Optimization basics#

You can set all the various optimization parameters you’d expect. setup has an optimizer argument, which accepts an uninitialized pytorch optimizer, and an optimizer_kwargs arg, which accepts a dictionary. You can therefore change the optimizer from the default Adam and/or specify any of the arguments you would like:

met = po.Metamer(img, model)
met.setup(optimizer=torch.optim.SGD, optimizer_kwargs={"lr": 0.001})
met.synthesize()

/home/jenkins/agent/workspace/CCN_neurorse_plenoptic_main/lib/python3.12/site-packages/plenoptic/_synthesize/metamer.py:374: UserWarning: Loss has converged, stopping synthesis
  warnings.warn("Loss has converged, stopping synthesis")

setup also accepts a scheduler argument, so that you can pass a pytorch scheduler, which modifies the learning rate during optimization.

Regularization penalty#

It is sometimes useful to control properties of the synthesized metamer beyond those captured by the input model. Some examples include controlling the range of pixel values, penalizing high frequencies, or matching the spectrum of the target image.

For this purpose, the Metamer class takes an optional penalty_function argument at initialization. The penalty_function is a callable that takes as an input the synthesized metamer image, and returns a scalar penalty. This scalar penalty is added to the loss during optimization, and it can be used to control certain properties of the synthesized metamer.

For example, the default penalty_function uses the penalize_range function to penalize pixel values that fall outside the range [0, 1], helping to keep the synthesized metamer within this range. The user can pass custom penalty functions that control other properties of the synthesized metamer. For example, we can constrain the image pixels to fall inside a different range, by using the argument allowed_range in the penalize_range function to define a new range penalization. Below we show how to constrain the pixel range to be between 0.2 and 0.8.

# Create custom_penalty function, that penalizes pixels outside of [0.2, 0.8] range
def custom_penalty(image):
    penalty = po.regularize.penalize_range(image, allowed_range=(0.2, 0.8))
    return penalty


# Pass the custom_penalty function to the Metamer class, and synthesize the metamer
met = po.Metamer(
    img,
    model,
    penalty_function=custom_penalty,
)
met.synthesize(store_progress=True, max_iter=50)

po.plot.synthesis_histogram(met);

../../_images/d1bba8511e4444bb21247591f52f71e97f8372186dda8c5ac7feb3d919105191.png

We see that the metamer pixel histogram ranges from 0.2 to 0.8, while the original target image ranges from 0.0 to 1.0.

The output of this penalty function is stored at each iteration in the penalties attribute, analogous to the losses attribute:

print(met.penalties)
print(met.losses)

tensor([3.5104e+02, 3.0113e+02, 2.5635e+02, 2.1644e+02, 1.8113e+02, 1.5013e+02,
        1.2315e+02, 9.9867e+01, 7.9970e+01, 6.3146e+01, 4.9083e+01, 3.7477e+01,
        2.8039e+01, 2.0491e+01, 1.4567e+01, 1.0020e+01, 6.6207e+00, 4.1588e+00,
        2.4464e+00, 1.3167e+00, 6.2375e-01, 2.4205e-01, 6.5581e-02, 7.6872e-03,
        4.5609e-04, 4.5555e-04, 3.7710e-04, 3.6208e-04, 2.3162e-04, 4.4498e-04,
        2.9728e-04, 2.4223e-04, 2.6075e-04, 2.4412e-04, 3.2399e-04, 1.6287e-04,
        2.2779e-04, 2.2183e-04, 2.2760e-04, 2.1324e-04, 2.2485e-04, 1.7958e-04,
        9.4977e-05, 1.5403e-04, 1.3475e-04, 1.7680e-04, 2.0200e-04, 1.5314e-04,
        1.3851e-04, 1.2586e-04, 1.0349e-04])
tensor([3.5104e+01, 3.0113e+01, 2.5635e+01, 2.1644e+01, 1.8113e+01, 1.5013e+01,
        1.2315e+01, 9.9867e+00, 7.9970e+00, 6.3146e+00, 4.9083e+00, 3.7477e+00,
        2.8039e+00, 2.0491e+00, 1.4568e+00, 1.0021e+00, 6.6211e-01, 4.1592e-01,
        2.4467e-01, 1.3170e-01, 6.2401e-02, 2.4230e-02, 6.5824e-03, 7.9194e-04,
        6.7879e-05, 6.6962e-05, 5.8324e-05, 5.6094e-05, 4.2376e-05, 6.3091e-05,
        4.7748e-05, 4.1711e-05, 4.3070e-05, 4.0947e-05, 4.8503e-05, 3.1991e-05,
        3.8107e-05, 3.7159e-05, 3.7405e-05, 3.5659e-05, 3.6528e-05, 3.1726e-05,
        2.3007e-05, 2.8666e-05, 2.6506e-05, 3.0490e-05, 3.2801e-05, 2.7717e-05,
        2.6066e-05, 2.4623e-05, 2.2218e-05])

The Metamer class also has a penalty_lambda argument, that weights the contribution of the penalty function to optimization.

Coarse-to-fine optimization#

Some models, such as the Portilla-Simoncelli texture statistics, have a multiscale representation of the image, which can complicate the optimization. It’s generally recommended that you normalize the representation (or use a specific loss function) so that the different scales all contribute equally to the representation (see Tweaking the model for more information).

We provide the option to use coarse-to-fine optimization, such that you optimize the different scales separately (starting with the coarsest and then moving progressively finer) and then, at the end, optimizing all of them simultaneously. This was first used in Portilla and Simoncelli, 2000, and can help avoid local optima in image space. Unlike everything else described in this notebook, it will not work for all models. There are two specifications the model must meet (see Model requirements page for more details):

It must have a scales attribute that gives the scales in the order they should be optimized.
Its forward method must accept a scales keyword argument, which accepts a list and causes the model to return only the scale(s) included. See forward for an example.

We can see that the included PortillaSimoncelli model satisfies these constraints, and that the model returns a subset of its output when the scales argument is passed to forward:

# we change images to a texture, which the PS model can do a good job capturing
img = po.data.reptile_skin()
ps = po.models.PortillaSimoncelli(img.shape[-2:])
print(ps.scales)
print(ps.forward(img).shape)
print(ps.forward(img, scales=[0]).shape)

['pixel_statistics', 'residual_lowpass', 3, 2, 1, 0, 'residual_highpass']
torch.Size([1, 1, 710])
torch.Size([1, 1, 181])

There are two choices for how to handle coarse-to-fine optimization: 'together' or 'separate'. In 'together' (recommended), we start with the coarsest scale and then gradually add each finer scale (this is like blurring the objective function and then gradually adding details). In 'separate', we compute the gradient with respect to each scale separately (ignoring the others), then with respect to all of them at the end.

If our model meets the above requirements, then we can use the MetamerCTF class, which uses this coarse-to-fine procedure. We specify which of the two above options are used during initialization, and it will work through the scales as described above (and will resume correctly if you resume synthesis). Note that the progress bar now specifies which scale we’re on.

met = po.MetamerCTF(img, ps, loss_function=po.loss.l2_norm, coarse_to_fine="together")
met.synthesize(store_progress=True, max_iter=100)
# we don't show our synthesized image here, because it hasn't gone through all the
# scales, and so hasn't finished synthesizing

/home/jenkins/agent/workspace/CCN_neurorse_plenoptic_main/lib/python3.12/site-packages/plenoptic/validate.py:347: UserWarning: Validating whether model can work with coarse-to-fine synthesis -- this can take a while!
  warnings.warn(

In order to control when synthesis considers a scale to be “done” and move on to the next one, you can set two arguments: change_scale_criterion and ctf_iters_to_check. If the scale-specific loss (current_scale_loss in the progress bar above) has changed by less than change_scale_criterion over the past ctf_iters_to_check iterations, we consider that scale to have reached a local optimum and move on to the next. You can also set change_scale_criterion=None, in which case we always shift scales after ctf_iters_to_check iterations

# initialize with some noise that is approximately mean-matched and with low variance
im_init = torch.rand_like(img) * 0.1 + img.mean()
met = po.MetamerCTF(
    img,
    ps,
    loss_function=po.loss.l2_norm,
    coarse_to_fine="together",
)
met.setup(im_init)
met.synthesize(
    store_progress=10,
    max_iter=500,
    change_scale_criterion=None,
    ctf_iters_to_check=7,
)
po.plot.imshow(
    [met.image, met.metamer],
    title=["Target image", "Synthesized metamer"],
    vrange="auto1",
);

../../_images/5653c7fced416d80b4e127b7ac0a7617e35592012ddd6d4a5d3c15e8e0e4651f.png

And we can see these shifts happening in the animation of synthesis:

po.plot.synthesis_animate(met)

MetamerCTF has several attributes which are used in the course of coarse-to-fine synthesis:

scales_loss: this list contains the scale-specific loss at each iteration (that is, the loss computed on just the scale(s) we’re optimizing on that iteration; which we use to determine when to switch scales).
scales: this is a list of the scales in optimization order (i.e., from coarse to fine). The last entry will be 'all' (since after we’ve optimized each individual scale, we move on to optimizing all at once). This attribute will be modified by the synthesize method and is used to track which scale we’re currently optimizing (the first one). When we’ve gone through all the scales present, this will just contain a single value: 'all'.
scales_timing: this is a dictionary whose keys are the values of scales. The values are lists, with 0 through 2 entries: the first entry is the iteration where we started optimizing this scale, the second is when we stopped (thus if it’s an empty list, we haven’t started optimizing it yet).
scales_finished: this is a list of the scales that we’ve finished optimizing (in the order we’ve finished). The union of this and scales will be the same as metamer.model.scales (e.g., scales).

A small wrinkle: if coarse_to_fine=='together', then none of these will ever contain the final, finest scale, since that is equivalent to 'all'.