Introduction

A central goal in neuroscience is to understand how behaviorally-significant computations are implemented by cellular and synaptic mechanisms (see (Churchland and Abbott 2016)). Such computations typically rely on a diverse collection of interacting circuit mechanisms, many of which are nonlinear (Zador 2000; Gollisch and Meister 2010; Jadzinsky and Baccus 2013; Isaacson and Scanziani 2011). Circuit outputs - which are often most amenable to experimental measurement - typically do not uniquely identify the role of specific circuit mechanisms. Hence progress towards understanding the mechanistic basis of circuit function requires better tools to manipulate specific mechanisms and identify their contribution to circuit output.

Sensory systems provide a clear example of these issues. Sensory processing is strongly shaped by the properties of the sensory receptors themselves and by post-receptor circuit mechanisms. Yet separating the contributions of receptor and post-receptor mechanisms to circuit outputs can be difficult, and computational models often make untested assumptions about their relative importance. For example, models for retinal processing generally assume that photoreceptor responses are linear or near-linear, despite decades of direct evidence that this is not the case (reviewed by (Burns and Baylor 2001; G. Schwartz 2021)). Adaptation to changes in mean light intensity provides a specific and important example. Both photoreceptors and post-photoreceptor circuits adjust response properties such as gain and kinetics to match the prevailing light inputs (reviewed by (Dunn and Rieke 2006; Demb 2008)), yet models for retinal outputs typically start by passing light inputs through a linear spatio-temporal filter; this architecture includes linear-nonlinear (Chichilnisky 2001), generalized-linear (Pillow et al. 2008) and computational neural network (CNN) (McIntosh et al. 2016; Turner et al. 2019) models. Models with an initial linear filtering stage cannot capture the spatially local adaptation produced by photoreceptors. Tools that separate receptor and post-receptor contributions to adaptation are needed to better understand how this salient aspect of retinal processing works.

Here, we show how we can predictably manipulate the responses of rod and cone photoreceptors to causally probe their role in shaping responses of downstream visual neurons. We build on decades of work identifying and testing biophysical models for rod and cone phototransduction (Pugh and Lamb 1993; Rieke and Baylor 1998; Nikonov, Lamb, and Pugh 2000; Younger, McCarthy, and Owen 1996). We show that existing phototransduction models, with appropriate parameters, can account for responses of rod and cone photoreceptors from both primate and mouse to a broad range of inputs. We then show that these models can be mathematically inverted - enabling the design of stimuli that will give rise to photoreceptor responses with specific desired properties such as a linear dependence on light input. Direct recordings of responses to these stimuli show that they work as designed. This approach allows us to determine the impact of specific properties of the phototransduction currents on visual signaling. For example, we can negate the impact of photoreceptor adaptation and test how adaptation in the responses of retinal ganglion cells is altered.

Results

Biochemical model

We chose a model architecture based on the biochemical interactions that comprise the phototransduction cascade. We chose this model rather than empirical models (e.g. (Clark et al. 2013); see (Angueyra et al. 2022) for comparison of the biochemical model with the Clark model) because it was more clearly connected to the mechanistic operation of the phototransduction cascade and because it was exactly invertible.

Combined work in biochemistry, molecular biology and physiology has produced a clear understanding of the operation of the phototransduction cascade (Figure 1A; reviewed by (Burns and Baylor 2001; Rieke and Baylor 1998; G. Schwartz 2021). In brief, the phototransduction current flows through cGMP-gated channels in the photoreceptor outer segment. Levels of cGMP, and hence the phototransduction current, are maximal in darkness. Light activates (via a G-protein coupled receptor and a G-protein) a cGMP-phosphodiesterase. The resulting decrease in cGMP concentration allows some cGMP-gated channels to close and decreases the current. Levels of cGMP are restored by synthesis of cGMP by a guanylate cyclase; cyclase activity is sped by a calcium-feedback pathway that strongly shapes light responses (Burns et al. 2002).

Model and fitting procedure. A. Phototransduction cascade and differential equations that describe operation of key components. Abbreviations are as follows: Stim: stimulus; R: receptor (photopigment) activity; P: phosphodiesterase activity; G: cGMP concentration; S: rate of cGMP synthesis; I: membrane current; C: calcium concentration. B. Fits (red) of a model with four free parameters (γ, σ, η, KGC) to the responses of a mouse rod (see Matlab code for parameter values). All measured responses are from the same rod, and all model responses use the same values for the free parameters. Responses to different flash strengths or flash timing have been displaced vertically in the left two panels. The weakest flash in the far left panel produced on average ∼3 activated rhodopsin molecules, and each successive flash was twice as bright. The bottom trace in the middle panel was to a light step alone, while the other traces in this panel each had 5 superimposed flashes. The horizontal scale bar for the inset in B is 1 sec.

The biochemical understanding summarized in Figure 1A has led to several quantitative models for phototransduction (Pugh and Lamb 1993; Rieke and Baylor 1998; Nikonov, Lamb, and Pugh 2000; Younger, McCarthy, and Owen 1996). We focus on a set of differential equations that form the core of these models. Our goal was to develop a model that generalized across stimuli rather than one in which the parameters accurately reflected the operation of components of the cascade. Below we show that the model illustrated in Figure 1A, with appropriate parameters, captures the responses of rod and cone photoreceptors to a variety of stimuli. Most importantly for our purposes here, the models are sufficiently accurate to enable us to accurately predict stimuli that elicit desired photoreceptor responses, as tested in Figures 5-8.

Our phototransduction model has a total of eleven parameters (Figure 1A). Two of these parameters can be expressed in terms of others using steady-state conditions, several are well constrained by direct measurements, and several have a similar impact on model output and hence do not need to be treated as independent free parameters (see Methods). These considerations allowed us to reduce the number of free model parameters to 4 for rods and 5 for cones. This choice reflects a balance of minimizing the number of free model parameters while providing sufficient flexibility to capture key aspects of the light response.

The free parameters consisted of the gain γ with which photons are converted to photopigment activity, the photopigment decay rate σ, the spontaneous PDE activation rate η, the constant KGC specifying the calcium sensitivity of the cGMP synthesis rate, and (for cones) the rate β of Ca2+ extrusion. This choice of free parameters is not unique given the similar impact of some parameters (such as σ and the PDE decay rate Ф). But, as indicated below, it provided sufficient flexibility to capture measured responses to a variety of stimuli. We numerically identified optimal values for these parameters separately for each photoreceptor type (see first section of associated Matlab code). As described below, we used different fitting procedures for rod and cone photoreceptors due to differences in how much data we could collect from individual cells via different recording techniques (see Methods for more details).

Fitting rod phototransduction models

We used suction electrode recordings to measure responses of mouse and macaque rod photoreceptors to a set of stimuli consisting of a family of brief flashes of different strengths, flashes delivered at several times relative to the onset and offset of a light step, and a “variable mean noise stimulus” consisting of Gaussian noise with periodic changes in mean intensity (Figure 1B; see Methods for experimental details). The variable mean noise stimulus approximates the large and frequent changes in input produced by eye movements and the structure of natural images (Frazor and Geisler 2006). These large (up to 30-fold) changes in mean intensity strongly engage photoreceptor adaptation, and the superimposed Gaussian noise probes sensitivity to rapidly varying inputs.

We recorded responses to this collection of stimuli for each rod in our data set. We fit mouse and macaque rods separately, in each case seeking a common model (i.e. with the same parameters) that fit responses across stimuli. We identified model parameters by numerically minimizing the mean-squared error between the model predictions and measured responses across stimuli.

Figure 1B compares the predicted and measured responses of a mouse rod photoreceptor. Predicted and measured responses are not identical - e.g. the model predicts a faster initial rising phase of the flash response than that measured. Nonetheless, a common model specified by four free parameters (γ, σ, η, and KGC) accounts for many aspects of the responses to this set of stimuli. These parameters were well-constrained by fitting the measured responses: best fit parameters varied by ∼10% across different photoreceptors, and the mean-squared error increased significantly for 10-20% changes in parameters.

To arrive at consensus models for rod transduction currents, we fit measured responses from multiple rods from either primate or mouse simultaneously. We first used the measured dark current and the previously-measured relation between current and cGMP (Eq. 4 in Figure 1; (Rieke and Baylor 1996)) to specify the cGMP concentration in darkness for each cell. We then allowed γ to vary between cells to account for differences in sensitivity. The remaining parameters (σ, η, and KGC) were constrained to have the same value across cells. These consensus models provide our best estimate of how an unrecorded cell of a given type will respond; these models provide the foundation for the manipulations of the photoreceptor responses in Figures 5-9. Consensus parameters for mouse and primate rods were very similar (see Matlab code for consensus model parameters and examples of fitting procedures).

Fitting cone phototransduction models

We recorded the responses of mouse and macaque cones using whole-cell patch clamp approaches. Light responses in these recordings run down quickly, and hence we recorded responses to a single stimulus from each individual cone. Based on the rod recordings described above, and earlier work on primate cones (Angueyra et al. 2022), we identified the variable mean noise stimulus as the single stimulus that best constrained the models. Models fit to the variable mean noise stimulus generalized well to other stimuli (see Figure 2 - Figure Supplement 1).

We identified consensus cone models by fitting measured responses to the variable mean noise stimulus from multiple cones simultaneously, as described above for the rods. Consensus parameters for mouse and primate cones differed substantially, reflecting the ∼2-fold faster kinetics of primate cone responses (flash response time-to-peak is ∼35 ms in primate cones, and ∼70 ms in mouse cones; see Matlab code for model parameters). For primate cones, performance using these consensus parameters was very similar to that from the parameters in (Angueyra et al. 2022); for consistency we used these published parameters for our consensus model.

Model performance

Figure 2 tests the performance of the consensus models for both rod and cone photoreceptors. We focused on the variable mean noise stimulus since it probes responses to rapid stimulus variations and slower changes in mean intensity. For each individual cell, we specified the dark cGMP concentration using the measured dark current and allowed the overall gain Y to vary; the remaining parameters were set to the consensus model values. The resulting models (red) captured the measured responses (black) well.

Photoreceptor model and fits. A. Comparison of measured responses (black) with predictions of full (red) and linear model (blue) to variable mean noise stimulus (gray). Full model responses use consensus parameters from fitting responses of multiple cells of each type simultaneously, with the dark current and sensitivity allowed to vary between cells (see Methods). The linear model was generated from fitting the low-contrast responses of the full model (see Methods). Insets expand regions in gray boxes. B. Fraction of variance explained for the full model fit to each cell individually (y-axis) plotted against that for the consensus model that has fixed parameters across cells except for the dark current and sensitivity.

Figure 2 also shows predictions of a linear phototransduction model (blue) (see Methods). Rod and cone phototransduction currents depend linearly on light intensity for low contrast inputs (e.g. (Hass et al. 2015; Schnapf et al. 1990; Baylor, Lamb, and Yau 1979)). Hence, the linear model was fit to responses of the full model to low contrast inputs (see Methods for model construction details); this means that the differences between the linear and full model responses can be directly attributed to nonlinearities in phototransduction. Linear predictions deviated from the measured responses considerably more than did predictions of our biochemical model. Specifically, linear models overpredicted responses at high mean light levels, and underpredicted responses at low mean light levels. Some of these issues could be resolved by adding a time-independent nonlinearity to the linear model (i.e. a linear-nonlinear model); however, responses to many stimuli show clear time-dependent nonlinearities that are not captured by linear-nonlinear models (e.g. Figure 5; see also (Angueyra et al. 2022)). The biochemical model, though not perfect, tracked the measured responses quite well. For each recorded cell, the consensus models captured more than 80% of the variance of the responses (Figure 2B); on average these models captured ∼90% of the variance.

We compared the performance of the consensus models with that of models fit to each cell individually. Consensus models and models fit to individual cells performed similarly (Figure 2B). Models also generalized well across stimuli and to cells that did not contribute to the fits (Figure 2 - Figure Supplement 1). This suggests that the consensus models can be used to predict responses of unrecorded cells.

Figures 1 and 2 indicate that the known components of the phototransduction cascade, with appropriate parameters, can account for the full time course of the rod and cone photocurrents, including the changes produced by adaptation. Our central goal was to use these models to design stimuli that permit predictable manipulations of the photoreceptor responses. We directly evaluate whether the models are adequate for this goal below.

Model Inversion

In this section we show that the model illustrated in Figure 1A can be exactly inverted to identify the stimulus that corresponds to a specific desired photoreceptor response. Model inversion requires a one-to-one relationship between inputs and outputs - i.e. for every input there must be a unique output and vice versa. Mathematically, this holds for the phototransduction model summarized in Figure 1A: the model consists of linear elements (Eqs. 1, 2 and 5), static nonlinear elements (Eqs. 4 and 6) and a time-dependent nonlinear element (Eq. 3). The linear elements can be described as linear filters, and inverted by deconvolution. The static nonlinear elements consist of one-to-one mappings of inputs to outputs, and can be inverted correspondingly. The time-dependent nonlinear component can be rearranged and solved analytically (see Methods). The result is an analytical relationship between the phototransduction current and the stimulus - in other words an inverse model that takes the photocurrent as input and identifies the corresponding stimulus.

Figure 3 illustrates the steps in this procedure, using the current response generated by the forward model to the variable mean noise stimulus as an example “target” (far right). The first step is to convert this target current response to corresponding time-varying concentrations of cGMP and calcium using Eqs. 4 and 5 (step 1). In step 2, the time courses of cGMP and calcium are used to determine the time-varying PDE activity using Eqs. 3 and 6. In the final step (step 3), the PDE activity is used to determine the stimulus through Eqs. 1 and 2. In this case, as expected, we recover exactly the stimulus that we started with, confirming that mathematically the phototransduction model can be inverted.

Steps in model inversion. A test current was generated from a variable mean noise stimulus using the full model (far right). Step 1 (right) converts this current into changes in cGMP and calcium using Eqs. 4 and 5 (see Figure 1). Step 2 converts the time course of the cGMP and calcium into that of the PDE activity using Eqs. 3 and 6. Finally, step 3 converts the PDE to the stimulus using Eqs. 1 and 2. The estimated stimulus is identical to the initial stimulus because there is no added noise and the inversion process is exact.

The procedure illustrated in Figure 3 is exact in the context of the phototransduction model - i.e. given the architecture of the model, there is a one-to-one mapping of inputs to outputs. In practice, we would like to use this procedure to predict stimuli that will elicit specific photoreceptor responses. At least two issues could cause these model-based predictions to fail. First, the forward model may not accurately capture aspects of the real photoreceptor responses that are important for inversion. The model, for example, may capture some temporal frequencies of the response better than others, and this could limit the accuracy of the model inversion at those frequencies that are not captured well. Second, inversion could be highly sensitive to noise. The limited ability to recover spatial information in deconvolution microscopy provides an example of how noise can limit model inversion even with a good forward model. Noise in deconvolution microscopy is controlled by choosing not to try to recover spatial frequencies at which the noise is too high.

To test how well the model inversion process works on real responses, we applied it to measured photoreceptor responses to the variable mean noise stimulus. As described above, noise in the measured responses could cause the estimate to be better at some temporal frequencies than others; this is particularly true at high temporal frequencies that are poorly encoded by the photoreceptors. To minimize this effect while also probing the limits of the estimation approach, we limited the frequency content of the stimulus to temporal frequencies that we could reasonably expect to recover (0-60 Hz for primate cones, 0-30 Hz for mouse cones, 0-15 Hz for both primate and mouse rods) and required that the power spectrum of our estimate matched that of the true stimulus (see Methods). Figure 4A shows stimulus estimates based on measured responses for each photoreceptor type. The estimates closely approximate the actual stimulus, including recovering rapid stimulus fluctuations (insets). For each photoreceptor type, stimulus estimates based on model inversion captured the majority of the stimulus variance (Figure 4B).

Test of model inversion based on measured responses. A. Stimulus (gray in top panel), measured response (black in lower panels) and estimated stimulus (blue in top panels) calculated by using the measured response as input to the inverse model as in Figure 3. Estimates are able to recover both the periodic changes in mean intensity and the more rapid superimposed stimulus modulations (insets). B. Variance explained for stimulus estimates based on the average response across multiple stimulus trials compared to that based on individual responses. Since the model captures only the deterministic part of the response, noise in the individual responses lowers the accuracy of the estimates and causes the points to fall below the unity line. This effect is modest but systematic.

Variance explained, as in Figure 4B, mixes low temporal frequencies for which we might expect inversion to work well with high temporal frequencies for which it is expected to do less well. To explore the accuracy of the inversion across temporal frequencies, we compared the power spectrum of the stimulus with that of the residuals given by the difference between the stimulus and estimate (Figure 4 - Figure Supplement 1). At low temporal frequencies, stimulus power was much greater than that of the residuals, consistent with accurate stimulus estimation at these frequencies. This difference decreased with increasing frequency, and the two power spectra converged near 30 Hz for primate cones, 10 Hz for mouse cones, and 2-3 Hz for both primate and mouse rods. This provides an estimate of the frequency range over which stimulus inversion is possible.

The ability to convert photoreceptor responses to input stimuli is useful in several ways (see Discussion). Our focus in the remainder of the Results is using it to design stimuli that alter the photoreceptor responses in specific ways, such as negating the impact of adaptation and shaping response kinetics.

Light-adaptation clamp examples

Figure 5 illustrates how we can use the model inversion procedure to predictably manipulate photoreceptor responses, in this case using the cone model and (for simplicity) a sinusoidal stimulus. We start with a desired or target response. For the specific applications described below, the aim was to remove nonlinearities in photoreceptor responses and hence we generate the target response from a linear phototransduction model. As for Figure 2 above, the linear model used to generate the target response was obtained by fitting responses of the full phototransduction model to low contrast stimuli (see Methods). The response of the linear model to a sinusoidal stimulus is, as expected, another sinusoid (Figure 5, second panel from left). The response of real cones, and of the full cone model, deviates strongly from a sinusoid (black trace in the far right panel; (Angueyra et al. 2022)).

Adaptive nonlinearities in phototransduction cause the responses to sinusoidal light inputs to deviate from sinusoids. These time-dependent nonlinearities operate sufficiently rapidly to shape each cycle of the response, causing responses to decreases in light intensity to be larger than those to increases and causing the responses to light-to-dark transitions to be slower than those to dark-to-light transitions. To eliminate the impact of these nonlinear response properties, we seek a stimulus that will cause the response of the full model to match the target response - i.e. a stimulus that “clamps” the cone response to the target. The model inversion process identifies such a stimulus directly (red trace in second panel from right). Our primary interest is not in the shape of the stimulus itself but instead in the response that the stimulus produces. The right panel in Figure 5 compares measured cone responses to the original (black) and modified (red) stimuli; responses to the modified stimulus are considerably more sinusoidal than those to the original stimulus.

The approach outlined in Figure 5 is exact in principle, but could fail due either to inadequacies of the model or to cell-to-cell variability in the photoreceptor responses. Hence, a key step is testing whether the calculated stimuli produce the expected responses. Figures 6-8 test this approach quantitatively for several stimulus manipulations.

Light-adaptation clamp procedure. Starting with an initial stimulus (left), we generate a target or desired response (second panel from right). In this case, the target was chosen to be the response of a linear phototransduction model; for a sinusoidal input, a linear model produces a sinusoidal output. We use the (linear) target response as the input to the inverse phototransduction model and identify the stimulus required to elicit that response (red in third panel from left). Substantial stimulus modifications are required for the full model to produce a sinusoidal output. Finally, we confirm that the modified stimulus works as designed in direct recordings, in this case from a primate cone (right panel).

Compensating for increment/decrement asymmetries in responses to sinusoidal stimuli. A. Photoreceptor responses (bottom, black) to original sinusoidal stimuli (top, black) and modified stimuli (red). Thin lines in the bottom panels are best-fitting sinusoids for reference. Original and modified stimuli are shown above the recorded responses. B. Mean-squared-error between measured responses and best fitting sinusoids for modified stimuli plotted against that for original stimuli for each recorded cell. Sinusoidal fits had lower error for the modified stimuli compared to the original stimuli.

Sinusoidal stimuli

We start with the sinusoidal stimuli illustrated in Figure 5. As predicted by the model, responses of both rod and cone photoreceptors to high-contrast sinusoidal light inputs are strongly nonsinusoidal: responses to decreases in light intensity are larger than those to increases, and responses to dark-to-light transitions are faster than those to corresponding light-to-dark transitions (Figure 6A). These nonlinearities in the photoreceptor responses are clear from the substantial deviations between the measured responses and the best-fitting sinusoids (thin lines in Figure 6A). These response asymmetries are important in interpreting responses of downstream visual neurons to similar stimuli. For example, asymmetries in signaling of On and Off RGCs are often attributed to differences in On and Off retinal circuits (e.g. (Stockman, Petrova, and Henning 2014)), and the possibility that they arise at least in part in the photoreceptors themselves is rarely considered.

We measured responses of rod and cone photoreceptors to sinusoidal stimuli and stimuli designed to compensate for adaptive nonlinearities in phototransduction. Responses to the modified stimuli (red traces in Figure 6A) were much closer to sinusoidal than responses to the original sinusoidal stimuli (black traces). Specifically, asymmetries between increment and decrement responses and light-to-dark vs dark-to-light transitions were substantially reduced for the modified stimuli. We quantified how much measured responses deviated from a sinusoid by computing the mean-squared error (MSE) between the responses and the best-fit sinusoid for the original and modified stimuli (Figure 6B). Sinusoidal fits to responses to the modified stimuli had considerably lower MSE than those to the original stimuli in each rod and cone tested.

Neither the sinusoidal stimuli nor the specific cells that contribute to Figure 6 were used in fitting the phototransduction models. Hence Figure 6 tests the ability of the model-based inversion process to generalize across photoreceptors and across stimuli.

Steps and flashes

As a second example of the light-adaptation clamp, we generated stimuli that compensate for the adaptive changes in gain of photoreceptor responses produced by changes in mean light intensity. These gain changes cause responses to a fixed strength flash to become smaller and faster when the mean light intensity increases (black traces in Figure 7A).

Compensating for adaptation produced by changes in mean light level. A. Responses to a brief light flash delivered before and during a step in light intensity. For the original stimuli, flashes delivered before and during the step are identical, and the resulting responses decrease in amplitude ∼2-fold (summarized on x-axis of bottom panels). Red traces show responses to stimuli designed to compensate for the adaptation produced by the change in light intensity (following approach in Figure 5). B. Summary of gain changes (amplitude of response during the step divided by that of response prior to step) for responses to modified (y-axis) and original (x-axis) stimuli.

The control stimulus consisted of identical flashes delivered at two mean light levels. As expected, responses to this stimulus (black in Figure 7A) showed clear effects of adaptation. Specifically, the response to the flash decreased ∼2-fold at the higher mean light level (summarized in Figure 7B). We then passed this original stimulus through a linear model to estimate the photoreceptor response without adaptation. This linear prediction provided the target response in the stimulus identification process illustrated in Figure 5; the resulting modified stimuli are shown in red at the top of Figure 7A.

The red traces in Figure 7A (bottom) show responses to the stimuli designed to compensate for adaptation. Responses to the test flashes had similar amplitudes at the low and high mean light level. This similarity held across cells (the ratio of the response amplitudes is plotted on the y-axes in Figure 7B), demonstrating that the modified stimuli effectively compensated for the reduction in response gain produced by adaptation. None of the cells in Figure 7 went into the model fitting, so this again tests the ability of the models to generalize across photoreceptors.

Altering kinetics

As for adaptation, separating the contributions of phototransduction and post-transduction circuit mechanisms to the kinetics of responses of downstream cells is difficult. Hence we sought to use the model inversion process to predictably alter the kinetics of the photoreceptor responses and determine their impact on the kinetics of downstream responses. For example, compensating for the speeding of photoreceptor responses that occurs with increasing light intensity would isolate the kinetics of the post-transduction circuitry and determine if they change independently from the changes in kinetics of the photoreceptor responses.

Slowing down the kinetics is conceptually straightforward - we slow down the kinetics of the stimulus itself and that will also cause the response to slow down. We apply this to responses to brief flashes. To test if we can do this in a predictable manner, we stretched the time axis of the linear responses to a brief flash by factors of 1.25-4, and used the resulting response as the target for the model inversion as in Figure 5. We omitted mouse cones because these experiments required longer-lasting recordings with stable kinetics, which were difficult to achieve with mouse cones. Figure 8A compares measured (thick traces) and predicted (thin traces) responses to the original flash and to stimuli designed to slow down the photoreceptor response slightly (left) and more substantially (right). The measured responses to the modified stimuli follow the predictions well for both rod and cone photoreceptors (summarized in Figure 8C).

Speeding up the responses is similarly straightforward in principle but more difficult in practice. Stimuli that speed up the response consist of a brief increment followed by a decrement - which together cause all but the initial rising phase of the response to cancel. In practice, this approach is limited because the contrast of the decrement cannot exceed 100%. Nonetheless, speeding up responses by 20-30% is possible (Figure 8B), and again measured responses closely follow predictions (Figure 8C).

Manipulating kinetics of photoreceptor responses. A. Responses of primate cones and mouse rods to brief flashes (black) and stimuli designed to slow down responses slightly (left) and more substantially (right) (red). Thin traces show the target responses used to generate the modified stimuli. B. As in A for stimuli designed to speed up responses. C. Measured change in time-to-peak plotted against predicted change for a variety of manipulations of kinetics as in A and B.

Impact of cone adaptation on ganglion cell responses

Here, we show one example of how the ability to manipulate photoreceptor responses can be applied to reveal adaptational mechanisms at different locations in retinal circuits. Specifically, we compare responses of primate cones and downstream retinal neurons to the step plus flash stimulus used in Figure 7.

Figure 9A shows responses of a primate cone, a horizontal cell, and an On parasol ganglion cell to the original step plus flash stimulus (black) and the modified stimulus that negates the cone adaptation (red). The stimuli used for the three cells illustrated are identical. As shown in Figure 6, the modified stimulus effectively negates adaptation in the cone transduction currents, and the responses to the cone responses to the two flashes are similar. Negating adaptation in the cones also eliminated most or all of the adaptation in downstream responses (summarized in Figure 9B).

Cone and post-cone adaptation. A. Responses of a cone (top), horizontal cell (middle) and On parasol RGC (bottom) to the step and flash protocol for both original and modified stimuli. B. Summary of experiments like those in A, plotting the change in gain for the modified stimuli against that for the original stimuli.

Previous work reached a similar conclusion by quantitatively comparing adaptation measured in cones and RGCs (Dunn, Lankheet, and Rieke 2007). Negating cone adaptation provides an alternative approach to this issue and similar questions about the contributions of photoreceptor and post-photoreceptor adaptation. Figure 9 is a simple example of how the stimulus design approach developed here can be used. We return to other possible applications in the Discussion.

Discussion

Rod and cone photoreceptors play an essential role in the responses of downstream visual neurons and visual perception. Refining this picture to identify the specific contributions of photoreceptors and post-photoreceptor circuitry to the computations that underlie vision has been difficult. Here, we have created and tested a tool that allows predictable manipulations of the photoreceptor responses to reveal their role in downstream signaling. Direct tests indicate that this tool is effective, and that it generalizes well across photoreceptors and stimuli. We envision this being useful to causally test how responses of downstream cells or perception are shaped by specific aspects of the photoreceptor responses such as adaptation.

Applications

“Front end” for encoding models for downstream responses and perception

Two broad classes of encoding models have been used to describe responses of retinal ganglion cells and cells in downstream visual areas. Empirical models take light stimuli as inputs and convert these, through a series of linear and nonlinear elements, to predicted responses (Chichilnisky 2001; Pillow et al. 2008). Time-dependent nonlinearities are particularly hard to capture in such models, and few existing models account for them. Computational neural network (CNN) models similarly take light input and convert it, via the learned CNN weights, to predicted neural responses (McIntosh et al. 2016; Turner et al. 2019). Like empirical models, time-dependent nonlinearities such as adaptation are generally not well described by such models. As a result, these models work best when stimulus parameters such as mean light intensity do not change.

The photoreceptor models described here convert light inputs to photoreceptor responses, capturing time-dependent nonlinearities in this process. Empirical or CNN models could then be used to describe the conversion of photoreceptor signals to downstream neural responses. This architecture - a phototransduction model front end followed by an empirical model - should decrease the demand placed on the empirical model and improve the ability to capture responses to stimuli that strongly engage photoreceptor adaptation.

Models for visual perception could similarly incorporate a photoreceptor front end, and by doing so directly test which aspects of perception can be explained by phototransduction and which are due to downstream processing.

“Back end” to decoding models

Approaches to decode neural responses and estimate stimulus properties could similarly benefit from the photoreceptor models. Current decoding approaches either empirically fit response-stimulus relationships or invert encoding models to compute the likelihood of particular stimuli given the neural response (Bialek et al. 1991; Brackbill et al. 2020; Wu et al. 2022). In either case, decoding in the context of stimuli that strongly engage time-dependent nonlinearities has proven difficult.

Incorporating the inverse photoreceptor model into decoding approaches should improve decoding performance. Specifically, existing decoding approaches could be used to estimate the photoreceptor signal from downstream neural responses (e.g. those of RGCs), and then the inverse phototransduction model would convert the estimated photoreceptor signals to estimated stimuli. This again could decrease the demand placed on the decoding model. For example, under conditions in which much of retinal adaptation is accounted for by adaptation in the photoreceptors, the model used to estimate photoreceptor responses from RGC responses would need to vary little, and the inverse photoreceptor model would account for photoreceptor adaptation. A RGC→photoreceptor model should be simpler to fit and hence perform better than a full RGC→stimulus model.

Manipulations of phototransduction currents

The models that we introduce here provide a new tool to causally test the impact of alterations in photoreceptor responses on downstream responses and perception. For example, asymmetries in how increments and decrements in light intensity are processed have been well studied, including responses in retina, cortex and perception (Lu and Sperling 2012; Bowen, Pokorny, and Smith 1989). Parallel On and Off visual pathways are initiated at the cone output synapse, and asymmetries in downstream responses or perception are often attributed to asymmetries in the On and Off retinal circuits conveying photoreceptor signals to the ganglion cells (e.g. (Stockman, Petrova, and Henning 2014)). The implicit assumption is that photoreceptor inputs to On and Off retinal circuits are symmetrical. While this is likely the case for low contrast stimuli, adaptation in phototransduction means that high contrast stimuli will often not produce symmetric input to those circuits, and On/Off asymmetries may originate at least in part from asymmetric photoreceptor responses to light increments and decrements (Angueyra et al. 2022; Clark et al. 2013; Endeman and Kamermans 2010).

The ability to invert the phototransduction model permits the design of stimuli that minimize such asymmetries in the photoreceptor responses to test how they contribute to On/Off differences in downstream responses (e.g. (Yu et al. 2022)). More generally, the ability to shape photoreceptor responses in predictable ways provides a needed tool to isolate the effects of photoreceptor and post-photoreceptor circuits to shaping responses of downstream neurons and perception.

Complementing genetic approaches

Genetic manipulations provide another approach to alter photoreceptor responses and characterize the impact on responses of downstream visual neurons or behavior. Such approaches have the advantage of associating specific molecular components with alterations in vision - e.g. the role of arrestin (Xu et al. 1997; Burns et al. 2006) and rhodopsin kinase (Zhao, Palczewski, and Ohguro 1995; Wilden, Hall, and Kuhn 1986; Arshavsky 2002) in specific forms of stationary night blindness (Dryja 2000; Zeitz, Robson, and Audo 2014). But interpretation of these genetic manipulations can be limited by compensatory changes.

The approach that we introduce here complements genetic manipulations in at least two ways. First, it would be used in conjunction with genetic manipulations - e.g. to restore normal kinetics in photoreceptors in which the genetic alteration changes response kinetics. Second, the stimulus design approach provides an alternative that provides less detailed mechanistic information, but which allows more specific functional manipulations to be made and does not require genetic access.

Limitations

Model validation

The procedure that we use to invert the phototransduction cascade is only as good as the model upon which it is based. Thus a focus of the experiments presented here was to test a range of manipulations for stimuli and photoreceptors that did not contribute to the model fits. We did this by fitting consensus models to recordings from one set of photoreceptors of a given type, and then testing the stimuli generated by these models on different photoreceptors from different retinas. Discrepancies between the predicted and measured responses to these new stimuli reflect both failures of the model inversion and cell-to-cell variability. Systematic differences between predicted and measured responses were small, and most of the differences appear to reflect variability between cells (e.g. due to cell-to-cell differences in sensitivity).

Focus on phototransduction

Our model describes the relationship between light inputs and phototransduction currents. We restricted our consideration to photocurrents rather than photoreceptor synaptic output because the latter is shaped by several mechanisms (e.g. electrical coupling between photoreceptors (E. A. Schwartz 1976; Copenhagen and Owen 1976; Detwiler and Hodgkin 1979) and horizontal feedback (Baylor, Fuortes, and O’Bryan 1971)) that are less well understood than phototransduction. A similar approach could be extended beyond phototransduction when additional quantitative information is available.

Photopigment bleaching

The model that we used does not consider photopigment bleaching and regeneration via the pigment epithelium. We omitted these aspects of photoreceptor function because the experiments to which we fit and test the model require that we remove the pigment epithelium. We restricted total light exposure in these experiments to minimize bleaching, and this limits the range of light levels over which the model is applicable. This is less of a concern for rods, where rod saturation occurs for light levels at which a small fraction of the rod pigment is bleached, and correspondingly our models cover most or all of the range of rod signals. For cones, the model is limited to light levels < 50,000 R*/s (similar to typical indoor lighting conditions).

Speeding vs slowing responses

Not all manipulations of the photoreceptor responses are equally easily achieved. Speeding responses is particularly difficult because it requires increasing the amplitude of high temporal frequencies, and the ability to do that is limited by the requirement that light intensities not assume negative values. This means that low contrast stimuli can be sped more than high contrast stimuli.

Calibrations

The accuracy of the model predictions also depends critically on accurate light calibrations. We have detailed our procedure in Methods.

Methods

Recordings

We performed electrophysiological recordings from primate (Macaca fascicularis, nemestrina and mulatta, either sex, 2-20 years) and mouse retina (C57/BL6 or sv-129, either sex, 1-12 months) in accordance with the University of Washington Institutional Animal Care and Use Committee. Primate retinas were obtained through the Tissue Distribution Program of the University of Washington Regional Primate Research Center. Primate recordings were all at > 20° eccentricity.

Rod responses were recorded with suction electrodes (Field and Rieke 2002). These recordings were sufficiently stable that we could record responses to all three test stimuli in Figure 1B. Periodic standardized flashes tested for changes in kinetics or amplitude of the light response, and recordings were terminated if such changes were apparent. Most recordings were stable for at least 10-15 min before changes in kinetics were apparent.

Cones were recorded with whole-cell patch clamp techniques in slices (mouse; (Ingram, Sampath, and Fain 2019)) and whole-mount preparations (primate; (Angueyra and Rieke 2013)). Cone responses in these recordings run down quickly due to intracellular dialysis of the cell; hence responses to only one of the stimuli in Figure 1B were recorded from each cone. All data reported here were collected within 2-3 min of patch rupture and the onset of intracellular dialysis,

Light calibrations

Optical power was measured at the preparation with a calibrated power meter (Graseby). Power readings were converted to isomerizations per second (R*/sec) using the photoreceptor spectral sensitivities, the LED spectral outputs, and collecting areas of rods and cones (1 and 0.37 µm2 for primates, 0.5 and 0.2 µm2 for mouse). Light stimuli from the LEDs were focused on the preparation via the microscope condenser or objective. Stimuli uniformly illuminated a 600 µm diameter spot. LED outputs were carefully checked for linearity. For rod recordings, we used an LED with peak output at 470 nm, near the peak of the rod spectral sensitivity. For cone recordings, we used an LED with peak output at 405 nm, which produced near-equal (within 10%) activation of L and M cones in primate and S and M cones in mouse.

Forward model

The phototransduction cascade was modeled with the set of differential equations illustrated in Figure 1. This follows previous modeling work in both rods and cones (Pugh and Lamb 1993; Nikonov, Lamb, and Pugh 2000; Rieke and Baylor 1998; Younger, McCarthy, and Owen 1996). Our model has 11 parameters. In the first step of the model, the light stimulus (Stim) activates opsin molecules, converting inactive opsin (R) to active opsin (R*). Active opsin decays with a rate constant σ (Figure 1A, Equation 1). Next, phosphodiesterase (PDE) molecules are activated by active opsin molecules via the G-protein transducin; the resulting PDE activity (P) decays with a rate constant Ф (Figure 1A, Equation 2). We assume that the delay caused by transducin activation is negligible and hence omit this step for simplicity. The opsin decay rate σ and the PDE decay rate Ф were interchangeable, and the model output depended only on the smaller of these (i.e. the slower process). Hence σ and Ф were constrained to be equal.

The concentration of cGMP in the outer segment (G) is determined by the balance of cGMP hydrolysis mediated by PDE and synthesis (S) through guanylate cyclase (GC) (Figure 1A, Equation 3). For physiological conditions, the outer segment current (I) through cGMP-gated channels depends on a power of the cGMP concentration (Figure 1A, Equation 4; (Rieke and Baylor 1996)).

Calcium enters the outer segment through open cGMP channels and is removed by an exchange protein, with a rate constant p (Figure 1A, Equation 5). The calcium concentration (C) regulates the rate of cGMP synthesis (S); this dependence is modeled as a Hill curve (Figure 1A, Equation 6).

Two model parameters were fixed by steady state conditions and prior measurements. First, the constant q that relates current and changes in calcium can be expressed in terms of the dark calcium concentration, dark current, and rate constant β for calcium extrusion. Second, the maximal cyclase rate (Smax) can be written in terms of φ, η, KGC, h, and the dark calcium and cGMP concentrations. We fixed the constants k and n that determine the relation between cGMP and current (e.g. (Rieke and Baylor 1996)). Using these constants, the cGMP concentration in darkness was estimated from the measured dark current using Eq. 3 (e.g. the response to a saturating light flash) for each recorded cell. We also fixed m for both rods and cones and γ for rods based on previous measurements, and the dark calcium concentration (for rods and cones) because model outputs were insensitive to it.

This left a total of 4 free parameters for rod models (γ, η, σ, KGC), and 5 for cone models (γ η, β, σ, KGC). These parameters were determined by minimizing the mean-squared difference between the measured responses and model predictions while the remaining model parameters were held fixed. Some combinations of model parameters can trade for each other, resulting in similar model performance (e.g. the parameters controlling the calcium concentration and its action on the rate of cGMP synthesis). Our goal here was to identify a model that captures photoreceptor responses across a range of stimuli, rather than a model in which the fit parameters were predictions of the underlying biochemical parameters.

Parameters of consensus models were determined by simultaneously fitting measured responses across all recorded cells of a given type. Y was allowed to vary between cells to account for differences in sensitivity, while the remaining parameters were constrained to be identical. These parameters are given in the associated Matlab code.

Inverse model

The differential equations that comprise our model consist of several time-independent nonlinearities (Eqs. 4 and 6), several linear differential equations (Eqs. 1, 2, 5), and one nonlinear differential equation (Eq. 3). The time-independent nonlinearities can be inverted directly as a look-up table - e.g. the cGMP depends directly on the current as G = (I/k)1/n. The linear differential equations can be solved directly by deconvolution - e.g. the Fourier transform of the calcium concentration from Eq. 5 is obtained directly from the Fourier transform of the measured current (I(f)) as

where f is the temporal frequency. The nonlinear differential equation (Eq. 3) can be reexpressed as an equation for the PDE activity P in terms of the cGMP concentration G and its derivative dG/dt as

This equation can be solved given G (obtained from the current via Eq. 4), and S (obtained from the calcium concentration C(t) and Eq. 6). These steps in the inversion process are illustrated in Figure 3.

This process results in an exact mapping between responses and stimuli - i.e. every response I(t) has a corresponding unique stimulus Stim(t). In practice, noise in the measured responses can corrupt the estimated stimuli. This is a general problem in deconvolution of noisy data – e.g. in microscopy. As indicated in the text, we controlled this noise when necessary by imposing that the power spectrum of our estimated stimulus be equal to the power spectrum of the true stimulus. This constraint was imposed by generating the estimated stimulus as described above, and then reweighting the power spectrum of the estimated stimulus to match the power spectrum of the true stimulus. This constraint was used in generating the estimated stimuli in Figure 4.

Predictably altering photoreceptor responses

Model inversion as described above allowed us to identify stimuli that would produce a desired phototransduction current (which we refer to as a target response). For the applications in Figures 5-9 we generated this response using a linear model of the transduction cascade. We passed an initial stimulus (e.g. a sinusoid in Figures 5 and 6) through the linear model to generate the target response. The model inversion process for the full (i.e. nonlinear) phototransduction cascade model then determined the stimulus which, when delivered to the full model, would produce the linear target response. In particular, this allowed us to identify stimuli that negated the impact of adaptation on the photoreceptor responses.

The linear models used to generate target responses were obtained by fitting a parameterized linear filter to responses of the full model to low contrast Gaussian noise. The filter had the form

where a is a scaling factor and tDand tR are time constants. This model provides good fits to measured responses of rods and cones (Angueyra and Rieke 2013). Photoreceptor responses to low contrast stimuli depend linearly on the stimulus (Hass et al. 2015), and hence this approach ensured that the linear model matched the full model for such stimuli. Linear models were fit at a specific mean light intensity matched to that of the stimulus for the full model. Hence the target response reflected how a linear photoreceptor model, at a given mean light level, would respond to the initial stimulus.

To manipulate the response kinetics, the time constants of the linear model were scaled and the process above repeated to generate appropriate stimuli.

Acknowledgements

We thank Shellee Cunnington for excellent technical support. We are grateful for assistance from the WaNPRC staff, especially Chris English, for access to primate tissue. Shane Boniec, Greg Horwitz and Saad Idrees provided invaluable feedback on a draft of the manuscript. This work was supported by NIH grant EY028542 (FR) and the Air Force Office of Scientific Research under award number FA9550-21-1-0230 (special thanks to the AFOSR’s Cognitive & Computational Neuroscience Program.

Tests of the ability of the model to generalize across stimuli and across cells. A. Generalization across stimuli. Responses of primate and mouse rods were fit to the variable mean noise stimulus only and model performance was evaluated for responses to the flash family and flashes and steps, using the fraction of variance explained to evaluate the fits (see Figure 1). This generalization performance to stimuli not included in the fit (y-axis) is compared to performance when the model is fit to all stimuli (x-axis). B. Generalization across cells. Models were fit to the variable mean noise stimulus. The performance of each cell was evaluated for a model in which all cells were fit (x-axis), or in which the test cell was excluded from the fit (y-axis).

Errors in stimulus estimation. Power spectra of stimulus and residual (stimulus - estimate) for example primate and mouse rods and cones. The stimulus and residual power spectra diverge strongly at low frequencies where the estimates are close to the actual stimulus, and converge at higher frequencies when the estimate becomes poor. The convergence point depends on photoreceptor type, as expected for the different kinetics of rod and cone responses.