Systemic kappa opioid receptor antagonism accelerates reinforcement learning via augmentation of novelty processing in male mice

Farahbakhsh, Zahra Z.; Song, Keaton; Branthwaite, Hannah E.; Erickson, Kirsty R.; Mukerjee, Snigdha; Nolan, Suzanne O.; Siciliano, Cody A.

doi:10.1038/s41386-023-01547-x

Download PDF

Article
Open access
Published: 17 February 2023

Systemic kappa opioid receptor antagonism accelerates reinforcement learning via augmentation of novelty processing in male mice

Neuropsychopharmacology volume 48, pages 857–868 (2023)Cite this article

4082 Accesses
2 Citations
32 Altmetric
Metrics details

Subjects

Abstract

Selective inhibition of kappa opioid receptors (KORs) is highly anticipated as a pharmacotherapeutic intervention for substance use disorders and depression. The accepted explanation for KOR antagonist-induced amelioration of aberrant behaviors posits that KORs globally function as a negative valence system; antagonism thereby blunts the behavioral influence of negative internal states such as anhedonia and negative affect. While effects of systemic KOR manipulations have been widely reproduced, explicit evaluation of negative valence as an explanatory construct is lacking. Here, we tested a series of falsifiable hypotheses generated a priori based on the negative valence model by pairing reinforcement learning tasks with systemic pharmacological KOR blockade in male C57BL/6J mice. The negative valence model failed to predict multiple experimental outcomes: KOR blockade accelerated contingency learning during both positive and negative reinforcement without altering innate responses to appetitive or aversive stimuli. We next proposed novelty processing, which influences learning independent of valence, as an alternative explanatory construct. Hypotheses based on novelty processing predicted subsequent observations: KOR blockade increased exploration of a novel, but not habituated, environment and augmented the reinforcing efficacy of novel visual stimuli in a sensory reinforcement task. Together, these results revise and extend long-standing theories of KOR system function.

Remifentanil self-administration in mice promotes sex-specific prefrontal cortex dysfunction underlying deficits in cognitive flexibility

Article 19 May 2021

Oxytocin and orexin systems bidirectionally regulate the ability of opioid cues to bias reward seeking

Article Open access 04 October 2022

Kappa-opioid receptor stimulation in the nucleus accumbens shell and ethanol drinking: Differential effects by rostro-caudal location and level of drinking

Article Open access 25 March 2024

Introduction

Central and peripheral kappa opioid receptors (KORs) are targets of a wide variety of pharmacotherapeutics. Most currently approved compounds are multi-target drugs, with varying pharmacodynamic actions and selectivity for KORs. These drugs are used for the treatment of a diverse array of disorders including, but not limited to, nalfurafine for pruritus [1], alfentanil for nociceptive [2] and levorphanol for neuropathic pain [3], naltrexone for substance use disorders [4], and eluxadoline for irritable bowel syndrome [5, 6]. Development of selective KOR compounds has been a longstanding goal in medicinal chemistry and recent breakthroughs have produced clinically viable, highly selective KOR agonists and antagonists, several of which are in various stages of clinical evaluation (National Clinical Trial number: NCT02800928, NCT02218736, NCT01913535, NCT02641028, NCT02475447) [1, 7, 8]. In particular, clinically viable selective KOR antagonists have been highly anticipated due to wide consensus in the preclinical literature that systemic blockade of KORs holds great promise for the treatment of several neuropsychiatric disorders including depression, substance use disorders, and anxiety [9,10,11,12].

In preclinical models, KOR activity has been causally linked to the underlying behavioral symptomatology of multiple neuropsychiatric diseases. For example, systemic administration of KOR antagonists in model species reliably reverses escalated drug and alcohol consumption resulting from chronic exposure [13,14,15,16,17], ameliorates depression-like phenotypes [18, 19], and can prevent the behavioral consequences of chronic stress [20,21,22,23]. It is thought that KOR modulation of addiction-, depression-, and anxiety-related behaviors stems from its endogenous function as a negative valence system. Indeed, the prevailing and widely accepted model of KOR’s role in neuropsychiatric disease posits that acute activation of KORs produces dysphoria and that experience-dependent upregulation of this system drives aberrant behavioral states such as anhedonia in depression and negative affect during periods of drug abstinence in addiction [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41]. This model of the KOR’s primary role in neurobehavioral processes centers on negative valence processing and motivated behaviors driven by aversive internal states, which are both critical but latent constructs (i.e., cannot be directly observed). Though the latent nature of the variables involved does not allow for a straightforward operationalized definition, the theory has been semantically formalized by multiple literatures. The theory states that KOR activity is involved in negative valence domains (e.g., acute threat, potential threat, sustained threat, loss, and frustrative nonreward) and is critical for the development of subsequent behavioral responses to aversive stimuli, such as negative reinforcement [12, 22, 25, 42, 43]. The role of KORs in these constructs are typically tested in behavioral assays such as learned helplessness [44], forced swim stress [18], stress-induced reinstatement [45, 46], and intracranial self-stimulation [47].

Despite robust and widely reproduced findings that KORs are directly involved in symptomologies associated with substance use and mood disorders [13, 15, 17, 18, 48, 49], few studies have systematically evaluated a priori predictions from the negative valence model of KOR function, which represents a critical missing link in determining the veracity and utility of this framework [50,51,52,53]. Thus, we sought to directly evaluate predictions of the negative valence model of KOR function in mice during reinforcement learning – a quantitative framework recognized across disciplines for its utility in investigating fundamental processes relevant to basic functions and disease states [54,55,56,57]. We found that the negative valence model was insufficient to explain valence-independent experimental outcomes of KOR antagonism and thus proposed novelty processing – a construct which is critical for the identification and learning of stimuli with reinforcement-predictive value – as an a posteriori explanation. Generating axiomatic hypotheses based on novelty processing as a latent construct underlying KOR modulation of behavior predicted multiple experimental outcomes whereby systemic KOR antagonism augmented measures of novelty exploration and novelty-driven intrinsic motivation. Together, these findings call for re-evaluation of long-standing theories regarding KOR system function and delineate KOR control of a conserved neurobehavioral domain broadly implicated in motivated behaviors.

Methods and materials

Subjects

Adult, male C57BL/6J mice were used for all experiments (Jackson Laboratory, Bar Harbor, ME). Mice were group-housed (5 per cage) in a reverse 12-hour light-dark cycle room. Water was available ad libitum and 2.8–3 g of chow per mouse (Labdiet 5L0D) was provided daily, sufficient to maintain mice at healthy adult weights throughout the course of experiments (20–30 g bodyweight). All experiments involving the use of mice were in accordance with NIH guidelines and approved by the Vanderbilt University Institutional Animal Care and Use Committee.

Drugs

The selective and potent KOR antagonist norbinaltorphimine (NorBNI) [58] was graciously provided by the NIDA Drug Supply Program. Mice were treated with NorBNI (10 mg/kg) or saline in a volume of 10 mL/kg intraperitoneally (i.p.) 24 h prior to the start of behavioral testing. NorBNI is long-acting, and extensive pharmacodynamic and pharmacokinetic analyses show that it remains detectable in the central nervous system for at least 21 days following a single injection, concomitant with inactivation of KORs over this time period [59,60,61].

Positive reinforcement

During positive reinforcement experiments, operant boxes were equipped with a nose-poke port on either side, a small cue light above each port, a liquid delivery port in the center, and a house light on the opposite wall. One nose-poke port was active (counterbalanced across mice) on which responses in the presence of a discriminative stimulus (S^D) resulted in the delivery of sucrose. The other nose-poke inactive with no consequent stimulus to a response. On a variable-time schedule ranging from 20–40 s (average 30 s), a cue light above the active side was illuminated for up to 30 s and served as an S^D. A response on the active side during the S^D period was deemed a ‘correct response’ and resulted in the delivery of 10 µL of a 10% sucrose solution (w/v), the illumination of a cue light above the liquid delivery port, and the termination of the S^D. During the interval between S^D presentations, referred to as the S^Δ period (i.e., a condition negatively correlated with reinforcement) [62,63,64,65,66,67], a response on the active side resulted in a 30 s timeout period signaled by the illumination of a house light. Responses on either port during the timeout period had no consequence and the variable-time scheduled was discontinued such that no other stimuli were presented. After 30 s had elapsed, the house light terminated, signaling the beginning of another S^Δ period and resumption of variable-time schedule.

Negative reinforcement

During negative reinforcement experiments, the operant chamber and general procedures were the same as described above, but behavior was reinforced by electrification of the metal grid floor (footshock). Counterbalanced between mice, one nose-poke port was an active port on which responses in the presence of the S^D resulted in the avoidance or escape from footshocks, and the other was inactive with no consequent stimulus to a response. On a variable-time schedule ranging from 20–40 s (average 30 s), a cue light above the active side was illuminated (S^D). If no response was made within 30 s of S^D onset, a series of 20 mild (0.15 mA) footshocks would begin. Within the series of footshocks, each footshock was 0.5 s in duration and there was a period of 15 s between the offset of one footshock and the onset of the next. The S^D remained illuminated during the series of footshocks and was extinguished after all 20 shocks were delivered or after a correct response was made. A response on the active side during the first 30 s of the S^D resulted in the complete avoidance of footshocks and a 1-minute extension of the S^Δ period. A response on the active side after the footshock series began would result in the escape from the remainder of the series concomitant with beginning of the next S^Δ period. Both responses on the active side which resulted in the avoidance and escape of the footshocks were deemed ‘correct responses.’ Unlike in the positive reinforcement task (where a response during the S^Δ period resulted in a timeout), during negative reinforcement a response on the active side during the S^Δ period had no consequence. Consistent with the positive reinforcement task, any response on the inactive side had no consequence.

Crossover treatment

After meeting acquisition criteria (positive reinforcement, see supplemental methods) or after the 15th session (negative reinforcement), a subset of mice received the opposite treatment that they were given at the beginning of reinforcement learning at least 30 min after removal from the operant box. Thus, saline pretreated mice were given NorBNI post-acquisition (saline→NorBNI) and vice versa (NorBNI→saline). Subjects were tested over three sessions following the crossover treatment under identical experimental conditions as described above. For each mouse that received the crossover treatment, post-treatment performance was calculated by normalizing to pretreatment values ([average 3 days post]/[average 3 days pre]x100).

Novelty response

At least 24 h after i.p. treatment with NorBNI (10 mg/kg) or saline, mice performed the first (Day 1, novel) of two open field tests. On day 1, each mouse was placed in the center of the arena and allowed to explore for the duration of the 1 h session, after which they were immediately removed from the apparatus. The following day (Day 2, familiar), mice were re-tested using the same procedure. Distance traveled (cm) per 5 minute bin throughout both sessions was calculated with Noldus Ethovision video-tracking.

Sensory reinforcement

Fixed-ratio 1

A seperate cohort of mice was trained on a fixed-ratio 1 (FR 1) task in a box equipped with one response lever (side counterbalanced) and three stimulus lights in the center for 5 1-hour sessions during which a response on the lever would result in randomized light flashes and a response during the reinforcer period had no consequence. Flash duration (4, 6, 8, and 10 s) and frequency (0.5, 1, 2.5, 5, 10, 12.5, and 25 Hz) were randomized for each reinforcer earned. For each flash of light, the cue light (top, middle, bottom) which was illuminated was also randomized. At least 30 min after the fifth session, mice were treated with saline or NorBNI (10 mg/kg). The next day, mice ran the same FR 1 task to determine if treatment altered responding for the novel stimuli.

Behavioral economics

The behavioral economics procedure was designed such that mice had 10 min to respond for the novel stimuli at each price. For the first 10 min of the session the price was 1 lever press (FR 1) and the mouse was only constrained by the number of reinforcers that would fit in the time window. During the following 10 min the price was 3 lever presses (FR 3) for a reinforcer and so on with ratios FR 5, 10, 20, 30, and 60. A demand curve was fit using the equation \(\log Q = \log Q_0 + k\left( {e^{ - a \times \left( {Q_0 \times C} \right)} - 1} \right)\) to derive Q₀, standardized P_max, and O_max, as previously described [68]. Q₀ is the consumption as price approaches zero, standardized P_max is the first unit price point at which the first derivative point slope of the function equals −1 multiplied by Q₀ to standardize, and O_max is the number of responses at P_max.

Results

Systemic KOR antagonism accelerated discriminated operant responding during positive reinforcement learning

Given the prevalence of valence processing frameworks for interpreting KOR control of motivated behaviors, we first sought to examine how modulation of this system altered learning reinforced by positive and negative operant contingencies where differentially valenced stimuli can be evaluated under analogous experimental conditions [69]. Based on literature positing that KOR activation mediates responses to aversive stimuli and thus drives negative reinforcement, we hypothesized that KOR antagonism would selectively impair negative reinforcement learning with minimal impact on positive reinforcement. To test this hypothesis, mice were given a single i.p. injection of NorBNI (10 mg/kg) or saline 24 h prior to the first behavioral session (Fig. 1A).

**Fig. 1: Systemic KOR antagonism increased rate of positive reinforcement learning.**

Positive reinforcement learning was tested in an operant box in daily, one-hour sessions (Fig. 1B). In contrast to our hypothesis, we found that a single injection of NorBNI increased the rate of discriminated positive reinforcement learning across multiple measures of performance (Fig. 1C–F). Analysis of temporal patterns of responding over sessions revealed that NorBNI treatment decreased the number of S^D presentations necessary to reach half-maximal probability of emitting a correct response during the stimulus presentation (Fig. 1C), indicative of augmented acquisition of the discriminated operant contingency. This was not due to increased responding in general as the number of responses on the inactive port decreased more rapidly in mice treated with NorBNI compared to saline controls (Fig. 1D). Congruently, NorBNI treatment robustly accelerated discrimination between the active and inactive nose-poke ports as assessed by a side discrimination index (Fig. 1E). KOR antagonism also increased the rate at which mice learned to optimize response latency following S^D onset (Fig. 1F). This surprising effect of KOR antagonism on learning rate under a positive reinforcement contingency was not due to weight differences between groups nor a difference in the ability of mice to eventually acquire the task (Supplementary Fig. 1A, B). Together, these findings suggest that NorBNI augments learning even when behavior is reinforced under a positive contingency by a reinforcer with positive valence.

KOR antagonism selectively increased learning rate without impacting maximal performance

Despite its influence on learning rate, KOR blockade showed no effect on maximal performance across all the metrics examined (Supplementary Fig. 1C–F), suggesting that the impact of KOR blockade is specific to the acquisition of learned behaviors, as opposed to behavioral expression of previously learned knowledge. To directly evaluate this hypothesis, a subset of subjects was tested using a crossover design (Fig. 2A). Over the 4 days at or above acquisition criteria, there was no difference between groups in percent correct or average deliveries, confirming that prior to the crossover treatment both groups plateaued at the same level of task performance (Fig. 2B, C). NorBNI administered after task acquisition had no effect on percent correct responses or reinforcers earned per session, demonstrating that KOR blockade does not impact performance of the task after learning has occurred (Supplementary Fig. 2A, B, Fig. 2D, E). Thus, systemic antagonism of KORs accelerates positive reinforcement learning but has no effect on maximal performance or on a previously learned contingency.

**Fig. 2: Systemic KOR blockade did not alter performance of a previously learned contingency during positive reinforcement.**

KOR modulation of learning rate was not due to altered consummatory response

From the results thus far, it is not clear whether NorBNI acts to modulate global variables affecting learning or, alternatively, alters the motivational value or preference for sucrose to produce a reinforcer-specific effect. To evaluate this possibility, we next analyzed sucrose consumption across positive reinforcement sessions. There were no differences between groups in average bout duration, lick bouts per sucrose delivery, or licks per bout at any point throughout the task (Supplementary Fig. 3A–F). This suggests that NorBNI had no effect on the motoric action of licking or consummatory response for the 10% sucrose solution used as a reinforcer during operant learning.

To test the impact of KOR blockade on fluid consumption and sucrose preference independent of reinforcement learning, we next conducted an open access two-bottle choice experiment. Within-subject, pre-post comparison revealed that NorBNI had no effect on overall licking or microstructure patterns for 1% sucrose (Supplementary Fig. 4A–C). NorBNI had no effect on consummatory behavior across the dose-response curve (Supplementary Fig. 4D–F). There was also no difference in the number of licks or the lick microstructure for water after treatment with either saline or NorBNI (Supplementary Fig. 5A–C). Together, these data show that KOR antagonism does not alter innate responses to sucrose, a stimulus with positive valence, and indicate that augmented rate of positive reinforcement learning is more likely to be driven by modulation of global constructs underlying acquisition of learned behaviors.

Systemic KOR antagonism increased reinforcement learning rate independent of reinforcer valence

We next sought to directly evaluate the central prediction of the negative valence model: that KOR blockade disrupts processing of aversive stimuli thereby attenuating motivated behaviors driven by negative reinforcement contingencies. Mice were treated with either NorBNI or saline prior to the first negative reinforcement session (Fig. 3A). Over sessions, subjects increased responding on the active side without changing responding on the inactive side and received fewer shocks, demonstrating that footshocks functioned as a negative reinforcer under these conditions (Fig. 3B). In contrast to canonical theories of KOR function, antagonism increased the rate of negative reinforcement learning across measures of performance, mirroring KOR modulation of positive reinforcement learning (Fig. 3C–E). KOR blockade accelerated the decay rate over session time for shocks delivered (Fig. 3C), probability of response omission during the S^D period (Fig. 3D), and latency to respond following S^D onset (Fig. 3E). Similar to effects on positive reinforcement learning, KOR antagonism did not impact performance during negative reinforcement when treatment was given after initial learning had occurred (Supplemental Fig. 6A–C, Fig. 3F–H).

**Fig. 3: Systemic KOR blockade increased the rate of negative reinforcement learning.**

KOR antagonism did not alter unconditioned responses to footshock

To investigate whether NorBNI treatment alters the unconditioned response to footshock, mice without any prior experimental experience received non-contingent, unsignaled footshocks in an operant chamber. Analysis of locomotion aligned to footshock onset revealed a time-locked, intensity-dependent increase in velocity in response to footshock, as expected (Supplementary Fig. 7A). However, NorBNI treatment did not alter responsiveness to footshock over any of the intensities tested (Supplementary Fig. 7B, C). This demonstrates that KOR blockade does not alter unconditioned responses to footshock, which is incongruent with a central tenant of the negative valence model and furthers supports the hypothesis that KOR modulation of learning rate is independent of reinforcer valence and modality.

KOR antagonism selectively augmented novelty exploration independent of habituation rate, general ambulatory activity, and neophobic avoidance behaviors

Though the results are incongruent with the negative valence model, the experiments above do not provide an alternative explanation for KOR control of learning. Thus, we next aimed to identify a construct congruent with our results which could be used to generate falsifiable axiomatic hypotheses. Human, animal, and in silico studies have reached broad consensus that novelty processing is a critical construct which influences reinforcement learning independent of stimulus valence and primary motivational drives maintaining response-reinforcer associations [70,71,72,73,74]. This factor is particularly relevant for acquisition of stimulus-response-reinforcer contingencies where behavior comes under the control of a neutral, antecedent stimulus due to its function as an S^D rather than as a primary reinforcer [75,76,77]. Increased novelty processing augments learning rate by allocating increased attention towards, and/or assigning intrinsic value to, novel choices and stimuli, which has pronounced effects on learning to associate neutral cues with availability of primary reinforcers without impacting learning curve asymptote [70, 78,79,80]. In sum, the impact of increased novelty exploration on learning is congruent with the effects of KOR blockade observed in the experiments above. Therefore, if novelty exploration is modulated by KOR antagonism it may provide a more holistic explanation for its impact on motivated behaviors.

To test the hypothesis that systemic KOR blockade augments responsiveness to novelty, we used a well-validated measure of novelty exploration: locomotor response to a novel environment and habituation of exploratory behavior over experience [81,82,83,84]. Absolute distance traveled per five minutes followed a predictable time-course whereby activity was pronounced early in the session, eventually reaching a peak after which movement steadily decreased over the remainder of the 60 min (Fig. 4A). Normalizing activity to the first five minutes of the session to account for variance unrelated to exploratory drive allows clear assessment of the novelty-driven exploration phase, eventually followed by a steady decrease in locomotion over time dictated by habituation rate (Fig. 4B). Systemic KOR blockade by NorBNI augmented the degree to which activity increased above baseline but did not alter the rate at which locomotion decreased over the session (Fig. 4C, D). This resulted in a considerably prolonged duration of novel environment-induced increases in activity whereby NorBNI treated mice did not return to baseline activity until the end of the 60-minute session as opposed to 15 min or less in controls.

**Fig. 4: KOR antagonism increased novelty exploration.**

In contrast to when the environment was novel, there was no group differences in the degree to which activity was increased above baseline initially or the rate at which locomotor activity habituated over time during the second session when the environment was familiar (Fig. 4E–H), confirming that differences observed in the first session are specific to a novel environment, and that KOR blockade does not appear to influence spontaneous locomotor activity. Further, increased novelty-induced exploratory behavior was not associated with reciprocal changes in neophobic processes, as additional analysis revealed that novelty-dependent avoidance of the center of the open field area did not differ between groups during either session (Supplemental Fig. 8A–D).

KOR blockade augmented essential reinforcing value of novel stimuli

To directly assess novelty-driven reinforcement without habituation of responding, we utilized a sensory reinforcement task where responding is reinforced by identical visual stimuli throughout but longitudinal behavior can be maintained by randomizing the pattern and frequency of each stimulus presentation (Fig. 5A, Supplementary Video 1) [85,86,87]. Randomized illumination of the cue lights presented on a FR 1 schedule maintained robust responding which increased over sessions, demonstrating that the stimulus reliably functioned as a reinforcer under these conditions and is insensitive to habituation (Fig. 5B).

**Fig. 5: Systemic KOR blockade increased the intrinsic motivational value of novel stimuli.**

Treatment with NorBNI increased the number of novel reinforcers earned under a FR 1 schedule (Fig. 5C). Though responding under an FR 1 contingency demonstrates that novel sensory stimuli are more reinforcing with KOR blockade, the minimal effort required to make one response makes continuous reinforcement insufficient to measure motivation for presentation of the stimulus. To better assess novelty-driven intrinsic motivation, we used a within-session threshold procedure to quantify demand elasticity, or the degree to which consumption of a commodity changes as a function of price – a widely accepted measurement of essential motivational properties of reinforcers [88, 89]. The number of responses made at each price, the number of ratios completed (consumption), and the response records were recorded (Fig. 5D-F). A demand curve was fit to extract behavioral economic parameters for each subject (Fig. 5G). Q₀ is the subject’s preferred level of commodity consumption, or the amount the subject would consume at a minimally constraining price. Standardized P_max represents the amount consumed at the price at which demand becomes elastic and the subject stops responding sufficiently to maintain the desired level of consumption, and O_max is the number of responses made at P_max. Mice treated with NorBNI showed more motivation for novel stimuli, with an increase in Q₀, standardized P_max, and O_max (Fig. 5H–J).

To ensure that augmented motivation was specific to responding for novel stimuli rather than any sensory stimuli or responding per se, a separate cohort was tested using a continuous, 30 s cue light illumination as a reinforcer which did not vary across presentations (Supplemental Fig. 9A). There was no difference in responding for this stimulus between saline and NorBNI mice (Supplemental Fig. 9B) suggesting that the effect of NorBNI on intrinsic motivation is specifically novelty-driven. Together, these experiments demonstrate that KOR blockade increases the intrinsic motivational value of novel stimuli and provides further evidence supporting the utility of novelty processing as an explanatory construct underlying KOR modulation of motivated behaviors.

Discussion

Here, we assessed predictions of the negative valence model of KOR function starting with the hypothesis that systemic inhibition of KOR activity would not alter positive reinforcement but would decrease the rate of behaviors reinforced under a negative contingency. We measured the effect of KOR antagonism on positive and negative reinforcement learning and found that, contrary to our hypothesis, KOR blockade accelerated the rate of learning both under a positive and negative contingency, regardless of reinforcer valence. Augmented performance was selective to learning/task acquisition as previously learned behaviors and maximal performance were unaffected. Further, KOR antagonism did not alter innate responses to sucrose or footshock, as would be expected if valence processing were modulated. Together, our results demonstrate that a negative valence framework for the KOR system does not accurately predict effects of KOR modulation on basic behaviors.

Importantly, while our findings are incongruent with standing theories, they are not in conflict with the empirical results in the literature. Studies demonstrating beneficial effects of KOR antagonists that have been interpreted within the negative valence framework have typically examined single endpoints (e.g., reduced drug and alcohol intake under free-access conditions, reduced immobility during forced swim test). Drawing causal inferences with any latent construct necessitates intersectional analysis of multiple observable variables and, as a corollary, any observable effect taken in isolation can be explained by several underlying latent constructs [90,91,92]. As such, the latent constructs responsible for the pharmacotherapeutic actions of KOR antagonists remains debatable. Though the putative therapeutic effects of KOR antagonism have been widely reproduced within specific model paradigms, they often do not generalize across experimental setting and disparate conclusions can be found throughout the literature with studies demonstrating no effect of KOR antagonism [93, 94], evidence of KOR agonists both increasing and decreasing substance use [95,96,97,98,99,100,101,102,103], and results showing anxiolytic effects of KOR agonism [104, 105]. Furthermore, there are many instances of investigations linking KOR activation in specific circuits and brain regions with functions outside of negative valence processing. For example, intra-striatal microinjections of KOR agonists can increase hedonic response to sucrose, induce conditioned place preference, and decrease anxiety-like behaviors depending on the subregion targeted [106, 107]. Similarly, optogenetic activation of subsets of dynorphin releasing neurons can drive opposing effects on place preference behavior depending on anatomical location within the ventral striatum [108]. Intriguingly, our results are highly congruent with recent clinical investigations demonstrating that KOR antagonism in mood disorder patients augments behavioral flexibility and learning rate without affecting hedonic responses to primary rewards [109, 110].

Although multiple lines of evidence have demonstrated complexity in the role for the KOR system beyond mediating only aversion and dysphoria, as has long been accepted, previous findings have typically been interpreted as circuit-specific effects of KOR activation rather than as incongruent with the global theory of the system. Having directly evaluated the negative valence model, we next sought to identify a construct that could explain our results and potentially provide better predictive validity moving forward. We found that systemic KOR blockade augments one of the strongest innate drivers of behavior – novelty processing. The influence of KORs on novelty processing has potentially wide-reaching implications; indeed, attraction to the unknown is a prerequisite for higher-order knowledge and is thought to influence virtually all aspects of human and animal behavior [111,112,113,114], including learning [78, 115, 116], behavioral flexibility [117,118,119], and even pain processing [120,121,122,123,124]. Regarding reinforcement learning specifically, the ability of an organism to recognize and respond to novelty is critical to its ability to adapt to the environment and the tendency for novelty to stimulate exploratory behaviors guarantees diverse experiences required for learning complex contingencies [85, 125, 126]. As such, novelty processing plays a key role in operant reinforcement task acquisition and highly influences the rate of learning through a variety of mechanisms [78, 127]. For example, due to the stochasticity of initial interactions with the operandum, as they have no known value to the subject, increased exploration of a novel environment can augment learning simply by increasing the probability of triggering the operant contingency. More importantly, heightened intrinsic reinforcing value associated with novelty or increased allocation of attention towards novel stimuli can increase the salience and the likelihood of long-term encoding of initial action-outcome pairings [128, 129]. Thus, increased response to novel environments and stimuli augments learning rate by modulating attention, motivation, and memory formation during operant tasks [127].

Future work exploring how KOR regulation of novelty processing modulates additional forms of learning and behavioral flexibility through reversal, extinction, or set-shifting tasks (for example) will be critical to understanding the complex way in which this system is implicated in behavior. Though the effect of KOR blockade on novelty processing suggests a role for endogenous signaling in this process, it is worth noting that this does not exclude the possibility that exogenous activation of the system modulates other behavioral processes. It is also important to consider that this work, and that which served as the foundation of the negative valence model, were conducted with male mice and there is a wide range of literature suggesting major sex differences in effects of KOR activity [130,131,132,133,134,135], thus, further work is certainly required to understand the role of the KOR system in motivated behaviors in females. Despite these limitations, evidence that systemic KOR antagonism modulates novelty processing provides a new avenue for understanding the role of this system in a variety of adaptive and maladaptive behaviors.

As mentioned above, widely reproduced effects of KOR antagonists in preclinical substance use disorder and depression models can be putatively explained by several underlying constructs, including modulation of novelty processing. For example, decreased responding during drug self-administration can also fit a novelty exploration model, as augmented novelty exploration is likely to reduce rates of ongoing behavior by assigning value to novel choices [136]. Likewise, increased exploration of a novel environment may slow the emergence of immobility during forced swim assays [137]. The complex interplay between anxiety and novelty whereby an increase in novelty processing could present either as anxiolytic (neophilia) or anxiogenic (neophobia) behavior depending on the magnitude of novelty and the intensity of the stimulus [77] also suggests that this model may fit the wealth of literature implicating the KOR system in anxiety-like behaviors [138,139,140]. Assessing these possibilities goes well beyond the scope of the current report, and we do not claim that the previous results demonstrate causality of altered novelty exploration, nor that claims of negative valence modulation are necessarily false. It is certainly conceivable that negative valence processing explains the effects of KOR antagonism under some specific conditions. Rather, our results directly challenge the utility of the negative valence framework as the primary model of KOR function and call for re-evaluation, via empirical assessment, of conclusions that draw from this framework.

References

Inui S. Nalfurafine hydrochloride to treat pruritus: a review. Clin Cosmet Investig Dermatol. 2015;8:249–55.
Article PubMed PubMed Central Google Scholar
Moman RN, Mowery ML, Kelley B. Alfentanil. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2022. Available from: https://www.ncbi.nlm.nih.gov/books/NBK470456/.
Prommer E. Levorphanol: the forgotten opioid. Support Care Cancer. 2007;15:259–64.
Article PubMed Google Scholar
Singh D, Saadabadi A. Naltrexone. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2022. Available from: https://www.ncbi.nlm.nih.gov/books/NBK534811/.
Brenner DM, Sayuk GS. Current US Food and Drug Administration-approved pharmacologic therapies for the treatment of irritable bowel syndrome with diarrhea. Adv Ther. 2020;37:83–96.
Article PubMed Google Scholar
Corsetti M, Whorwell P. New therapeutic options for IBS: the role of the first in class mixed µ- opioid receptor agonist and δ-opioid receptor antagonist (mudelta) eluxadoline. Expert Rev Gastroenterol Hepatol. 2017;11:285–92.
Article CAS PubMed Google Scholar
Rorick-Kehn LM, Witkin JM, Statnick MA, Eberle EL, McKinzie JH, Kahl SD, et al. LY2456302 is a novel, potent, orally-bioavailable small molecule kappa-selective antagonist with activity in animal models predictive of efficacy in mood and addictive disorders. Neuropharmacology. 2014;77:131–44.
Article CAS PubMed Google Scholar
Zhou Y, Kreek MJ. Combination of clinically utilized kappa-opioid receptor agonist nalfurafine with low-dose naltrexone reduces excessive alcohol drinking in male and female mice. Alcohol Clin Exp Res. 2019;43:1077–90.
CAS PubMed PubMed Central Google Scholar
Knoll AT, Carlezon WA. Dynorphin, stress, and depression. Brain Res. 2010;1314:56–73.
Article CAS PubMed Google Scholar
Carlezon WA, Béguin C, Knoll AT, Cohen BM. Kappa-opioid ligands in the study and treatment of mood disorders. Pharm Ther. 2009;123:334–43.
Article CAS Google Scholar
Domi E, Barbier E, Augier E, Augier G, Gehlert D, Barchiesi R, et al. Preclinical evaluation of the kappa-opioid receptor antagonist CERC-501 as a candidate therapeutic for alcohol use disorders. Neuropsychopharmacology. 2018;43:1805–12.
Article CAS PubMed PubMed Central Google Scholar
Carlezon WA, Krystal AD. Kappa-opioid antagonists for psychiatric disorders: from bench to clinical trials. Depress Anxiety. 2016;33:895–906.
Article CAS PubMed PubMed Central Google Scholar
Wee S, Orio L, Ghirmai S, Cashman JR, Koob GF. Inhibition of kappa opioid receptors attenuated increased cocaine intake in rats with extended access to cocaine. Psychopharmacology. 2009;205:565–75.
Article CAS PubMed PubMed Central Google Scholar
Walker BM, Zorrilla EP, Koob GF. Systemic κ-opioid receptor antagonism by nor-binaltorphimine reduces dependence-induced excessive alcohol self-administration in rats. Addiction Biol. 2011;16:116–9.
Article CAS Google Scholar
Walker BM, Koob GF. Pharmacological evidence for a motivational role of kappa-opioid systems in ethanol dependence. Neuropsychopharmacology. 2008;33:643–52.
Article CAS PubMed Google Scholar
Valenza M, Windisch KA, Butelman ER, Reed B, Kreek MJ. Effects of Kappa opioid receptor blockade by LY2444296 HCl, a selective short-acting antagonist, during chronic extended access cocaine self-administration and re-exposure in rat. Psychopharmacology. 2020;237:1147–60.
Article CAS PubMed Google Scholar
Uhari-Väänänen J, Eteläinen T, Bäckström P, Oinio V, Carroll FI, Raasmaja A, et al. The selective κ-opioid receptor antagonist JDTic attenuates the alcohol deprivation effect in rats. Eur Neuropsychopharmacol. 2019;29:1–11. https://doi.org/10.1016/j.euroneuro.2019.10.003.
Mague SD, Pliakas AM, Todtenkopf MS, Tomasiewicz HC, Zhang Y, Stevens WC Jr, et al. Antidepressant-like effects of kappa-opioid receptor antagonists in the forced swim test in rats. J Pharm Exp Ther. 2003;305:323–30.
Article CAS Google Scholar
Pliakas AM, Carlson RR, Neve RL, Konradi C, Nestler EJ, Carlezon WA Jr. Altered responsiveness to cocaine and increased immobility in the forced swim test associated with elevated cAMP response element-binding protein expression in nucleus accumbens. J Neurosci. 2001;21:7397–403.
Article CAS PubMed PubMed Central Google Scholar
Knoll AT, Meloni EG, Thomas JB, Carroll FI, Carlezon WA. Anxiolytic-like effects of kappa-opioid receptor antagonists in models of unlearned and learned fear in rats. J Pharm Exp Ther. 2007;323:838–45.
Article CAS Google Scholar
Land BB, Bruchas MR, Lemos JC, Xu M, Melief EJ, Chavkin C. The dysphoric component of stress is encoded by activation of the dynorphin kappa-opioid system. J Neurosci. 2008;28:407–14.
Article CAS PubMed PubMed Central Google Scholar
Bruchas MR, Land BB, Chavkin C. The dynorphin/kappa opioid system as a modulator of stress-induced and pro-addictive behaviors. Brain Res. 2010;1314:44–55.
Article CAS PubMed Google Scholar
Jacobson ML, Wulf HA, Browne CA, Lucki I. The kappa opioid receptor antagonist aticaprant reverses behavioral effects from unpredictable chronic mild stress in male mice. Psychopharmacology. 2020;237:3715–28.
Article CAS PubMed PubMed Central Google Scholar
Carlezon WA Jr, Béguin C, DiNieri JA, Baumann MH, Richards MR, Todtenkopf MS, et al. Depressive-like effects of the kappa-opioid receptor agonist salvinorin A on behavior and neurochemistry in rats. J Pharm Exp Ther. 2006;316:440–7.
Article CAS Google Scholar
Wee S, Koob GF. The role of the dynorphin-κ opioid system in the reinforcing effects of drugs of abuse. Psychopharmacology. 2010;210:121–35.
Article CAS PubMed PubMed Central Google Scholar
Koob GF, Volkow ND. Neurobiology of addiction: a neurocircuitry analysis. Lancet Psychiatry. 2016;3:760–73.
Article PubMed PubMed Central Google Scholar
Jackson KJ, Jackson A, Carroll FI, Damaj MI. Effects of orally-bioavailable short-acting kappa opioid receptor-selective antagonist LY2456302 on nicotine withdrawal in mice. Neuropharmacology. 2015;97:270–4.
Article CAS PubMed PubMed Central Google Scholar
Daunais JB, Roberts DCS, McGinty JF. Cocaine self-administration increases preprodynorphin, but not c-fos, mRNA in rat striatum. Neuroreport. 1993;4:543–6.
Article CAS PubMed Google Scholar
Hurd YL, Herkenham M. Molecular alterations in the neostriatum of human cocaine addicts. Synapse. 1993;13:357–69.
Article CAS PubMed Google Scholar
Mathieu-Kia AM, Besson MJ. Repeated administration of cocaine, nicotine and ethanol: effects on preprodynorphin, preprotachykinin A and preproenkephalin mRNA expression in the dorsal and the ventral striatum of the rat. Mol Brain Res. 1998;54:141–51.
Article CAS PubMed Google Scholar
Brandon CL, Steiner H. Repeated methylphenidate treatment in adolescent rats alters gene regulation in the striatum. Eur J Neurosci. 2003;18:1584–92.
Article PubMed Google Scholar
di Benedetto M, D’Addario C, Candeletti S, Romualdi P. Chronic and acute effects of 3,4-methylenedioxy-N-methylamphetamine (‘Ecstasy’) administration on the dynorphinergic system in the rat brain. Neuroscience. 2006;137:187–96.
Article PubMed Google Scholar
Przewłocka B, Turchan J, Lasoń W, Przewłocki R. Ethanol withdrawal enhances the prodynorphin system activity in the rat nucleus accumbens. Neurosci Lett. 1997;238:13–6.
Article PubMed Google Scholar
Turchan J, Przewłocka B, Lasoń W, Przewłocki R. Effects of repeated psychostimulant administration on the prodynorphin system activity and kappa opioid receptor density in the rat brain. Neuroscience. 1998;85:1051–9.
Article CAS PubMed Google Scholar
Marinelli PW, Lam M, Bai L, Quirion R, Gianoulakis C. A microdialysis profile of dynorphin A1-8 release in the rat nucleus accumbens following alcohol administration. Alcohol Clin Exp Res. 2006;30:982–90.
Article CAS PubMed Google Scholar
Unterwald EM, Rubenfeld JM, Kreek MJ. Repeated cocaine administration upregulates κ and μ but not δ opioid receptors. Neuroreport. 1994;5:1613–6.
Article CAS PubMed Google Scholar
Karpyak VM, Winham SJ, Preuss UW, Zill P, Cunningham JM, Walker DL, et al. Association of the PDYN gene with alcohol dependence and the propensity to drink in negative emotional states. Int J Neuropsychopharmacol. 2013;16:975–85.
Article CAS PubMed Google Scholar
Bazov I, Sarkisyan D, Kononenko O, Watanabe H, Yakovleva T, Hansson AC, et al. Dynorphin and κ-opioid receptor dysregulation in the dopaminergic reward system of human alcoholics. Mol Neurobiol. 2018;55:7049–61.
Article CAS PubMed PubMed Central Google Scholar
Xu K, Seo D, Hodgkinson C, Hu Y, Goldman D, Sinha R. A variant on the kappa opioid receptor gene (OPRK1) is associated with stress response and related drug craving, limbic brain activation and cocaine relapse risk. Transl Psychiatry. 2013;3:1–9.
Article Google Scholar
Rose JH, Karkhanis AN, Chen R, Gioia D, Lopez MF, Becker HC, et al. Supersensitive kappa opioid receptors promotes ethanol withdrawal-related behaviors and reduce dopamine signaling in the nucleus accumbens. Int J Neuropsychopharmacol. 2016;19:1–10.
Article CAS Google Scholar
Siciliano CA, Calipari ES, Cuzon Carlson VC, Helms CM, Lovinger DM, Grant KA, et al. Voluntary ethanol intake predicts κ-opioid receptor supersensitivity and regionally distinct dopaminergic adaptations in macaques. J Neurosci. 2015;35:5959–68.
Article CAS PubMed PubMed Central Google Scholar
Honeycutt JA, Young JW, Porcu A, Sabariego M. Editorial: negative valence systems. Front Syst Neurosci. 2022;16:1014745.
Article PubMed PubMed Central Google Scholar
Tejeda HA, Bonci A. Dynorphin/kappa-opioid receptor control of dopamine dynamics: implications for negative affective states and psychiatric disorders. Brain Res. 2019;1713:91–101.
Article CAS PubMed Google Scholar
Newton SS, Thome J, Wallace TL, Shirayama Y, Schlesinger L, Sakai N, et al. Inhibition of cAMP response element-binding protein or dynorphin in the nucleus accumbens produces an antidepressant-like effect. J Neurosci. 2002;22:10883–90.
Article CAS PubMed PubMed Central Google Scholar
Redila VA, Chavkin C. Stress-induced reinstatement of cocaine seeking is mediated by the kappa opioid system. Psychopharmacology. 2008;200:59–70.
Article CAS PubMed PubMed Central Google Scholar
Funk D, Coen K, Lê AD. The role of kappa opioid receptors in stress-induced reinstatement of alcohol seeking in rats. Brain Behav. 2014;4:356–67.
Article PubMed PubMed Central Google Scholar
Nguyen JD, Grant Y, Taffe MA. Paradoxical changes in brain reward status during oxycodone self‐administration in a novel test of the negative reinforcement hypothesis. Br J Pharm. 2021;178:3797–812.
Article CAS Google Scholar
Nealey KA, Smith AW, Davis SM, Smith DG, Walker BM. Κ-opioid receptors are implicated in the increased potency of intra-accumbens nalmefene in ethanol-dependent rats. Neuropharmacology. 2011;61:35–42.
Article CAS PubMed PubMed Central Google Scholar
Valenza M, Butelman ER, Kreek MJ. Effects of the novel relatively short-acting kappa opioid receptor antagonist LY2444296 in behaviors observed after chronic extended-access cocaine self-administration in rats. Psychopharmacology. 2017;234:2219–31.
Article CAS PubMed PubMed Central Google Scholar
Insel T, Cuthbert B, Garvey M, Heinssen R, Pine DS, Quinn K, et al. Research domain criteria (RDoC): toward a new classification framework for research on mental disorders. Am J Psychiatry. 2010;167:748–51.
Article PubMed Google Scholar
Hoffman KL. New dimensions in the use of rodent behavioral tests for novel drug discovery and development. Expert Opin Drug Disco. 2016;11:343–53.
Article Google Scholar
Bale TL, Abel T, Akil H, Carlezon WA Jr, Moghaddam B, Nestler EJ, et al. The critical importance of basic animal research for neuropsychiatric disorders. Neuropsychopharmacology. 2019;44:1349–53.
Article PubMed PubMed Central Google Scholar
van der Staay FJ. Animal models of behavioral dysfunctions: Basic concepts and classifications, and an evaluation strategy. Brain Res Rev. 2006;52:131–59.
Article PubMed Google Scholar
Der-Avakian A, Markou A. The neurobiology of anhedonia and other reward-related deficits. Trends Neurosci. 2012;35:68–77.
Article CAS PubMed Google Scholar
Maia TV, Frank MJ. From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci. 2011;14:154–62.
Article CAS PubMed PubMed Central Google Scholar
Redish AD. Addiction as a computational process gone awry. Science. 2004;306:1944–7.
Article CAS PubMed Google Scholar
Rothkirch M, Tonn J, Köhler S, Sterzer P. Neural mechanisms of reinforcement learning in unmedicated patients with major depressive disorder. Brain. 2017;140:1147–57.
Article PubMed Google Scholar
Portoghese PS, Lipkowski AW, Takemori AE. Binaltorphimine and nor-binaltorphimine, potent and selective kappa-opioid receptor antagonists. Life Sci. 1987;40:1287–92.
Article CAS PubMed Google Scholar
Bruchas MR, Yang T, Schreiber S, Defino M, Kwan SC, Li S, et al. Long-acting κ opioid antagonists disrupt receptor signaling and produce noncompetitive effects by activating c-Jun N-terminal kinase. J Biol Chem. 2007;282:29803–11.
Article CAS PubMed Google Scholar
Horan P, Taylor J, Yamamura HI, Porreca F. Extremely long-lasting antagonistic actions of nor-binaltorphimine (nor-BNI) in the mouse tail-flick test. J Pharm Exp Ther. 1992;260:1237–43.
CAS Google Scholar
Kishioka S, Kiguchi N, Kobayashi Y, Yamamoto C, Saika F, Wakida N, et al. Pharmacokinetic evidence for the long-lasting effect of nor-binaltorphimine, a potent kappa opioid receptor antagonist, in mice. Neurosci Lett. 2013;552:98–102.
Article CAS PubMed Google Scholar
Dews PB. The effect of multiple S-delta periods on responding on a fixed-interval schedule. V. Effect of periods of complete darkness and of occasional omissions of food presentations. J Exp Anal Behav. 1966;9:573–8.
Article CAS PubMed PubMed Central Google Scholar
Dews PB. The effect of multiple S-delta periods on responding on a fixed-interval schedule: 3. Effect of changes in pattern of interruptions, parameters and stimuli. J Exp Anal Behav. 1965;8:427–35.
Article CAS PubMed PubMed Central Google Scholar
Dews PB. The effect of multiple S delta periods on responding on a fixed-interval schedule. J Exp Anal Behav. 1962;5:369–74.
Article CAS PubMed PubMed Central Google Scholar
Zimmerman J, Ferster CB. Intermittent punishment of Sdelta responding in matching to sample. J Exp Anal Behav. 1963;6:349–56.
Article CAS PubMed PubMed Central Google Scholar
Ferster CB, Appel JB. Punishment of S delta responding in matching to sample by time out from positive reinforcement. J Exp Anal Behav. 1961;4:45–56.
Article CAS PubMed PubMed Central Google Scholar
Skinner, BF. The Behaviour of Organisms. New York: Appleton-Century-Crofts; 1938.
Siciliano CA, Saha K, Calipari ES, Fordahl SC, Chen R, Khoshbouei H, et al. Amphetamine reverses escalated cocaine intake via restoration of dopamine transporter conformation. J Neurosci. 2018;38:484–97.
Article CAS PubMed PubMed Central Google Scholar
Kutlu MG, Zachry JE, Brady LJ, Melugin PR, Kelly SJ, Sanders C, et al. A novel multidimensional reinforcement task in mice elucidates sex-specific behavioral strategies. Neuropsychopharmacology. 2020;45:1463–72.
Article PubMed PubMed Central Google Scholar
Jaegle A, Mehrpour V, Rust N. Visual novelty, curiosity, and intrinsic reward in machine learning and the brain. Curr Opin Neurobiol. 2019;58:167–74.
Article CAS PubMed Google Scholar
Kaur JN, Jiang Y, Liang PP. Ask and explore: grounded question answering for curiosity-driven exploration. arXiv preprint. 2021. arXiv:2104.11902.
Stadie BC, Levine S, Abbeel P. Incentivizing exploration in reinforcement learning with deep predictive models. arXiv preprint. 2015. arXiv:1507.00814.
Kakade S, Dayan P. Dopamine: generalization and bonuses. Neural Netw. 2002;15:549–59.
Article PubMed Google Scholar
Laurent PA. The emergence of saliency and novelty responses from Reinforcement Learning principles. Neural Netw. 2008;21:1493–9.
Article PubMed PubMed Central Google Scholar
Shahan TA, Chase PN. Novelty, stimulus control, and operant variability. Behav Anal. 2002;25:175–90.
Article PubMed PubMed Central Google Scholar
Montgomery KC, Segall M. Discrimination learning based upon the exploratory drive. J Comp Physiol Psychol. 1955;48:225–8.
Article CAS PubMed Google Scholar
Hughes RN. Neotic preferences in laboratory rodents: Issues, assessment and substrates. Neurosci Biobehav Rev. 2007;31:441–64.
Article PubMed Google Scholar
Houillon A, Lorenz RC, Boehmer W, Rapp MA, Heinz A, Gallinat J, et al. The effect of novelty on reinforcement learning. Prog Brain Res. 2013;202:415–39.
Article CAS PubMed Google Scholar
Krebs RM, Schott BH, Schütze H, Düzel E. The novelty exploration bonus and its attentional modulation☆. Neuropsychologia. 2009;47:2272–81.
Article PubMed Google Scholar
Dayan P, Sejnowski TJ. Exploration bonuses and dual control. Mach Learn. 1996;25:5–22.
Article Google Scholar
Manosevitz M, Joel U. Behavioral effects of environmental enrichment in randomly bred mice. J Comp Physiol Psychol. 1973;85:373–82.
Article CAS PubMed Google Scholar
Cerbone A, Sadile AG. Behavioral habituation to spatial novelty: interference and noninterference studies. Neurosci Biobehav Rev. 1994;18:497–518.
Article CAS PubMed Google Scholar
Harris JD. Habituatory response decrement in the intact organism. Psychol Bull. 1943;40:385–422.
Article Google Scholar
Kalueff AV, Keisala T, Minasyan A, Kuuslahti M, Tuohimaa P. Temporal stability of novelty exploration in mice exposed to different open field tests. Behav Process. 2006;72:104–12.
Article Google Scholar
Olsen CM, Winder DG. Operant sensation seeking engages similar neural substrates to operant drug seeking in C57 mice. Neuropsychopharmacology. 2009;34:1685–94.
Article CAS PubMed Google Scholar
Olsen CM, Childs DS, Stanwood GD, Winder DG. Operant sensation seeking requires metabotropic glutamate receptor 5 (mGluR5). PLoS One. 2010;5:e15085.
Article CAS PubMed PubMed Central Google Scholar
Olsen CM, Winder DG. Operant sensation seeking in the mouse. J Vis Exp. 2010, https://doi.org/10.3791/2292.
Oleson EB, Roberts DCS. Cocaine self-administration in rats: threshold procedures. Methods Mol Biol. 2012;829:303–19.
Article PubMed Google Scholar
Oleson EB, Richardson JM, Roberts DCS. A novel IV cocaine self-administration procedure in rats: differential effects of dopamine, serotonin, and GABA drug pre-treatments on cocaine consumption and maximal price paid. Psychopharmacology. 2011;214:567–77.
Article CAS PubMed Google Scholar
Borsboom D, Mellenbergh GJ, van Heerden J. The theoretical status of latent variables. Psychol Rev. 2003;110:203–19.
Article PubMed Google Scholar
Lewis AS, Calipari ES, Siciliano CA. Toward standardized guidelines for investigating neural circuit control of behavior in animal research. eNeuro. 2021;8:ENEURO.0498–20.2021.
Article CAS PubMed Google Scholar
Edwards JR, Bagozzi RP. On the nature and direction of relationships between constructs and measures. Psychol Methods. 2000;5:155–74.
Article CAS PubMed Google Scholar
Negus SS. Effects of the kappa opioid agonist U50,488 and the kappa opioid antagonist nor-binaltorphimine on choice between cocaine and food in rhesus monkeys. Psychopharmacology. 2004;176:204–13.
Article PubMed Google Scholar
Hutsell BA, Cheng K, Rice KC, Negus SS, Banks ML. Effects of the kappa opioid receptor antagonist nor-binaltorphimine (nor-BNI) on cocaine versus food choice and extended-access cocaine intake in rhesus monkeys. Addiction Biol. 2016;21:360–73.
Article CAS Google Scholar
Nestby P, Schoffelmeer AN, Homberg JR, Wardeh G, De Vries TJ, Mulder AH, et al. Bremazocine reduces unrestricted free-choice ethanol self-administration in rats without affecting sucrose preference. Psychopharmacology. 1999;142:309–17.
Article CAS PubMed Google Scholar
Lindholm S, Werme M, Brené S, Franck J. The selective kappa-opioid receptor agonist U50,488H attenuates voluntary ethanol intake in the rat. Behav Brain Res. 2001;120:137–46.
Article CAS PubMed Google Scholar
Schenk S, Partridge B, Shippenberg TS. U69593, a kappa-opioid agonist, decreases cocaine self-administration and decreases cocaine-produced drug-seeking. Psychopharmacology. 1999;144:339–46.
Article CAS PubMed Google Scholar
Glick SD, Maisonneuve IM, Raucci J, Archer S. Kappa opioid inhibition of morphine and cocaine self-administration in rats. Brain Res. 1995;681:147–52.
Article CAS PubMed Google Scholar
Carlezon WA Jr, Thome J, Olson VG, Lane-Ladd SB, Brodkin ES, Hiroi N, et al. Regulation of cocaine reward by CREB. Science. 1998;282:2272–5.
Article CAS PubMed Google Scholar
Zhang Y, Butelman ER, Schlussman SD, Ho A, Kreek MJ. Effect of the endogenous kappa opioid agonist dynorphin A(1-17) on cocaine-evoked increases in striatal dopamine levels and cocaine-induced place preference in C57BL/6J mice. Psychopharmacology. 2004;172:422–9.
Article CAS PubMed Google Scholar
Mori T, Nomura M, Nagase H, Narita M, Suzuki T. Effects of a newly synthesized kappa-opioid receptor agonist, TRK-820, on the discriminative stimulus and rewarding effects of cocaine in rats. Psychopharmacology. 2002;161:17–22.
Article CAS PubMed Google Scholar
Crawford CA, McDougall SA, Bolanos CA, Hall S, Berger SP. The effects of the kappa agonist U-50,488 on cocaine-induced conditioned and unconditioned behaviors and Fos immunoreactivity. Psychopharmacology. 1995;120:392–9.
Article CAS PubMed Google Scholar
Logrip ML, Janak PH, Ron D. Blockade of ethanol reward by the kappa opioid receptor agonist U50,488H. Alcohol. 2009;43:359–65.
Article CAS PubMed PubMed Central Google Scholar
Privette TH, Terrian DM. Kappa opioid agonists produce anxiolytic-like behavior on the elevated plus-maze. Psychopharmacology. 1995;118:444–50.
Article CAS PubMed Google Scholar
Kudryavtseva NN, Gerrits MAFM, Avgustinovich DF, Tenditnik MV, van Ree JM. Modulation of anxiety-related behaviors by μ- and κ-opioid receptor agonists depends on the social status of mice. Peptides. 2004;25:1355–63.
Article CAS PubMed Google Scholar
Pirino BE, Spodnick MB, Gargiulo AT, Curtis GR, Barson JR, Karkhanis AN. Kappa-opioid receptor-dependent changes in dopamine and anxiety-like or approach-avoidance behavior occur differentially across the nucleus accumbens shell rostro-caudal axis. Neuropharmacology. 2020;181:108341.
Article CAS PubMed PubMed Central Google Scholar
Castro DC, Berridge KC. Opioid hedonic hotspot in nucleus accumbens shell: Mu, delta, and kappa maps for enhancement of sweetness ‘liking’ and ‘wanting’. J Neurosci. 2014;34:4239–50.
Article CAS PubMed PubMed Central Google Scholar
Al-Hasani R, McCall JG, Shin G, Gomez AM, Schmitz GP, Bernardi JM, et al. Distinct subpopulations of nucleus accumbens dynorphin neurons drive aversion and reward. Neuron. 2015;87:1063–77.
Article CAS PubMed PubMed Central Google Scholar
Krystal AD, Pizzagalli DA, Smoski M, Mathew SJ, Nurnberger J Jr, Lisanby SH, et al. A randomized proof-of-mechanism trial applying the ‘fast-fail’ approach to evaluating κ-opioid antagonism as a treatment for anhedonia. Nat Med. 2020;26:760–8.
Article CAS PubMed PubMed Central Google Scholar
Pizzagalli DA, Smoski M, Ang YS, Whitton AE, Sanacora G, Mathew SJ, et al. Selective kappa-opioid antagonism ameliorates anhedonic behavior: evidence from the Fast-fail Trial in Mood and Anxiety Spectrum Disorders (FAST-MAS). Neuropsychopharmacology. 2020;45:1656–63.
Article CAS PubMed PubMed Central Google Scholar
Farahbakhsh ZZ, Siciliano CA. Neurobiology of novelty seeking. Science. 2021;372:684–5. -685
Article CAS PubMed Google Scholar
Kelley AE. Locomotor activity and exploration. Techniques in The Behavioral and Neural Sciences. 1993;10:499–518
Zuckerman M. Behavioral Expressions and Biosocial Bases of Sensation Seeking. New York: Cambridge University Press; 1994.
Berlyne DE. Curiosity and exploration. Science. 1966;153:25–33.
Article CAS PubMed Google Scholar
Marsland S. Novelty detection in learning systems. Neural Comput Surv. 2003;3:157–95.
Google Scholar
MATZEL LD, Townsend DA, Grossman H, Han YR, Hale G, Zappulla M, et al. Exploration in outbred mice covaries with general learning abilities irrespective of stress reactivity, emotionality, and physical attributes. Neurobiol Learn Mem. 2006;86:228–40.
Article PubMed Google Scholar
Dreisbach G, Goschke T. How positive affect modulates cognitive control: reduced perseveration at the cost of increased distractibility. J Exp Psychol Learn Mem Cogn. 2004;30:343–53.
Article PubMed Google Scholar
Huebner F, Fichtel C. Innovation and behavioral flexibility in wild redfronted lemurs (Eulemur rufifrons). Anim Cogn. 2015;18:777–87.
Article PubMed PubMed Central Google Scholar
Greenberg R. The role of neophobia and neophilia in the development of innovative behaviour of birds. In: Animal innovation. Oxford University Press, 2003, pp. 175–96. https://doi.org/10.1093/acprof:oso/9780198526223.003.0008.
Rochford J, Stewart J. Activation and expression of endogenous pain control mechanisms in rats given repeated nociceptive tests under the influence of naloxone. Behav Neurosci. 1987;101:87–103.
Article CAS PubMed Google Scholar
Netto CA, Siegfried B, Izquierdo I. Analgesia induced by exposure to a novel environment in rats: effect of concurrent and post-training stressful stimulation. Behav Neural Biol. 1987;48:304–9.
Article CAS PubMed Google Scholar
Rochford J, Dawes P, Stewart J. Naloxone potentiation of novelty-induced hypoalgesia: characterization of the α-noradrenergic receptor subtype. Pharm Biochem Behav. 1993;44:381–6.
Article CAS Google Scholar
Siegfried B, Netto CA, Izquierdo I. Exposure to novelty induces naltrexone-reversible analgesia in rats. Behav Neurosci. 1987;101:436–8.
Article CAS PubMed Google Scholar
Rochford J. The effects of clonidine and yohimbine on novelty-induced hypoalgesia. Psychobiology. 1992;20:163–5.
Article CAS Google Scholar
Butler RA. Discrimination learning by rhesus monkeys to visual-exploration motivation. J Comp Physiol Psychol. 1953;46:95–98.
Article CAS PubMed Google Scholar
Reed P, Mitchell C, Nokes T. Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task. Anim Learn Behav. 1996;24:38–45.
Article Google Scholar
Grossberg S. A neural model of attention, reinforcement and discrimination learning. Int Rev Neurobiol. 1975;18:263–327. https://doi.org/10.1016/S0074-7742(08)60037-9.
Tulving E, Markowitsch HJ, Craik FIM, Habib R, Houle S. Novelty and familiarity activations in PET studies of memory encoding and retrieval. Cereb Cortex. 1996;6:71–79.
Article CAS PubMed Google Scholar
Tulving E, Markowitsch HJ, Kapur S, Habib R, Houle S. Novelty encoding networks in the human brain. Neuroreport. 1994;5:2525–8.
Article CAS PubMed Google Scholar
Laman-Maharg A, Williams AV, Zufelt MD, Minie VA, Ramos-Maciel S, Hao R, et al. Sex differences in the effects of a kappa opioid receptor antagonist in the forced swim test. Front Pharm. 2018;9:93.
Article Google Scholar
Przybysz KR, Varlinskaya EI, Diaz MR. Age and sex regulate kappa opioid receptor-mediated anxiety-like behavior in rats. Behav Brain Res. 2020;379:112379.
Article CAS PubMed Google Scholar
Reichard KL, Newton KA, Rivera Z, Sotero de Menezes PM, Schattauer SS, Land BB, et al. Regulation of kappa opioid receptor inactivation depends on sex and cellular site of antagonist action. Mol Pharm. 2020;98:548–58.
Article CAS Google Scholar
Robles CF, McMackin MZ, Campi KL, Doig IE, Takahashi EY, Pride MC, et al. Effects of kappa opioid receptors on conditioned place aversion and social interaction in males and females. Behav Brain Res. 2014;262:84–93.
Article CAS PubMed PubMed Central Google Scholar
Russell SE, Rachlin AB, Smith KL, Muschamp J, Berry L, Zhao Z, et al. Sex differences in sensitivity to the depressive-like effects of the kappa opioid receptor agonist U-50488 in rats. Biol Psychiatry. 2014;76:213–22.
Article CAS PubMed Google Scholar
Chartoff EH, Mavrikaki M. Sex differences in kappa opioid receptor function and their potential impact on addiction. Front Neurosci. 2015;9:466.
Article PubMed PubMed Central Google Scholar
Cohen JD, McClure SM, Yu AJ. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos Trans R Soc B Biol Sci. 2007;362:933–42.
Article Google Scholar
White DA, Kalinichev M, Holtzman SG. Locomotor response to novelty as a predictor of reactivity to aversive stimuli in the rat. Brain Res. 2007;1149:141–8.
Article CAS PubMed PubMed Central Google Scholar
Van’t Veer A, Carlezon WA. Role of kappa-opioid receptors in stress and anxiety-related behavior. Psychopharmacology. 2013;229:435–52.
Article PubMed PubMed Central Google Scholar
Knoll AT, Muschamp JW, Sillivan SE, Ferguson D, Dietz DM, Meloni EG, et al. Kappa opioid receptor signaling in the basolateral amygdala regulates conditioned fear and anxiety in rats. Biol Psychiatry. 2011;70:425–33.
Article CAS PubMed PubMed Central Google Scholar
Bruchas MR, Land BB, Lemos JC, Chavkin C. CRF1-R activation of the dynorphin/kappa opioid system in the mouse basolateral amygdala mediates anxiety-like behavior. PLoS One. 2009;4:e8528.
Article PubMed PubMed Central Google Scholar

Download references

Funding

This work was supported by NIH grants R00 DA04510 (NIDA), U01 AA029971 (NIAAA), R01 AA030115 (NIAAA), Alkermes Pathways Research Award, the Brain Research Foundation, Whitehall Foundation, and the Stanley Cohen Innovation Fund. ZZF is supported by an institutional training grant (T32 MH064913).

Author information

Authors and Affiliations

Department of Pharmacology, Vanderbilt Brain Institute, Vanderbilt Center for Addiction Research, Vanderbilt University, Nashville, TN, 37232, USA
Zahra Z. Farahbakhsh, Keaton Song, Hannah E. Branthwaite, Kirsty R. Erickson, Snigdha Mukerjee, Suzanne O. Nolan & Cody A. Siciliano

Authors

Zahra Z. Farahbakhsh
View author publications
You can also search for this author in PubMed Google Scholar
Keaton Song
View author publications
You can also search for this author in PubMed Google Scholar
Hannah E. Branthwaite
View author publications
You can also search for this author in PubMed Google Scholar
Kirsty R. Erickson
View author publications
You can also search for this author in PubMed Google Scholar
Snigdha Mukerjee
View author publications
You can also search for this author in PubMed Google Scholar
Suzanne O. Nolan
View author publications
You can also search for this author in PubMed Google Scholar
Cody A. Siciliano
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ZZF and CAS jointly conceived of the project. ZZF, CAS, SON, and SM designed the experiments. ZZF, KS, HEB, and KRE collected behavioral data. ZZF and CAS developed MATLAB analysis code. ZZF performed analysis. ZZF and CAS created the figures and wrote the paper.

Corresponding author

Correspondence to Cody A. Siciliano.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Video Legend

Supplement

Supplemental Video

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Farahbakhsh, Z.Z., Song, K., Branthwaite, H.E. et al. Systemic kappa opioid receptor antagonism accelerates reinforcement learning via augmentation of novelty processing in male mice. Neuropsychopharmacol. 48, 857–868 (2023). https://doi.org/10.1038/s41386-023-01547-x

Download citation

Received: 13 September 2022
Revised: 20 January 2023
Accepted: 02 February 2023
Published: 17 February 2023
Issue Date: May 2023
DOI: https://doi.org/10.1038/s41386-023-01547-x

This article is cited by

Kappa opioid receptors as modulators of novelty processing
- Kathryn Braden
- Daniel C. Castro
Neuropsychopharmacology (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods and materials

Subjects

Drugs

Positive reinforcement

Negative reinforcement

Crossover treatment

Novelty response

Sensory reinforcement

Fixed-ratio 1

Behavioral economics

Results

Systemic KOR antagonism accelerated discriminated operant responding during positive reinforcement learning

KOR antagonism selectively increased learning rate without impacting maximal performance

KOR modulation of learning rate was not due to altered consummatory response

Systemic KOR antagonism increased reinforcement learning rate independent of reinforcer valence

KOR antagonism did not alter unconditioned responses to footshock

KOR antagonism selectively augmented novelty exploration independent of habituation rate, general ambulatory activity, and neophobic avoidance behaviors

KOR blockade augmented essential reinforcing value of novel stimuli

Discussion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links