Animal-oriented virtual environments : illusion , dilation , and discovery

As a research tool, virtual environments (VEs) hold immense promise for brain scientists. Yet to fully realize this potential in non-human systems, theoretical and conceptual perspectives must be developed. When selectively coupled to nervous systems, virtual environments can help us better understand the functional architecture of animals’ brains during naturalistic behaviors. While this will no doubt allow us to further our understanding of the neural basis of behavior, there is also an opportunity to uncover the diversity inherent in brain activity and behavior. This is due to two properties of virtual environments: the ability to create sensory illusions, and the ability to dilate space and/or time. These and other potential manipulations will be characterized as the effects of virtuality. In addition, the systems-level outcomes of virtual environment enhanced perception will be discussed in the context of the uncanny valley and other expected relationships between emotional valence, cognition, and training. These effects and their usefulness for brain science will be understood in the context of three types of neurobehavioral phenomena: sensorimotor integration, spatial navigation, and interactivity. For each of these behaviors, a combination of illusory and space/time dilation examples will be reviewed. Once these examples are presented, the implications for improving upon virtual models for more directly inducing the mental phenomena of illusion and space/time dilation will be considered. To conclude, future directions for integrating the use of VEs into a strategy of broader biological inquiry will be presented. Bradly Alicea ( ) Corresponding author: bradly.alicea@outlook.com Alicea B. How to cite this article: Animal-oriented virtual environments: illusion, dilation, and discovery [version 2; referees: 2 approved 2015, :202 (doi: ) with reservations] F1000Research 3 10.12688/f1000research.3557.2 © 2015 Alicea B. This is an open access article distributed under the terms of the , which Copyright: Creative Commons Attribution Licence permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the (CC0 1.0 Public domain dedication). Creative Commons Zero "No rights reserved" data waiver The author(s) declared that no grants were involved in supporting this work. Grant information: Competing interests: No competing interests were disclosed. 26 Aug 2014, :202 (doi: ) First published: 3 10.12688/f1000research.3557.1 1


Introduction
Virtual Environments (VEs) are increasingly being used to uncover the fundamental features of cognition.Areas of investigation include spatial cognition, sensorimotor control, and emotional processing (Bohil et al., 2011).While VEs are an up-and-coming method for studying human cognition, they are increasingly also being used in the study of animal cognition.VE systems usually consist of a sensory or experiential analogue.This allows us not only to faithfully replicate naturalistic conditions for behaviors in the lab, but also explore the limits of the underlying neural systems.
One popular aim in the brain science community is to understand the basis of cognitive functions or disorders (Mar, 2011;Menzel, 2012).By using virtual environments, we wish to control as many environmental variables as possible.Tight control of environmental conditions in an immersive environment should allow us to isolate the biological sources of behavioral variation.This should apply to both human and non-human animals, although the design of visual arrays and other forms of sensory manipulation must conform to a specific animal's sensory abilities and specializations.Removing the environmental vagaries of a behavior may also allow us to induce mental phenomena that can only be simulated in a virtual environment.While the role of mental phenomenology is a controversial topic when talking about non-human animals, VE systems should allow us to better investigate the possible existence of mental worlds in animals.These include sensory illusions and the dilation of space and/or time.
Given that these concepts are not immediately intuitive, how do we formally and operationally define sensory illusion and space/time dilation?The working definition of sensory illusion is focused on a virtual stimulus which can be confused as a real stimulus.The key property of sensory illusion is perceptual ambiguity, where the virtual stimulus looks nearly real, but is nevertheless a simulation.This has the potential to introduce ambiguities in constructing a unified percept of the object, particularly in the context of multisensory integration.By contrast, the working definition of space/time dilation involves a virtual stimulus that speeds up or slows down action in a visual reference frame (or sensory event) relative to the natural motion of an object.The definition of natural motion is either intuitive or innate.Intuitive natural motion can be defined as physical objects evaluated by the observer in terms of naive physics (Povinelli, 2003).Innate natural motion can be defined as biological motion, or the movement patterns of organismal bodies as sensed by an observer (Grossman & Blake, 2001).Both of these can be violated through the use of virtual environments, and the neural response can mimic that of sensory illusion.
These phenomena have been demonstrated in a number of contemporary papers that look at cognitive behaviors such as sensorimotor integration, spatial navigation, and interactivity.The papers reviewed here represent the state-of-the-art application of VEs to the naturalistic study of brain activity and behavior in animals.Aside from serving the needs of neuroethologists, who can now study behavior in a controlled setting, animal models also allow us to better understand the neural correlates of behavior.This is due to the relative ease of conducting direct recordings of neuronal populations and circuits.They also serve as important clues to more subjective issues that warrant further investigation.

Virtual environments meet cognitive neuroethology
In the past few decades, a number of pop-culture references and technological developments have turned virtuality into a relevant, shared human experience.Virtuality itself can be defined as the collective effects of a virtual world stimulus on perception, behavior, and social interactions.While there are many dimensions to this experience, two of the most fundamental are perceptual illusion and space/time dilation.Because VE models are immersive, the technology that simulates perceptual cues creates the illusion of being in a sensory cocoon.Inside of this cocoon, the participant can transcend perceptual limits whilst maintaining a highly-faithful representation of the physical world.Yet VE models are also engaging, and when this level of engagement is high, the potential exists for other forms of sensory distortion.Space/time dilation exists when perception and action can be sped up or slowed down, creating different time-scales.Reality itself can also be dilated in space.In this case, dilation involves expanding and contracting the scope of attentional resources.Both of these effects result directly from the technological environment.
VE models provide an alternate environment which has a high degree of representational similarity but varying degrees of experiential similarity.Yet it also provides us with a means to explore cognitive neuroethology, or the cognitive dynamics of naturalistic animal behavior (Giurfa, 2003).As VEs provide a means to explore behavioral effects beyond trial-by-trial presentations, it also requires us to account for unique emotional and cognitive responses.While the effects of virtuality might seem to be obscure, it is actually a common theme in movies such as "The Matrix" and "Inception".VEs allow for exploration of these fictive aspects of the real world represented as cognitive processes.In applications to animals, this can be extended further into the world of neuroethology.In fact, analogies based on these movies have been made between fictive mental responses and manipulations of hippocampal-dependent memories (see Spiers & Bendor, 2014).In this paper, these types of effects will be applied to animal models, and shown to exist for three types of behavior.
Why would this be interesting to the study of non-human brain and behavior?With VE systems, we can provide high-fidelity reconstructions of the real world and environments in which typical sensory cues are either dilated in space/time, temporally distorted, or combinations of both.In this paper, we will explore how virtual environments allow us to uncover the cognitive and neural processing behind illusion and space/time dilation in animals.These effects, seen in a number of contexts and neural systems, can be collectively referred to as the effects of virtuality.By using a model from the human-robot interaction literature (e.g.uncanny valley), we can better generalize the effects of virtuality to cross-species behaviors and neural mechanisms.

Cognitive neuroethology as an uncanny valley
There is evidence that these factors are most relevant to animal behavior research, for which naturalistic settings are of primary importance (Zupanc, 2010).But how much of the environment must be replicated in order for an animal to recognize it as "just like the real thing"?One way this can be characterized is through the uncanny valley phenomenon.The uncanny valley characterizes the subjectivity inherent in how observers perceive and act upon virtual environment avatars and robots that embody various degrees of realism (see Figure 1).The uncanny valley is based on an emotional response occurring in the very early stages of perceptual processing, which can be elicited for any object that generates an emotional response or involves recognition mechanisms.Both emotional response and recognition result from experience, which emerges in development and occurs in non-human contexts (Lewkowicz & Ghazanfar, 2012).Experience also conditions the classification of stimuli as being real or a facsimile in terms of recognition.Whenever a real object clearly has the attributes of such, the early emotional response resolves the ambiguity of classification as real or virtual (Steckenfinger & Ghazanfar, 2009).It is when this ambiguity cannot be resolved that the problem lies.
For purposes of this paper, let us map recognition to classification using the uncanny valley curve as a referent.Initially (see Figure 1A), the more "real" an object becomes, the more it is associated with its real-world analogue.This phase of the curve is associated with gains in sensory fidelity.The second phase of the curve (see Figure 1B), which consists of two inflection points, is associated with a drop-off in the feeling of realness just before a fully "real" emotional response occurs.At this point in the response curve, there is a predicted perceptual decoupling between the highly-realistic representation and the recognition that a robot is human or an object is real.This is an ongoing challenge in the world of human-robot interaction and VE design.However, this technical challenge might also be used to facilitate the effects of virtuality mentioned previously.
There are a few caveats to the arguments and ideas presented herein.In animals, the uncanny valley has been observed only in primates (Penn & Povinelli, 2007;Steckenfinger & Ghazanfar, 2009).However, the strategic use of VEs to provide stimuli could reveal a similar neural response in other animals.In addition, the effects of virtuality are expected to exhibit a variable effect size depending on the species chosen.Species that have high levels of what is traditionally considered animal intelligence (Matzel & Kolata, 2010) should exhibit these effects most strongly.Effects such as illusion can also be very strong in organisms with highly-specialized sensory systems, particularly given that the VE manipulation is highly specific.I propose that the key component that relates the hyperrealism of VE to the uncanny valley effect is not a set of highercognitive mechanisms, but rather the information held in perceptual ambiguities.It is these ambiguities and the uncanny valley effect in general that can actually be leveraged to produce illusory or space/ time dilation effects.

Potential means of measurement
Let us now turn to potential ways to measure the effects of virtuality and the predicted patterns of these measures for each type of effect (illusion, space/time dilation).There are four general types of measurement for which the neural substrate will vary across taxa: emotional valence, perceptual ambiguity and coherence, adaptation and motion perception, and spatial memory.A summary of these measurement types can be found in Table 1.
The first effect of virtuality involves the production of illusory effects.In terms of emotional valence, it is predicted that when stimuli are either completely or not at all illusory, there is little emotional response.It is when stimuli are slightly illusory that we  expect to see the greatest emotional response.A similar situation is expected to exist for perceptual ambiguity and coherence, and not surprisingly is linked to emotional valence.As was just discussed, it is predicted that the slight degrees of illusion elicit the greatest amount of emotional valence.As a consequence, slight degrees of illusion can correspondingly degrade perceptual performance.In this case, perceptual performance can be measured in the form of response times, object recognition, and kinematic patterns.
Illusion can also be measured by looking at the correlates of adaptation and motion perception.In general, illusory effects should utilize existing capacity for adaptation and result in phenomena such as visual aftereffects.Correlates of spatial memory can also provide potential measurement of illusory effects, as such effects should produce new episodic but not associative memories.Applying VEs to animals can also produce space/time dilation effects which can be measured in a number of ways.As with illusion, there are four general types of measurement for which the neural substrate will also vary across taxa: emotional valence, perceptual ambiguity and coherence, adaptation and motion perception, and spatial memory.
As in the case of illusory effects, emotional valence should be highest when the effects of virtuality are slight.In the case of space/time dilation, the greatest amount of emotional valence occurs when stimuli are slightly disjoint in space/time.By contrast, stimuli that are either entirely integrated or entirely disjoint in space/time should elicit little emotional response.Also as with illusion, perceptual ambiguity and coherence are linked to emotional valence.In this case, moderate amounts of space/time dilation are expected to elicit the greatest amount of emotional valence.Much like in the case of illusion, these conditions contribute to the degradation of perceptual performance.
Space/time dilation should also be apparent in measurements of response times, object recognition, and kinematic patterns.However, space/time dilation is particularly effective at systematically warping the reference frames of perception and action.This should be similar to the phenomena of rotational and gravitational reference frame manipulation (Leone, 1998;Lipshits et al., 2005) and plasticity of the multimodal gravitational reference frame (Luyat et al., 2005) that have been observed in humans.Due to the widerranging nature of this effect, the effects of space/time dilation on adaptation and motion perception should facilitate new adaptations and a generalized neuroplastic response.Spatial memory should also be affected by space/time dilation, as modification of mental representations such as the gravitational reference frames should produce new associative memories.
In cases where there is ambiguity in the stimulus (e.g.agents that look real but do not exhibit all of the cues of a real individual), a distinctive neural response related to the mismatch between appearance and motion can be elicited (Saygin et al., 2012).Part of this response involves physiological adaptation to motion (Celebrini & Newsome, 1994) as expected of real-world objects.The response to mismatch also involves the associated function of visual motion and theory of mind (ToM) mechanisms (Gerrans, 2002).This principle of associated function may also allow for perceptual ambiguities to influence a more general set of neural mechanisms (Changizi, 2011).For example, in humans the ambiguous nature of some virtual stimuli (e.g.agents or complex objects) elicits activity in the bilateral anterior intraparietal sulcus.While this is usually related to prediction error, it can also affect the global state of the actionperception system (Saygin et al., 2012).Thus, simple ambiguities may be intentionally introduced using virtual environments to trigger controlled departures from the context of reality.
To resolve the issue of equivalent responses to real and virtual environments in non-human animal species, it is worth noting that what individuals generally consider to be reality is based on personal experience and perceptual coherence (Engert, 2013).If this premise holds true for the neural basis of sensation and perception (for an example from primate vision, see Andersen et al., 2014), then we should be able to discover the limits of this illusory capacity by manipulating the environment and rousing the organism from this illusion.It is important to remember that in this context, illusory responses are not dependent on the animal reaching some sort of philosophical epiphany.Rather, the illusory effect is a metaphor that encapsulates an immersive versus non-immersive experience.
Depending on the level of immersion, it may be possible to control not only the sensory cues experienced by the non-human animal, but the entirety of the experience itself.In the case of human experience, reality is defined as perceptual and cognitive norms which permeate the context of everyday living.The effects of this context are limited to current (e.g.non life-history dependent) experience.However, it also serves as a contrast to perception and action outside of the VE.In many cases, non-human animals should respond to both rudimentary sensory cues (illusion) and dilated perceptual representations and sensory cues (space/time dilation).In these cases, then the application of VEs to the study of animal cognition and behavior will have much predictive and comparative value.

Current examples
To outline the potential of VE systems for animal research, I will focus on three areas of contemporary investigation: sensorimotor integration, spatial navigation, and interactivity (see Table 2).All

Unique behaviors/ responses
Semi-realistic neural coding, dynamic changes in distributed population codes.
Semi-realistic neural coding, real-time nonlinear plume tracking.
Transfer of experience between spatial scales (beaming).
three of these areas have been studied extensively in humans.Furthermore, the first two areas have also been studied extensively in animals, but until recently have not leveraged the advantages of VE technology.These examples utilize a range of experimental apparatus, from simple illusory stimuli and tracking systems to extensive mimicry of sensory cues.The simulation of any one set of environmental stimuli results in the activation of multiple neural circuits and may involve multiple cognitive systems.Yet this diversity of approaches has roughly the same effect: to enable control over the environment and to extend the range of experimentally-observable behaviors.Newly-observed behaviors and neural responses include: semi-realistic neural coding at the cellular level, transferring experience between spatial scales (e.g.beaming), and dynamic changes in distributed population codes.These and other unique findings also allow us to gain an appreciation for the spectrum of neural responses associated with these behaviors in an analytically tractable manner.
To better appreciate these examples, recall that the efficacy of VE systems is based on more than the ability to generate a series of high-fidelity visual images or tactile stimulations.Part of this unexplained variance has to do with the emotional state and cognitive response (Seyama & Nagayama, 2007) to specific stimuli.The other component involves the form of virtual intervention.Would it simply be enough to show an animal a familiar visual scene, or can experimental outcomes of large effect be elicited by reducing the environment to key features of an experience?The uncanny valley effect suggests that the former is just as important as the latter, and both interact with emotional responses.While the main effect of using VE to generate the effects of virtuality might seem to depend upon selectively manipulating the fidelity of a simulation, perceptual information that triggers an emotional response might be just as important.
For further clarity, we can turn to two examples of how robotic models have been utilized to study animal behavior.Robotic conspecifics can be used to mimic key mating signals.In this case (Patricelli & Krakauer, 2010), it is not the fidelity of the robot that is important, but rather the quality of the mimicked signal.Robotic approximations of conspecifics can be used to replicate commonly-observed, species-specific behaviors such as ant trail building and rat pup behavior (Akst, 2013).As with the simulation of mating rituals, it is not the details of the behavior and how it is represented in the brain that are important.Experiments replicating social learning and conspecific interactions using biomimetic robots demonstrate that full replication of sensory cues is not necessary to elicit a response (Krause et al., 2011).These findings suggest that successful simulation and the elicitation of desired behaviors can be reduced to a few key features depending on the cognitive or technological domain.

Sensorimotor integration
An experimental apparatus that is both capable of tightly reproducing the original environment (maintaining integration) and selectively distorting it (disrupting integration) is highly useful for understanding the effects of movement disorders.Being able to conduct experiments with this level of environmental control in non-human animals allows for single cell-level contributions to behavioral variation.Ahrens et al. (2012) have developed an innovative virtual environment for zebrafish that is customized for fish cognition and swimming behavior.Visual scenes are projected onto a screen located underneath the fish's location (Petri dish), and consist of square gratings that move along the fish's body from snout to tail.Importantly, the speed of visual cue presentation can be adaptively adjusted relative to swim speed.Immersion in such a context is sufficient for initiating short-term forms of motor learning (Gray, 2012).The neural populations responsible for motor learning are distributed across the brain, including the inferior olive and cerebellum.This is the expected location for motor learning consolidation, which is conserved from fishes to humans.
Zebrafish VE also allows for flexibility in the experimental setup which in turn provides a means to dissect components of the sensorimotor loop in a systematic manner.Engert ( 2013) has proposed two alternate interaction modes (e.g.experimental preparations) for creating illusory stimuli related to zebrafish swimming behavior.In this case, possible illusory stimuli include (but may not be limited to) oscillating visual gratings and animations that are inconsistent with an organism's perception of self-motion (Lappe et al., 1999).These type of illusions presented in an experimental setup allows for direct measurement of movement and the recording of neural responses to active behavior.The other involves immobilizing the fish and recording the neural activity associated with intended (or fictive) locomotion.In both cases, the contributions of visual stimuli, motion, and the corresponding neural response can be decoupled through an inconsistency between an organism's self-motion and the surrounding environment.
While this effect might be explained as an experimental artifact, robot-fish interaction studies might help us further appreciate the role of conspecific-like self-motion cues in regulating how perception and action are coupled and decoupled.In the work of Marras & Porfiri (2012), biological fish were attracted to the locomotion of a robotic fish.Rather than actively decoupling sensory cues, the robot-fish interaction involves replicating the hydrodynamic and other mechanical cues of conspecific swimming behavior.While the coupling or decoupling of self-motion and behavior may be context-dependent in nature, VE and robotic studies have shown (in an almost accidental fashion) how true to context stimuli must be to elicit the proper neural responses.As we will see in the case of interactivity, neural activity associated with intentional behavior can be both a useful and important indicator of dynamic cognitive responses.
In The first-person display features a fisheye-view of a linear track with a reward at the end of the track.This experimental setup resulted in semi-realistic firing patterns for place cells, which encode locations in virtual space.The authors also found three distinct sub-threshold signatures for place fields, which in turn may allow us to confirm theoretical models of neuronal coding (Ekstrom et al., 2003).
While traditional spatial navigation experiments require very few illusory or space/time dilation-related manipulations, there is the potential to do experiments in animals where spatial relationships (and perhaps even mental representations of space-time) are warped.The work of Gershow et al. (2012) demonstrates how gradients of airborne cues can be delivered to organisms in a controlled manner using a series of microcontrollers.Some invertebrate species such as moths engage in a form of spatial navigation behavior called plume tracking.Plumes of odorants or other chemicals do not diffuse through their environmental media (e.g.air or water) in a linear fashion, and the information embedded in a plume is made highly nonlinear due to turbulent conditions.By delivering these gradients as highly laminar flows, the diversity and complexity of motor responses associated with plume tracking can be made tractable.

Interactivity
Interactivity can be defined as the ability to manipulate and adaptively respond to a wider range of objects and behaviors than would found in a non-virtual context.This is a term I am presenting here for purposes of describing a series of experiments that feature animals interacting with VE systems.This could include computergenerated stimuli or avatars.Depending on the application, this can provide either the experience of enveloping interactivity or an experience of dilating the temporal or spatial scale of perception and action.
Normand et al. (2012) use an ingenious experimental design to study interactivity between rats and humans using a technique called "beaming".In this approach, a rat interacts with a robotic human analogue (ePuck).Humans interact with a telerobotic virtual environment system that maps behavior to ePuck that size-wise is similar to the rat's body.To provide closed-loop feedback, the rat's movements are then tracked and mapped to a human-like avatar in the virtual environment.The beaming approach allows for human interactions to take place at the rat's size scale and vice versa.This also enables inter-species interactions such as the neuroanthropological studies of human-animal interaction featured in Keil & Downey (2012).Using beaming in this context might more directly address the existence of ToM within and between species.
Interactivity can also be explored using brain-machine interfaces (BMIs).BMIs share many attributes with virtual environments, and allow us to better contextualize the potential interactions between brain, behavior, and environment observed during virtual world immersion.We can look to the application of BMIs in understanding the neural mechanisms underlying grasping in non-human primates as a relevant example.In O' Doherty et al. (2011), his group introduces the brain-machine-brain interface, which uses electrophysiological signals from the motor cortex (motion planning) as input to a virtual arm that grasps virtual objects.The additional (e.g.feedback to the brain) component involves stimulation of the sensorimotor cortex that serves as haptic (e.g.touch) feedback.This set of experiments has applications to brain-controlled prosthetic devices.This brain-machine-brain interface is currently being realized in application form as the Walk Again project, which aims to enable prosthesis-wearers to engage in activities such as soccer (Yong, 2011).This includes robotic limbs that require close coordination with intentional behaviors, or even devices which record behaviorallyrelevant electrical signals in one animal and uses that signal to stimulate the brain of another animal (Pais-Vieira et al., 2013).

Other effects of VE on cognition
Despite these examples from specific cognitive domains, it is not clear what the effects of VE actually are.As the neural response is characterized as semi-realistic by the authors, this suggests VE may not be perceived by the animals as a real world (the virtual representation falling partially into the uncanny valley featured in Figure 1).But how does the uncanny valley-like effects become manifest in sensorimotor integration, spatial cognition, and interactivity?These are not clearly emotional behaviors, but also involve making distinctions between the real and the artificial.In the case of sensorimotor integration, the uncanny valley might involve slightly unnatural movement patterns.This could involve a detectable discontinuity in the integration of vision and touch.Such an outcome could be registered as an emotional ambiguity (e.g.what is this object?),which could in turn disrupt how the animal treats its environment.A similar outcome might be seen for spatial cognition in terms of disruptions of the spatial reference frame.Like sensorimotor integration, there is a reliance of multisensory integration as a seamless process.When this consistency is violated in terms of an animal's locational self-awareness (e.g.where am I?), an emotional response is triggered.However, in terms of interactivity, an uncanny valley-like emotional response is more straightforward.Interactivity involves interpersonal interactions with objects and agents, and so an uncanny valley-like response occurs in much the same way as predicted by the original theory.
Alternatively, the possibility exists that virtual worlds simply expose the diversity of responses to highly similar environmental phenomena.This is not only due to cross-talk between different cognitive processes, but also involves individual variation in learning abilities and attentional capacity.In human experiments that focus on the effects of training, subjects can be switched back and forth between virtual and real-world tasks (Rose et al., 2000).Ideally, the virtual condition should provide gains in expertise that are transferrable to the real world analogue task.A similar experimental approach might be used for disentangling the effects of a virtual environment (such as sub-threshold neuronal activations) on an animal.While it is impossible to know which interpretation is correct at this point, future experiments specifically focused on perceptual realism in animals might provide us with a clearer picture.

Illusion, space/time dilation, and virtual models
There may be other ways to understand the phenomena of illusion and space/time dilation independently of the three previous examples.Virtual models rely on two assumptions about the generalized animal response to virtuality supported by the previous experiments just reviewed.One assumption is that these responses are rooted in symbolic and adaptable representations of the sensory world.While there is scant evidence of higher-level representation in non-human animals, basic representational systems such as the ability to identify quantities and specific groupings of objects (numerosity) have been observed in animals ranging from fish (Agrillo et al., 2011) to macaques (Roitman et al., 2007).
Another assumption is that these representations may be subject to fictive conditioning.Fictive conditioning, which could be considered a form of associative learning, involves the acquisition of a learned response due to a stimulus via one sense that compensates for a lack of stimulus in another sense.One example of this is the supernumerary hand illusion in humans (Guterstam et al., 2011).In this phenomenon, information from one sense (vision) compensates for the lack of information from another sense (touch) to establish a stable (but fictive) association between the body and a third (prosthetic) arm.Yet despite such assumptions, there is an opportunity for systems neuroscientists to better understand the nuances of function for various pathways and processes.This is particularly true when comparing brain function between an animal subject to the effects of virtuality and a control animal behaving in the absence of virtual manipulation.
Returning to the issue of realism in VE, it is worth noting that whether or not non-human animals possess a bona-fide ToM is controversial.While behavioral tests have shown a propensity of reflective behavioral responses in certain species, the neural mechanisms of this are unclear.In addition, while the neural correlates for ToM in humans are fairly well-established (see Saxe, 2009), the neural correlates for mental behaviors in non-human taxa are not as well characterized.Despite these caveats and limitations, eliciting species-specific responses to virtual stimuli consistent with the uncanny valley effect should be quite possible.To explain how this might occur, we can turn to the work of Maravita & Iriki (2004).In this study, experimenters trained a monkey to use a physical rake to retrieve objects from the environment.Electrophysiological and behavioral evidence post-training suggests that the rake had become incorporated into the animal's body schema (Macaluso & Maravita, 2010), as the tool becomes an extension of the arm.
In extending the Uncanny Valley model to virtual environments, it is generally true that objects become more real as their fidelity increases.However, as they are incorporated into the body schema, they become less emotionally salient as real objects.This dropoff is not observed for physical objects (Carlson et al., 2010), but is predicted to occur for virtual objects even of high fidelity.Finally, once the individual is fully immersed in the VE and becomes acclimated to the use of the virtual object, the virtual object then becomes fully consistent with the body's self-representation and sensory representation of the surrounding environment.In this sense, the virtual becomes real, and in some cases serves as a link between affect and cognition (Lewis & Lloyd, 2010).The extent to which this is true will partially determine the future potential of using VE in animal contexts.

Key features of a virtual architecture for illusion
A virtual representation for illusion follows three sets of observations.The first involves the sensory systems that are engaged by the environment.Due to the immersive and flexible aspects of VEs, behaving animals can engage the environment in a naturalistic fashion.This includes engaging an environmental stimulus in a way analogous to behaviors such as foraging, free navigation, and mating.Therefore, considering the connections between higher-level cognition (e.g.attention) and psychophysiological phenomena (e.g.microsaccades) might be useful in selectively manipulating the input (Otero-Millan et al., 2012).In immersive contexts, the selective decoupling of vision from touch/proprioception and even audition is very important.
The use of VE systems also results in neural correlates that are distinct from real world analogues in humans, in concrete forms such as comparisons between static images and animated video (Han et al., 2005), or 2-D versus 3-D images of hand movements (Perani et al., 2001).While the sensory systems are engaged during interactions with virtual environments, areas related to multisensory integration and memory consolidation are also engaged.This is particularly true for long-lived illusions that are more than the by-product of visual after effects.As a result of this neural and sensory engagement, we should expect certain behavioral dynamics that correspond with those exhibited in the natural world.This is a consequence of behaviors being engaged in context.Ideally, an animal should produce a behavioral response to the illusion that is similar or identical to the same stimulus in the natural world.More likely (and more common with less immersive stimuli) is a behavioral shift that does not mimic the real world.This can be due to a lack of realism in the virtual stimuli, but may also be due to a lack of contextual cues.
This expected result is based on the idea that once a virtual environment reaches a certain level of realness, the brain can no longer distinguish between real and virtual stimuli.In the case of highly immersive environments, there may be an augmented effect on cognitive processes such as attention and memory (Ragan et al., 2010).Yet much like in the case of the uncanny valley, there is a regime where the brain treats virtual stimuli very differently from their physical world counterparts.Therefore, we can use informed speculation to better characterize the theoretical relationship between a continuous measure of immersion and task performance.The general variable called performance indicates a potential measure of goal-oriented behavior (e.g.swimming orientation, target accuracy) relative to a real-world control.
In the cases of space/time dilation and illusion, we can make an educated guess as to what the consequences on performance should look like.For example, the predictions for space/time dilation should show a roughly linear relationship between the degree of immersion and performance.In this case, immersion can be operationalized as the degree of exposure an organism has to a VE system.Generally, the degree of immersion increases with the level of performance.On the other hand, previous experience with a specific set of perceptual cues might change this response in certain individuals.Other types of responses might also be possible.
A secondary prediction is that there should be a tendency for a flattening out of the response curve at very high and very low levels of immersion, as immediate distinctions between the real and virtual worlds become impossible.
By contrast, the predictions for illusion might involve an inverted U-shaped relationship between performance and environmental realism.As the amount of environmental realism increases from very low resolution simulation of the environment, performance should increase.Yet for very high resolution simulations, where multiple sensory modalities are simulated at very-high fidelity, performance should drop off.However, any such response would likely be expertise-dependent (C.Bohil, pers.comm.), and might be very different when the stimuli are significantly different from what is normally experienced by the organism.The Uncanny Valley effect and inverted U-shaped relationship is expected to be most prominent in cases where stimuli are unexpected with respect to experience.This can in turn interfere with higher-level mechanisms involved in perception and action.
Key features of a virtual architecture for space/time dilation Similar questions to those that define illusory experiences in animals can also be asked in the context of space/time dilation.Depending on the degree of immersion, there are a range of sensory systems that could be engaged during space/time dilation.In mammals, this might include the visual and vestibular systems working in concert to register the location and position of the organism's body in the environment (Fetsch et al., 2012).Unlike illusion, multisensory integration must not be disrupted over long periods of time.
The neural substrates of space/time dilation involve structures related to learning and memory, spatial cognition, and time-keeping.
In mammals, these include the hippocampus (Jacobs et al., 1990) and frontostriatal-cerebellar connections (Stevens et al., 2007).In cases where space/time dilation is successfully achieved, we should expect enhanced activity in these regions.Space/Time dilation should lead to unique behavioral dynamics, very different from those expected from illusion.Highly-immersive environments should produce sped-up or slowed-down responses that are consistent with the type of space/time dilation employed.The outcome of space/time dilation is a learning effect that may reconstitute neural synchrony (Axmacher et al., 2006).

Challenges and future directions
There are a number of hurdles for eliciting the effects of virtuality (illusion and space/time dilation) in animals.Of course, these hurdles are not unique to non-human animals, as VE systems applied to humans are often far from an immersive experience.But animal models provide additional constraints in that systems reliant upon symbolic representations and fictive conditioning may not have much of an effect on the individual.While these are key and often complex features of human cognition, depending on the species they may be absent altogether in animal cognition.Taking this into consideration, the best strategy would be to tailor VE system content to specific animal species.In fishes, symbolism is likely absent and fictive conditioning must be done at a highly abstract level.
In other animal species such as birds or social insects, symbolism might be used as a means to mediate the encoding of memories.
Another consideration is the interaction between cognitive mechanisms such as attention, memory, and psychophysiological phenomena (e.g.arousal).These connections between neurocognitive mechanisms and cross-talk have been shown to be important in mediating human-VE interactions (Parsons & Courtney, 2011).
In non-human animals, the interaction of these mechanisms provides an opportunity to make a stronger link between affect and the effects of virtuality.This also provides a means to understand the traditionally affect-driven Uncanny Valley effect in the context of illusion and space/time dilation, which in their totality are products of higher-level cognition.
Even more interesting is the effect of decoupling affect or other psychophysiological responses from their cognitive context.A simple example might be a virtual version of the nictitating membrane response.This form of conditioned learning can lead to an effect called overexpectation (Rescorla, 2006), which can affect memory formation across taxa for both fear conditioning and perhaps even other forms of acquisition (Kehoe & White, 2004).Coupling simple mechanisms with VE systems might open up new avenues for manipulating and exploring higher-cognitive processes.
A vision for the future While there are many unknowns in terms of how animals respond to their environment, not to mention the diversity inherent in animal brains and sensory systems, we can nevertheless selectively manipulate these variables using virtual environments.In the broader scheme of animal cognition, parallels with human cognition can be drawn in to illustrate potential neural mechanisms that might be involved in producing behavioral effects observed across a range of experiments.While these effects constitute a relatively unexplored component of animal behavior, they may lead to new discoveries in animal cognition and perhaps in the genetic substrates of conserved animal behaviors (Figure 2).
Elicitation of these behavioral effects is dependent on the configuration of the virtual environment itself.Unlike natural environments, virtual environments are highly stereotyped and do not include much of the noise associated with biological realism (Dennett, 2013).Nevertheless, environmental realism can be high, and findings in human experiments suggest that this is not an epiphenomenon (Blascovich & Bailenson, 2011).In addition, virtual environments are highly flexible and provide an experimental test bed for exploring the potential richness of animal perceptual, cognitive, and social behavior (Bohil et al., 2011).Since there are a range of possible design configurations for animal research-oriented VE systems, many of which can be tailored to a scientific question and organism of interest, the possibilities for further application and future research are potentially endless.Furthermore, costs can be minimized through clever design features.
Tailoring the virtual world to the perceptual specializations of a given organism would help in this regard.One example is the high critical fusion frequency (CFF) of the housefly (Healy et al., 2013).Tightly-controlled environments can be constructed by using the fly's natural visual sampling rate as a baseline.The rate of presentation can then be systematically varied.Another example is the electrosensory and mechanosensory capabilities of sharks, rays, and certain bony fishes (Coombs et al., 2002).A VE system that models fluids and fluid dynamics in the sensory environment could enable the creation of perceptual ambiguities, which could then allow for the power of sensory illusion to be leveraged.These type of examples ultimately provides the experimentalist with a highlycontrollable, selectively enriched (Nithianantharajah & Hannan, 2006), and customizable environment.
The benefit of this might be also considered in terms of geneenvironment interactions (Figure 2).One way in which virtual environments might be able to assist in uncovering gene-environment interactions is by using a logic similar to that which twin studies rests upon.In twin studies, the genetic similarities of identical twins are used to control for unknown genetic variation (van Dongen et al., 2012).In a similar manner, virtual environments might be employed to control for unknown environmental noise.
For experimental purposes, a random sample exposed to the same highly-controlled environment is predicted to exhibit minimal environmental variation.This should allow for the effects of the genetic background to be magnified, enabling stronger associations between genes and behavior to be made.
With the rapid adoption and increasing affordability of nextgeneration sequencing technologies, it is now possible to target assays of a genome in combination with genome-wide association (GWAS) studies to uncover the genetic components of a trait.What is still a mystery are the interactions between genes, behavior, and environment.Gene sequencing combined with robust environmental control can elucidate some of these interactions, while also providing insights into the ultimate processing limits of functionally-distinct neural systems.
While the link between genotype and controllable environment is more speculative, the promise of VEs for the study of animal behavior and cognition is real and the returns can be immediate.I have shown how different forms of VE have been used to elucidate and perhaps even augment animal behavior.In fact, VE might be particularly useful in understanding particularly difficult-to-define problems such as neural coding (Kumar et al., 2010) and human-animal interaction (Wilson & Barker, 2003).Overall, however, VE systems provide a flexible mode of investigation for both general and specific mechanisms that govern brain and behavior.In addition, two specific types of manipulation (illusion and space/time dilation) can be used to produce novel experimental outcomes.These effects of virtuality provide an opportunity to advance the naturalistic study of animal brain and behavior.This article provides an interesting attempt to synthesize theories and empirical findings regarding the potential utility of virtual reality in animal research.I found many of the referenced papers to be quite interesting from my fairly localized perspective on this subject.Overall, however, I find the article to be excessively speculative, insufficiently referenced and often repetitive.The speculations are quite often so vague that they are essentially meaningless.
Below are some specific concerns: Application of the uncanny valley concept: It is a huge leap to assume there is an uncanny valley in all aspects of perception, the original idea is quite specific to "humanness" and there is very good reason to think that the "close but not quite" uncanny valley does not generalize beyond this.Why would such a non-linearity be expected to be present in virtual reality?This argues that a poorly instantiated VR would actually be better than a very good, but not quite perfect one.Secondly, the emotional valence that is plotted to produce the uncanny valley comes from the immediate reaction of a human observer to a robot.This should not be confused with the emotional valence that is acquired by learning in a virtual environment based on the specific task contingencies and rewards or punishments that may be presented.
A huge omission in this article is the concept of immersion vs. presence.The author refers to immersion in several places but fails to incorporate the critical concept of presence, which is defined as the psychological experience being "present" in the virtual environment.This concept is referred to variously as "engagement" and "performance."The absence of this important concept in the paper is perhaps the most serious underlying problem."Engagement" and "good performance" do not necessarily require much immersion and should not be considered evidence of presence: an animal or human can be quite engaged and perform well in very rudimentary "virtual environments" as long as proper feedback is given (rewards, punishment, etc.), without being present in the virtual environment.Fundamentally, immersion vs. presence gets at the distinction between sensation and perception.Illusion, a term used frequently in the article, operates at the perceptual level and is therefore somewhat akin to presence, but it's definition as "confusing" a virtual stimulus for a real one does not fully capture the complexity of being present in a virtual environment.Virtuality is another frequently used term in the article, but its precise definition and how it relates to the immersion vs. presence distinction is entirely unclear.
Several recent papers that are highly relevant to the paper should also be cited and discussed:

F1000Research
Several recent papers that are highly relevant to the paper should also be cited and discussed: Ravassard .( 2012) -.

-This paper et al
Multisensory control of hippocampal spatiotemporal selectivity directly compares "place cells" in real world and virtual environments in rats.

Cushman
. ( 2013 Overall I think a survey/position paper like the one presented here is relevant and helpful.The survey explores the application of technology for generating and manipulating virtual environments to the study of animal behaviour and cognition.Some important potential issues and benefits the practice raises (or might raise) have been explored in this initial version of the article.I still have some major reservations about the article as it stands and would recommend a few changes prior to its finalisation.
The article is very broad and, in many places highly speculative.The many claims phrased "should", "may", "potential" and "expected" (e.g.see para 1 of Introduction) would be more convincing to me if supported with strong arguments based on published or presented evidence.Since the paper is so broad in its scope this might require a very long paper indeed, perhaps an entire book!As an alternative, I would suggest narrowing the scope considerably (e.g. to consider experiments with one species or other experimental category such as Primates, insects, or something still more specific such as Zebrafish or Hymenoptera, or anything for which there is already a lot of evidence gathered in the literature) and focusing on how the arguments presented here might apply, and be supported, in that case.
A second reason for suggesting a reduction in scope relates to the differences in perceptual and cognitive systems between animals.For instance avian colour vision and hymenopteran colour vision are very different, as are the learning mechanisms these organisms seem to implement.Hence, discoveries in one case are not necessarily transferable between species.This makes claims about animal perception in general difficult to state with authority unless they are so broad as to be meaningless or at least unhelpful.I suspect many readers familiar with these issues would be skeptical about the utility of making generalisations at all.This contributes to my perception that F1000Research be skeptical about the utility of making generalisations at all.This contributes to my perception that the article is biting off a little more than can be chewed in a single publication.
The introduction of the concept of the uncanny valley to this context is interesting!As the author points out we need to be very careful about applying it beyond humans/primates.It would indeed be fascinating to test for the effect in other animals.The author needs to be careful though that their hypothesis that it might apply beyond primates and in specific cases, doesn't get confused even loosely with the "truth" of this, and hence its relevance.E.g. see para. 2 of "Current examples" where you indicate that the "effect suggests...".
Indeed in one fun collaboration, a researcher/artist team have tested for something related to the uncanny valley (although this wasn't their motivation) in honeybee attraction to artists' renditions of flowers ( ).Bees https://www.artlink.com.au/articles/3722/insects-as-art-lovers-bees-for-van-gogh/ are not fussy and can be trained to visit coloured squares of paper for rewards so I guess they fall at the "no uncanny valley" end of the spectrum.But I am not sure if any measure of initial confusion, discomfort, or any unusual neural activity, might have been recorded experimentally to ascertain the presence of the valley.
A group of researchers at Graz, Austria are working on robots, (not VEs), that are intended to influence the behaviour of animals/insects.This might well be research that could investigate the effect the author is considering ( ).I hear it took the http://zool33.uni-graz.at/artlife/node/208researchers some patient work to encourage bees to accept their robot hive-mates and I am unsure of the degree of their success!Additionally, this indicates another avenue besides complete virtuality for interacting with organisms as the author notes in para.3 of "Current examples" and later under "Sensorimotor integration" in the context of fish studies.
A note on the valley diagram: individual variation may spread horizontally too -i.e. the "pinch points" as the curves cross the x-axis in the graph might be better rendered as broad "fords" across the axis.Lastly, even if the author chooses to remain broad in your discussion of different animals being tested within virtual environments, please consider editing the article, and individual sentences, much more tightly.Perhaps delete references to robot work, noting it is being done, and focus on screen-based work.Or vice versa.Or discuss only the relevance of the uncanny valley.At the moment, as a reader I find the paper is a little too broad.A tighter focus would (for me) make it more enjoyable to read and more informative.
I have read this submission.I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.
No competing interests were disclosed.These points are in reference to the forthcoming second version of the paper: A) One reviewer cites the lack of references to the Presence literature.Both reviewers were apprehensive about the speculative nature of this paper.In an attempt to reconcile (perhaps incompletely) both of these concerns, I have added a section on how Presence might operate in animals engaged with VR systems and how the Psychological phenomenon of Presence might help us understand the introduced constructs of illusion and time/space dilation.Citation and connections with the Presence literature was avoided in the original version because it is hard to make the mechanistic link between non-human animal cognition and presence.However, some stated connection was requested by one of the reviewers.In this version, I have made an intellectual connection between human presence and illusion/dilation.This has also been discussed in the more general context of animal cognition.B) I have also tightened up the definitions of illusion and dilation to reflect the speculative nature of higher-order cognitive responses playing out on a conserved anatomical and sensory substrate.As with Presence, the uncanny valley phenomenon in non-human animals is also an analogical model.While formal mechanisms for Presence and sense-making are not expected to be found in animals, there is a need for perceptual consistency that belies animal cognition.This model allows us to deal with the sorts of responses we might expect as we push the limits of VR fidelity.C) I agree that a good VR system might yield more "real" perceptual outcomes than does reality itself.However, the inflection point in my graph actually involves a two-part response to the virtual stimulus.1) a response to the realism of a given stimulus.The phenomenon of immersion or "presence" should increase linearly with realism, but with breaks that resemble the uncanny valley.

Discuss this Article
2) a mismatch between the environmental realism of the stimulus (point #1) and the organism's ability to bring them together into unified percepts.This is the region in which the potential effectiveness of illusion and dilation are greater.D) I have also included a section which highlights innovation in the area of spatial cognition in rodents.E) Both reviewers had an issue with the scope of the paper.As it covers a large swath of animal phylogeny in addition to robotics, this can be overwhelming and ultimately not too informative nor consistent.Therefore, I removed the references to robots, with a slightly expanded disclaimer involving the diversity of animal brains and sensory systems: F) To increase the citation count, I have also made additions to the bibliography.The total number of references now stand at 87, which includes the additions suggested by one of the reviewers (Jesse Cushman).I don't feel that adding more references can be done without also expanding the scope of the article.
No competing interests were disclosed.Competing Interests:

Figure 1 .
Figure 1.A conceptual demonstration of the uncanny valley, adapted specifically for virtual environments.Realism (x-axis) represents the fidelity and/or resolution of this representation.Emotional valence (y -axis) represents the positive or negative emotional valence associated with a given representation of familiar objects or conspecifics.A: the first phase of the response curve, showing an initial rise in emotional valence with moderate degrees of realism.B: the second phase of the response curve, showing a dip and rebound in emotional valence at very high levels of realism.Shaded region represents hypothetical individual variation exhibited in the response.Figure adapted from the uncanny valley principle as originally proposed by Mori (1970).

Figure 2 .
Figure 2. The role of VE in the milieu of interactions (genes, behavior, and environment) that define an organism's world.
) -et al Multisensory control of multimodal behavior: do the legs know what the tongue is -This paper shows spatial navigation in virtual reality in rats and utilizes isolation of the visual and doing?auditory modalities combined with the simultaneous measure of two behavioral outputs.Aronov and Tank (2014) -Engagement of Neural Circuits Underlying 2D Spatial Navigation in a Rodent -This paper extensively characterizes place cells and grid cells in virtual reality.Virtual Reality System.Aghajan (2014) -et al.Impaired spatial selectivity and intact phase precession in two-dimensional virtual -This paper compares place cells in 2-D virtual and real world environments reality.I have read this submission.I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.No competing interests were disclosed.Technology, Monash University, Clayton, VIC, Australia

Table 2 . Comparison of three emerging areas of animal virtual environment research. Sensorimotor integration Spatial navigation Interactivity Organism
Zebrafish, Drosophila Mice, rats, moths Rats/robots/humans, monkeys/robots Gray et al. (2002)eriments in insects, virtual environments are used to dilate visual stimuli with regard to motor control.Gray et al. (2002)use the walls of a flight arena to present visual cues that mimic depth and motion to an immobilized insect.This was done in a specialized arena which is shown and discussed inGray et al. (2002)Spatial navigation Spatial navigation is perhaps the best understood of the three featured behaviors due to our extensive knowledge of neural mechanisms at both the structural (hippocampus) and single-cell (place and grid cell) levels.Indeed, virtual environments enable the development and confirmation of sophisticated theoretical models of spatial navigation.This is exactly what was done inHolscher  et al. (2005)andHarvey et al. (2009).In theHarvey et al. (2009)approach, a mouse is situated atop an air supported-spherical treadmill, and its head is fixed for purposes of in vivo measurement.The virtual environment consists of a projection-based visual display.
and Seelig et al. (2010).InSeelig et al. (2010), a head-fixation task is replicated by having a fly walk on an air-supported ball concurrent with the presentation of visual stimuli.Using this type of VE design, an integrated response was found in horizontal system neurons.Using systems such as these, adaptive behaviors can be initiated in a highly-controlled environment.This not only allows for a range of behavioral regimes to be explored, but multiscale (e.g.cellular and behavioral dynamics) experimental investigations as well.