TL;DR: In this article, the authors propose that the brain produces an internal representation of the world, and the activation of this internal representation is assumed to give rise to the experience of seeing, but it leaves unexplained how the existence of such a detailed internal representation might produce visual consciousness.
Abstract: Many current neurophysiological, psychophysical, and psychological approaches to vision rest on the idea that when we see, the brain produces an internal representation of the world. The activation of this internal representation is assumed to give rise to the experience of seeing. The problem with this kind of approach is that it leaves unexplained how the existence of such a detailed internal representation might produce visual consciousness. An alternative proposal is made here. We propose that seeing is a way of acting. It is a particular way of exploring the environment. Activity in internal representations does not generate the experience of seeing. The outside world serves as its own, external, representation. The experience of seeing occurs when the organism masters what we call the governing laws of sensorimotor contingency. The advantage of this approach is that it provides a natural and principled way of accounting for visual consciousness, and for the differences in the perceived quality of sensory experience in the different sensory modalities. Several lines of empirical evidence are brought forward in support of the theory, in particular: evidence from experiments in sensorimotor adaptation, visual “filling in,” visual stability despite eye movements, change blindness, sensory substitution, and color perception.
TL;DR: A series of functional magnetic resonance imaging studies revealing substantial evidence for a distinct cortical region in humans that responds selectively to images of the human body, as compared with a wide range of control stimuli.
Abstract: Despite extensive evidence for regions of human visual cortex that respond selectively to faces, few studies have considered the cortical representation of the appearance of the rest of the human body. We present a series of functional magnetic resonance imaging (fMRI) studies revealing substantial evidence for a distinct cortical region in humans that responds selectively to images of the human body, as compared with a wide range of control stimuli. This region was found in the lateral occipitotemporal cortex in all subjects tested and apparently reflects a specialized neural system for the visual perception of the human body.
TL;DR: This article trained monkeys to categorize computer-generated stimuli as "cat" and "dogs" using a morphing system to systematically vary stimulus shape and precisely define the category boundary, which reflected the category of visual stimuli even when a monkey was retrained with the stimuli assigned to new categories.
Abstract: The ability to group stimuli into meaningful categories is a fundamental cognitive process. To explore its neural basis, we trained monkeys to categorize computer-generated stimuli as "cats" and "dogs." A morphing system was used to systematically vary stimulus shape and precisely define the category boundary. Neural activity in the lateral prefrontal cortex reflected the category of visual stimuli, even when a monkey was retrained with the stimuli assigned to new categories.
TL;DR: Objects defined by a conjunction of four features can be retained in working memory just as well as single-feature objects, allowing many individual features to be retained when distributed across a small number of objects.
Abstract: Working memory can be divided into separate subsystems for verbal and visual information. Although the verbal system has been well characterized, the storage capacity of visual working memory has not yet been established for simple features or for conjunctions of features. The authors demonstrate that it is possible to retain information about only 3-4 colors or orientations in visual working memory at one time. Observers are also able to retain both the color and the orientation of 3-4 objects, indicating that visual working memory stores integrated objects rather than individual features. Indeed, objects defined by a conjunction of four features can be retained in working memory just as well as single-feature objects, allowing many individual features to be retained when distributed across a small number of objects. Thus, the capacity of visual working memory must be understood in terms of integrated objects rather than individual features.
TL;DR: There is evidence that most reading problems have a fundamental sensorimotor cause and good magnocellular function is essential for high motion sensitivity and stable binocular fixation, hence proper development of orthographic skills.
Abstract: Low literacy is termed ‘developmental dyslexia’ when reading is significantly behind that expected from the intelligence quotient (IQ) in the presence of other symptoms —incoordination, left ‐right confusions, poor sequencing —that characterize it as a neurological syndrome. 5‐10% of children, particularly boys, are found to be dyslexic. Reading requires the acquisition of good orthographic skills for recognising the visual form of words which allows one to access their meaning directly. It also requires the development of good phonological skills for sounding out unfamiliar words using knowledge of letter sound conversion rules. In the dyslexic brain, temporoparietal language areas on the two sides are symmetrical without the normal left-sided advantage. Also brain ‘warts’ (ectopias) are found, particularly clustered round the left temporoparietal language areas. The visual magnocellular system is responsible for timing visual events when reading. It therefore signals any visual motion that occurs if unintended movements lead to images moving off the fovea (‘retinal slip’). These signals are then used to bring the eyes back on target. Thus, sensitivity to visual motion seems to help determine how well orthographic skill can develop in both good and bad readers. In dyslexics, the development of the visual magnocellular system is impaired: development of the magnocellular layers of the dyslexic lateral geniculate nucleus (LGN) is abnormal; their motion sensitivity is reduced; many dyslexics show unsteady binocular fixation; hence poor visual localization, particularly on the left side (left neglect). Dyslexics’ binocular instability and visual perceptual instability, therefore, can cause the letters they are trying to read to appear to move around and cross over each other. Hence, blanking one eye (monocular occlusion) can improve reading. Thus, good magnocellular function is essential for high motion sensitivity and stable binocular fixation, hence proper development of orthographic skills. Many dyslexics also have auditory:phonological problems. Distinguishing letter sounds
TL;DR: The lateral geniculate nucleus, which is the thalamic relay of retinal information to cortex, provides a reasonable model for all of thalamus and allows theThalamus to provide a dynamic relay that affects the nature and format of information that reaches the cortex.
TL;DR: The results suggest that the parietal area is the most likely locus of this selection mechanism and that it acts by pointing to the spatial coordinates (or cortical coordinates) of items of interest rather than by holding a representation of the items themselves.
TL;DR: Evidence suggests that saccades trigger two distinct neural processes: a suppression of visual sensitivity, specific to the magnocellular pathway, that dampens the sensation of motion and a gross perceptual distortion of visual space in anticipation of the repositioning of gaze.
TL;DR: It is argued that viewing a visual index as performing a demonstrative or preconceptual reference function has far-reaching implications not only for a theory of situated vision, but also for suggesting a new way to look at why the primitive individuation of visual objects, or proto-objects, is so central in computing visual representations.
TL;DR: The results provide the strongest evidence to date for the existence of multisensory prior entry and support previous claims for attentional biases toward the visual modality and toward the right side of space.
Abstract: Despite 2 centuries of research, the question of whether attending to a sensory modality speeds the perception of stimuli in that modality has yet to be resolved. The authors highlight weaknesses inherent in this previous research and report the results of 4 experiments in which a novel methodology was used to investigate the effects on temporal order judgments (TOJs) of attending to a particular sensory modality or spatial location. Participants were presented with pairs of visual and tactile stimuli from the left and/or right at varying stimulus onset asynchronies and were required to make unspeeded TOJs regarding which stimulus appeared first. The results provide the strongest evidence to date for the existence of multisensory prior entry and support previous claims for attentional biases toward the visual modality and toward the right side of space. These findings have important implications for studies in many areas of human and animal cognition.
TL;DR: A dual-process approach to visual perception emerges, with the ecological-dorsal process transpiring mainly without conscious awareness, while the constructivist-ventral process is normally conscious.
Abstract: The two contrasting theoretical approaches to visual perception, the constructivist and the ecological, are briefly presented and illustrated through their analyses of space and size perception. Earlier calls for their reconciliation and unification are reviewed. Neurophysiological, neuropsychological, and psychophysical evidence for the existence of two quite distinct visual systems, the ventral and the dorsal, is presented. These two perceptual systems differ in their functions; the ventral system's central function is that of identification, while the dorsal system is mainly engaged in the visual control of motor behavior. The strong parallels between the ecological approach and the functioning of the dorsal system, and between the constructivist approach and the functioning of the ventral system are noted. It is also shown that the experimental paradigms used by the proponents of these two approaches match the functions of the respective visual systems. A dual-process approach to visual perception emerges from this analysis, with the ecological-dorsal process transpiring mainly without conscious awareness, while the constructivist-ventral process is normally conscious. Some implications of this dual-process approach to visual-perceptual phenomena are presented, with emphasis on space perception.
TL;DR: This work investigates the performance, by normal individual and subjects with a selective impairment in either motor or visual imagery, of an imagery task involving a mental rotation, and highlights the distinct but complementary contribution of covert motor and visual processes during mental rotation.
Abstract: Recent studies indicate that covert mental activities, such as simulating a motor action and imagining the shape of an object, involve shared neural representations with actual motor performance and with visual perception, respectively. Here we investigate the performance, by normal individual and subjects with a selective impairment in either motor or visual imagery, of an imagery task involving a mental rotation. The task involved imagining a hand in a particular orientation in space and making a subsequent laterality judgement. A simple change in the phrasing of the imagery instructions (first-person or third-person imagery) and in actual hand posture (holding the hands on the lap or in the back) had a strong impact on response time (RT) in normal subjects, and on response accuracy in brain-damaged subjects. The pattern of results indicates that the activation of covert motor and visual processes during mental imagery depends on both top-down and bottom-up factors, and highlights the distinct but complementary contribution of covert motor and visual processes during mental rotation.
TL;DR: The results indicate that VV emerges in vestibular patients if they have increased visual dependence and difficulty in resolving conflict between visual and vestibulo-proprioceptive inputs and it is argued that treating these patients with visual motion desensitization, e.g. repeated optokinetic stimulation, should be beneficial.
Abstract: Certain patients with balance disorders report a 'visual vertigo' in which their symptoms are provoked or aggravated by specific visual contexts (e.g. supermarkets, driving or movement of objects). In order to determine the causes of visual vertigo (VV), we assessed symptoms, anxiety and the influence of disorienting visual stimuli in 21 such patients. In 17 out of 21 patients, a peripheral vestibular disorder was diagnosed. Sixteen bilateral labyrinthine-defective subjects (LDS) and 25 normal subjects served as controls. Questionnaire assessment showed that the levels of trait anxiety and childhood motion sickness in the three subject groups were not significantly different. Reporting of autonomic symptoms and somatic anxiety was higher than normal in both patient groups but not significantly different between LDS and VV patients. Handicap levels were not different in the two patient groups, but the reporting of vestibular symptoms was higher in the VV than in the LDS group. The experimental stimuli required subjects to set the subjective visual vertical in three visual conditions: total darkness, in front of a tilted luminous frame (rod and frame test) and in front of a large disc rotating in the frontal plane (rod and disc test). Body sway was also measured in four visual conditions: eyes closed, eyes open, facing the tilted frame and during disc rotation. In psychophysical and postural tests, both LDS and VV patients showed: (i) a significant increase in the tilt of the visual vertical both with the static tilted frame and with the rotating disc; and (ii) an increased postural deviation whilst facing the tilted frame and the rotating disc. The ratio between sway path with eyes closed and eyes open (i.e. the stabilizing effect of vision) was increased in the LDS, but not in VV patients, compared with normal subjects. In contrast, the ratio between sway path during disc rotation and sway path during eyes open (i.e. the destabilizing effect of a moving visual stimulus) was increased in the VV patients but not in LDS. Taken together, these data show that VV patients have abnormally large perceptual and postural responses to disorienting visual environments. VV is not related to trait anxiety or a past history of motion sickness. The results indicate that VV emerges in vestibular patients if they have increased visual dependence and difficulty in resolving conflict between visual and vestibulo-proprioceptive inputs. It is argued that treating these patients with visual motion desensitization, e.g. repeated optokinetic stimulation, should be beneficial.
TL;DR: The event-related functional magnetic resonance imaging was used to compare activity in the human parietal cortex in two attention-switching paradigms and revealed distinct parietal areas concerned with visual attentional set shifts and visuomotor intentional set shifts.
Abstract: Event-related functional magnetic resonance imaging was used to compare activity in the human parietal cortex in two attention-switching paradigms. On each trial of the visual switching (VS) paradigm, subjects attended to one of two visual stimuli on the basis of either their color or shape. Trials were presented in blocks interleaved with cues instructing subjects to either continue attending to the currently relevant dimension or to switch to the other stimulus dimension. In the response switching (RS) paradigm, subjects made one of two manual responses to the single stimulus presented on each trial. The rules for stimulus-response mapping were reversed on different trials. Trials were presented in blocks interleaved with cues that instructed subjects to either switch stimulus-response mapping rules or to continue with the current rule. Brain activity at "switch" and "stay" events was compared. The results revealed distinct parietal areas concerned with visual attentional set shifts (VS) and visuomotor intentional set shifts (RS). In VS, activity was recorded in the lateral part of the intraparietal region. In RS, activity was recorded in the posterior medial intraparietal region and adjacent posterior superior and dorsomedial parietal cortex. The results also suggest that the basic functional organization of the intraparietal sulcus and surrounding regions is similar in both macaque and human species.
TL;DR: The results demonstrate the utility of complementary analyses for fMRI data and suggest that the cerebellum may play a significant role in visual perceptual processing and illustrate functional connectivity between frontal eye fields and prefrontal and parietal regions.
TL;DR: The authors conclude that the structure of linguistic spatial categories can be partially explained in terms of independently motivated perceptual processes.
Abstract: The present paper grounds the linguistic cdategorization of space in aspects of visual perception; specifically, the structure of projective spatial terms such as above are grounded in the process of attention and in vector-sum coding of overall direction. This is formalized in the attentional vector-sum (AVS) model. This computational model accurately predicts linguistic acceptability judgments for spatial terms, under a variety of spatial configurations. In 7 experiments, the predictions of the AVS model are tested against those of 3 competing models. The results support the AVS model and disconfirm its competitors. The authors conclude that the structure of linguistic spatial categories can be partially explained in terms of independently motivated perceptual processes.
TL;DR: It is found that only real actions in natural environment activated a visuospatial network including the right posterior parietal cortex, whereas virtual-reality conditions do not access the full motor knowledge available to the central nervous system.
TL;DR: A review and reanalysis of the new evidence, pro and con, resolves the challenges and yields a more clearly defined and significantly strengthened theory.
TL;DR: Monitoring subjects' eye movements during a visual search task suggested that visual search does have memory, and no evidence suggesting that fixations were guided by amnesic covert scans that scouted the environment for new items during fixations.
Abstract: By monitoring subjects' eye movements during a visual search task, we examined the possibility that the mechanism responsible for guiding attention during visual search has no memory for which locations have already been examined. Subjects did reexamine some items during their search, but the pattern of revisitations did notfit the predictions of the memory less search model. In addition, a large proportion of the refixations were directed at the target, suggesting that the revisitations were due to subjects' remembering which items had not been adequately identified. We also examined the patterns of fixations and compared them with the predictions of a memoryless search model Subjects' fixation patterns showed an increasing hazard function, whereas the memoryless model predicts a flat function. Lastly, we found no evidence suggesting that fixations were guided by amnesic covert scans that scouted the environment for new items during fixations. Results do not support the claims of the memorvless search model, and instead suggest that visual search does have memory.
TL;DR: It is shown that this motion-induced blindness (MIB) phenomenon is unlikely to reflect retinal suppression, sensory masking or adaptation, and might reflect a disruption of attentional processing, which shifts the system into a winner-takes-all mode.
Abstract: Cases in which salient visual stimuli do not register consciously are known to occur in special conditions, such as the presentation of dissimilar stimuli to the two eyes1 or when images are stabilized on the retina2. Here, we report a striking phenomenon of ‘visual disappearance’ observed with normal-sighted observers under natural conditions. When a global moving pattern is superimposed on high-contrast stationary or slowly moving stimuli, the latter disappear and reappear alternately for periods of several seconds. We show that this motion-induced blindness (MIB) phenomenon is unlikely to reflect retinal suppression, sensory masking or adaptation. The phenomenology observed includes perceptual grouping effects, object rivalry and visual field anisotropy. This is very similar to that found in other types of visual disappearance, as well as in clinical cases of attention deficits, in which partial invisibility might occur despite the primary visual areas being intact3. Disappearance might reflect a disruption of attentional processing, which shifts the system into a winner-takes-all mode, uncovering the dynamics of competition between object representations within the human visual system.
TL;DR: Three findings support the two competing hypotheses regarding the accumulation of visual information during scene viewing that visual representations do not necessarily decay upon the withdrawal of attention, but instead can be accumulated in memory from previously attended regions.
Abstract: What is the nature of the representation formed during the viewing of natural scenes? We tested two competing hypotheses regarding the accumulation of visual information during scene viewing. The first holds that coherent visual representations disintegrate as soon as attention is withdrawn from an object and thus that the visual representation of a scene is exceedingly impoverished. The second holds that visual representations do not necessarily decay upon the withdrawal of attention, but instead can be accumulated in memory from previously attended regions. Target objects in line drawings of natural scenes were changed during a saccadic eye movement away from those objects. Three findings support the second hypothesis. First, changes to the visual form of target objects (token substitution) were successfully detected, as indicated by both explicit and implicit measures, even though the target object was not attended when the change occurred. Second, these detections were often delayed until well after the change. Third, changes to semantically inconsistent target objects were detected better than changes to semantically consistent objects.
TL;DR: Results show that visual stimuli activate two cortical areas right from the beginning of the cortical response, and suggest that the anteromedial cuneus has the temporal position needed to interact with the primary visual cortex V1 and thereby to modify information transferred via V1 to extrastriate cortices.
Abstract: Proper understanding of processes underlying visual perception requires information on the activation order of distinct brain areas. We measured dynamics of cortical signals with magnetoencephalography while human subjects viewed stimuli at four visual quadrants. The signals were analyzed with minimum current estimates at the individual and group level. Activation emerged 55–70 ms after stimulus onset both in the primary posterior visual areas and in the anteromedial part of the cuneus. Other cortical areas were active after this initial dual activation. Comparison of data between species suggests that the anteromedial cuneus either comprises a homologue of the monkey area V6 or is an area unique to humans. Our results show that visual stimuli activate two cortical areas right from the beginning of the cortical response. The anteromedial cuneus has the temporal position needed to interact with the primary visual cortex V1 and thereby to modify information transferred via V1 to extrastriate cortices.
TL;DR: It is demonstrated here that the assumption on the illumination position is in fact biased to the above-left rather than directly above, and the possible origins of this singular bias on the illuminated position are discussed.
TL;DR: The 'proto-objects' revealed by studies of segmentation and attention may relate to other object-based notions in cognitive science, and the reverse influence is also apparent, suggesting an interactive architecture.
Abstract: Issues concerning selective attention provoke new questions about visual segmentation, and vice-versa. We illustrate this by describing our recent work on grouping under conditions of inattention, on change blindness for background events and the residual processing of undetected background changes, on modal versus amodal completion in visual search, and the differential effects of these two forms of completion on attentional processes, and on attentional modulation of lateral interactions thought to arise in early visual cortex. Many of these results indicate that segmentation processes substantially constrain attentional processes, but the reverse influence is also apparent, suggesting an interactive architecture. We discuss how the 'proto-objects' revealed by studies of segmentation and attention (i.e. the segmented perceptual units which constrain selectivity) may relate to other object-based notions in cognitive science, and we wrestle with their relation to phenomenal visual awareness.
TL;DR: It is argued that action does not resist visual illusions, and the effects on the motor system seem to be comparable to the effect on the perceptual system, which challenges the action vs. perception hypothesis in its current form.
TL;DR: This study found that many neurons in both area TE and perirhinal cortex came to elicit more similar neuronal responses to paired stimuli as learning proceeded, and this experience-dependent plasticity of sensory representations in the cerebral cortex may underlie the learning of associations between objects.
Abstract: Visual stimuli that are frequently seen together become associated in long-term memory, such that the sight of one stimulus readily brings to mind the thought or image of the other. It has been hypothesized that acquisition of such long-term associative memories proceeds via the strengthening of connections between neurons representing the associated stimuli, such that a neuron initially responding only to one stimulus of an associated pair eventually comes to respond to both. Consistent with this hypothesis, studies have demonstrated that individual neurons in the primate inferior temporal cortex tend to exhibit similar responses to pairs of visual stimuli that have become behaviorally associated. In the present study, we investigated the role of these areas in the formation of conditional visual associations by monitoring the responses of individual neurons during the learning of new stimulus pairs. We found that many neurons in both area TE and perirhinal cortex came to elicit more similar neuronal responses to paired stimuli as learning proceeded. Moreover, these neuronal response changes were learning-dependent and proceeded with an average time course that paralleled learning. This experience-dependent plasticity of sensory representations in the cerebral cortex may underlie the learning of associations between objects.
TL;DR: The FEF sends diverse delay activity signals related to movement, memory, and vision to the SC, where the signals may be used for saccade generation.
Abstract: Many neurons within prefrontal cortex exhibit a tonic discharge between visual stimulation and motor response. This delay activity may contribute to movement, memory, and vision. We studied delay activity sent from the frontal eye field (FEF) in prefrontal cortex to the superior colliculus (SC). We evaluated whether this efferent delay activity was related to movement, memory, or vision, to establish its possible functions. Using antidromic stimulation, we identified 66 FEF neurons projecting to the SC and we recorded from them while monkeys performed a Go/Nogo task. Early in every trial, a monkey was instructed as to whether it would have to make a saccade (Go) or not (Nogo) to a target location, which permitted identification of delay activity related to movement. In half of the trials (memory trials), the target disappeared, which permitted identification of delay activity related to memory. In the remaining trials (visual trials), the target remained visible, which permitted identification of delay activity related to vision. We found that 77% (51/66) of the FEF output neurons had delay activity. In 53% (27/51) of these neurons, delay activity was modulated by Go/Nogo instructions. The modulation preceded saccades made into only part of the visual field, indicating that the modulation was movement-related. In some neurons, delay activity was modulated by Go/Nogo instructions in both memory and visual trials and seemed to represent where to move in general. In other neurons, delay activity was modulated by Go/Nogo instructions only in memory trials, which suggested that it was a correlate of working memory, or only in visual trials, which suggested that it was a correlate of visual attention. In 47% (24/51) of FEF output neurons, delay activity was unaffected by Go/Nogo instructions, which indicated that the activity was related to the visual stimulus. In some of these neurons, delay activity occurred in both memory and visual trials and seemed to represent a coordinate in visual space. In others, delay activity occurred only in memory trials and seemed to represent transient visual memory. In the remainder, delay activity occurred only in visual trials and seemed to be a tonic visual response. In conclusion, the FEF sends diverse delay activity signals related to movement, memory, and vision to the SC, where the signals may be used for saccade generation. Downstream transmission of various delay activity signals may be an important, general way in which the prefrontal cortex contributes to the control of movement.
TL;DR: This study investigated the gradient of visual attention in 21 children, 11 children with specific reading disorder (SRD) or dyslexia and 10 children with normal reading skills, finding that RTs were directly proportional to the eccentricity of the target, and no visual field effect was observable.