Reporting and Interpreting Working Memory Performance in n-back Tasks (2024)

  • Journal List
  • Front Psychol
  • PMC5339218

As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsem*nt of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more: PMC Disclaimer | PMC Copyright Notice

Reporting and Interpreting Working Memory Performance in n-back Tasks (1)

Link to Publisher's site

Adrian Meule1,2,*

Author information Article notes Copyright and License information PMC Disclaimer

Working memory is an executive function, which involves holding information in mind and mentally working with it (Diamond, 2013). A widely used measure for the assessment of working memory function is the n-back task (Owen et al., 2005). Here, participants are typically instructed to monitor a series of stimuli and to respond whenever a stimulus is presented that is the same as the one presented n trials previously. Common versions are 2-back and 3-back tasks, in which participants have to respond to stimuli that have been presented two or three trials earlier. Zero-back and 1-back versions are also often used as control conditions.

In most studies, participants are required to respond with a button press to the relevant stimuli (= targets) and to withhold responses to distractor stimuli (= non-targets). Yet, there are also studies, in which participants are required to indicate for each trial whether the stimulus represents a target or a non-target (e.g., by pressing two different buttons; Jonides et al., 1997; Carlson et al., 1998; Perlstein et al., 2003; Harvey et al., 2005; Miller et al., 2009). Stimuli in classical n-back tasks are numbers or words, but pictorial versions, which display, for example, emotional scenes (Marx et al., 2011; Hur et al., in press), faces (Cromheeke and Mueller, 2016), or food (Meule et al., 2012; Meule, 2016) have also been used in recent years.

As dependent variables, most studies report response latencies (= reaction times) and accuracy (in %) or the number of errors. With increasing task difficulty (i.e., with increasing ns), reaction times usually increase and accuracy decreases (e.g., Jonides et al., 1997; Carlson et al., 1998; Perlstein et al., 2003; Harvey et al., 2005; Miller et al., 2009; Schmidt et al., 2009). Similarly, reaction times and accuracy are usually negatively correlated (e.g., Carter et al., 1998). In other words, higher reaction times are associated with a higher number of errors. Although this relationship exists, it appears that reaction times and accuracy have dissociable correlates. For example, Jaeggi et al. (2010) examined various n-back tasks and found several dissociations between reaction times and accuracy. For instance, higher accuracy (but not reaction times) in visuospatial, auditory, and dual 3-back tasks was correlated with higher fluid intelligence as measured with the Raven test. In visuospatial n-back tasks, reaction times (but not accuracy) were associated with reading span and digit span forward performance.

In addition to these findings, a recent study by Hur et al. (in press) further highlights the role of reaction times vs. accuracy. In that study, pictures of emotional scenes were used in a 0-back task (labeled as perception task) and a 2-back task (labeled as working memory task). The authors argued that there was a ceiling effect in accuracy and more meaningful variation in reaction times in the perception task and, thus, they focused on interpreting reaction time results. In the working memory task, however, there was more variability in accuracy and less variability in reaction times and, thus, they focused on interpreting accuracy results because “participants' efforts are generally focused more on performing the task accurately than responding as fast as they can” (p. 4).

In light of these findings, how is interpretation of results affected when associations for some n-back task performance indices can be found but not for others? For example, in two studies that either used emotional words (Kopf et al., 2013) or pictures of emotional scenes (Marx et al., 2011) it was found that accuracy (but not reaction times) differed as a function of emotional valence of the stimuli. In contrast, effects of emotional stimuli (here: faces) were only found for reaction times and not for accuracy in a recent study by Cromheeke and Mueller (2016). Yet, the authors concluded that “allocating attention to affective information improved working memory” (p. 295). It might be argued that it is at least debatable if emotional stimuli indeed affected working memory performance as participants' ability to discriminate between targets and non-targets was not influenced. What these examples illustrate is that reaction times and accuracy in n-back tasks should not be interpreted interchangeably. Specifically, I argue that it is not reasonable when different studies reach similar conclusions (e.g., that effects of certain stimuli on or certain group differences in working memory performance were found), although these conclusions are based on different dependent variables (e.g., on reaction times in one study and on accuracy in another study).

In addition to these considerations, what constitutes accuracy is surprisingly rarely defined in most reports and/or it includes different types of errors (e.g., Jonides et al., 1997; Carlson et al., 1998; Perlstein et al., 2003; Harvey et al., 2005; Miller et al., 2009; Dodds et al., 2011). In n-back tasks, participants can either correctly press a button in response to targets (= hits), incorrectly press a button in response to non-targets (= commission errors or false alarms), and incorrectly do not press a button in response to targets (= omission errors or misses). However, researchers often do not make this distinction (for an exception see, e.g., Schmidt et al., 2009). In contrast to motor inhibition tasks (e.g., Go/No-go tasks), in which the main measure of interest is commission errors (e.g., Newman et al., 1985), omission errors are more frequent than commission errors in n-back tasks. Of note, it appears that these two types of errors have different correlates and, thus, may represent different processes. For example, in two studies that used 2-back tasks with food and neutral pictures (Meule et al., 2012; Meule, 2016), reaction times were positively correlated with the number of omission errors [r(n = 70) = 0.440, p < 0.001 and r(n = 56) = 0.449, p = 0.001], but not with the number of commission errors [r(n = 70) = 0.095, p = 0.435 and r(n = 56) = 0.178, p = 0.188]. Omission and commission errors were unrelated [r(n = 70) = 0.093, p = 0.443 and r(n = 56) = 0.145, p = 0.285]. Moreover, in a study by Oberauer (2005), in which neutral words were used, only omission but not commission errors or reaction times were associated with measures of working memory capacity.

In conclusion, it is argued that researchers need to carefully interpret their findings derived from n-back tasks, particularly when these findings diverge depending on whether using reaction times or accuracy. Moreover, it appears necessary that researchers not only report accuracy but also differentiate between omission and commission errors. In addition, it may be preferable to report other task performance indices that are calculated from hits and false alarms such as discrimination index d′ and response bias C, as has been suggested by researchers who used modified versions of the n-back task (Kane et al., 2007; Haatveit et al., 2010). As the n-back task has been criticized for lacking clear associations with other working memory tasks (Kane et al., 2007; Jaeggi et al., 2010), using more fine-grained analyses of n-back task performance beyond reporting reaction times and accuracy may, therefore, reveal clearer insights about its validity as a measure of working memory performance, its neural or psychopathological correlates, and its utility in applied neuropsychology.

Author contributions

The author confirms being the sole contributor of this work and approved it for publication.

Conflict of interest statement

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Publication of this article was supported by the Open Access Publication Fund of the University of Salzburg.

References

  • Carlson S., Martinkauppi S., Rämä P., Salli E., Korvenoja A., Aronen H. J. (1998). Distribution of cortical activation during visuospatial n-back tasks as revealed by functional magnetic resonance imaging. Cereb. Cortex8, 743–752. 10.1093/cercor/8.8.743 [PubMed] [CrossRef] [Google Scholar]
  • Carter C. S., Perlstein W., Ganguli R., Brar J., Mintun M., Cohen J. D. (1998). Functional hypofrontality and working memory dysfunction in schizophrenia. Am. J. Psychiatry155, 1285–1287. 10.1176/ajp.155.9.1285 [PubMed] [CrossRef] [Google Scholar]
  • Cromheeke S., Mueller S. C. (2016). The power of a smile: stronger working memory effects for happy faces in adolescents compared to adults. Cogn. Emot30, 288–301. 10.1080/02699931.2014.997196 [PubMed] [CrossRef] [Google Scholar]
  • Diamond A. (2013). Executive functions. Annu. Rev. Psychol.64, 135–168. 10.1146/annurev-psych-113011-143750 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
  • Dodds C. M., Bullmore E. T., Henson R. N., Christensen S., Miller S., Smith M., et al.. (2011). Effects of donepezil on cognitive performance after sleep deprivation. Hum. Psychopharmacol. Clin. Exp.26, 578–587. 10.1002/hup.1248 [PubMed] [CrossRef] [Google Scholar]
  • Haatveit B. C., Sundet K., Hugdahl K., Ueland T., Melle I., Andreassen O. A. (2010). The validity of d prime as a working memory index: results from the “Bergen n-back task.”J. Clin. Exp. Neuropsychol.32, 871–880. 10.1080/13803391003596421 [PubMed] [CrossRef] [Google Scholar]
  • Harvey P. O., Fossati P., Pochon J. B., Levy R., LeBastard G., Lehéricy S., et al.. (2005). Cognitive control and brain resources in major depression: an fMRI study using the n-back task. Neuroimage26, 860–869. 10.1016/j.neuroimage.2005.02.048 [PubMed] [CrossRef] [Google Scholar]
  • Hur J., Iordan A. D., Dolcos F., Berenbaum H. (in press). Emotional influences on perception and working memory. Cogn. Emot. 10.1080/02699931.2016.1213703 [PubMed] [CrossRef] [Google Scholar]
  • Jaeggi S. M., Buschkuehl M., Perrig W. J., Meier B. (2010). The concurrent validity of the N-back task as a working memory measure. Memory18, 394–412. 10.1080/09658211003702171 [PubMed] [CrossRef] [Google Scholar]
  • Jonides J., Schumacher E. H., Smith E. E., Lauber E. J., Awh E., Minoshima S., et al.. (1997). Verbal working memory load affects regional brain activation as measured by PET. J. Cogn. Neurosci.9, 462–475. 10.1162/jocn.1997.9.4.462 [PubMed] [CrossRef] [Google Scholar]
  • Kane M. J., Conway A. R., Miura T. K., Colflesh G. J. (2007). Working memory, attention control, and the N-back task: a question of construct validity. J. Exp. Psychol. Learn. Mem. Cogn.33, 615–622. 10.1037/0278-7393.33.3.615 [PubMed] [CrossRef] [Google Scholar]
  • Kopf J., Dresler T., Reicherts P., Herrmann M. J., Reif A. (2013). The effect of emotional content on brain activation and the late positive potential in a word n-back task. PLoS ONE8:e75598. 10.1371/journal.pone.0075598 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
  • Marx I., Domes G., Havenstein C., Berger C., Schulze L., Herpertz S. C. (2011). Enhanced emotional interference on working memory performance in adults with ADHD. World J. Biol. Psychiatry12, 70–75. 10.3109/15622975.2011.599213 [PubMed] [CrossRef] [Google Scholar]
  • Meule A. (2016). Dieting and food cue-related working memory performance. Front. Psychol.7:1944. 10.3389/fpsyg.2016.01944 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
  • Meule A., Skirde A. K., Freund R., Vögele C., Kübler A. (2012). High-calorie food-cues impair working memory performance in high and low food cravers. Appetite59, 264–269. 10.1016/j.appet.2012.05.010 [PubMed] [CrossRef] [Google Scholar]
  • Miller K. M., Price C. C., Okun M. S., Montijo H., Bowers D. (2009). Is the n-back task a valid neuropsychological measure for assessing working memory?Arch. Clin. Neuropsychol.24, 711–717. 10.1093/arclin/acp063 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
  • Newman J. P., Widom C. S., Nathan S. (1985). Passive avoidance in syndromes of disinhibition: psychopathy and extraversion. J. Pers. Soc. Psychol.48, 1316–1327. 10.1037/0022-3514.48.5.1316 [PubMed] [CrossRef] [Google Scholar]
  • Oberauer K. (2005). Binding and inhibition in working memory: individual and age differences in short-term recognition. J. Exp. Psychol. Gen.134, 368–387. 10.1037/0096-3445.134.3.368 [PubMed] [CrossRef] [Google Scholar]
  • Owen A. M., McMillan K. M., Laird A. R., Bullmore E. (2005). N-back working memory paradigm: a meta-analysis of normative functional neuroimaging. Hum. Brain Mapp.25, 46–59. 10.1002/hbm.20131 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
  • Perlstein W. M., Dixit N. K., Carter C. S., Noll D. C., Cohen J. D. (2003). Prefrontal cortex dysfunction mediates deficits in working memory and prepotent responding in schizophrenia. Biol. Psychiatry53, 25–38. 10.1016/S0006-3223(02)01675-X [PubMed] [CrossRef] [Google Scholar]
  • Schmidt H., Jogia J., Fast K., Christodoulou T., Haldane M., Kumari V., et al.. (2009). No gender differences in brain activation during the N-back task: an fMRI study in healthy individuals. Hum. Brain Mapp.30, 3609–3615. 10.1002/hbm.20783 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

Articles from Frontiers in Psychology are provided here courtesy of Frontiers Media SA

Reporting and Interpreting Working Memory Performance in n-back Tasks (2024)

FAQs

How does the n-back task measure working memory? ›

Abstract. The N-back task requires participants to decide whether each stimulus in a sequence matches the one that appeared n items ago. Although N-back has become a standard "executive" working memory (WM) measure in cognitive neuroscience, it has been subjected to few behavioral tests of construct validity.

How to score n-back test? ›

In the n-back task, participants are presented with a sequence of stimuli (e.g., letters) one at a time and asked to compare the current stimulus to one presented n items prior in the sequence. When performing 2-back, the current stimulus is a target when it matches the stimulus presented two stimuli ago.

What is an example of a working memory task? ›

Below are some examples: Keeping a person's address in mind while being given directions. Keeping elements or the sequence of a story in mind before the person completes telling it. Dialing a telephone number that you were just told.

What are the tasks to measure working memory capacity? ›

One popular measure of working memory capacity is the operation span task, in which test participants are presented with a series of math problems followed by target words (for example, “12 × 8 = 96, WINE”). Participants must read the problem out loud, say whether it is true or false, and then read the target word.

How to do the n-back task? ›

In short, in the N-Back task, participants are presented a sequence of stimuli one-by-one. For each stimulus, they need to decide if the current stimulus is the same as the one presented N trials ago. The N can be 1 trials, 2 trials, 3 trials, etc. The higher the number, the more difficult the task.

What is the n-back test for ADHD? ›

The n-back task requires participants to respond to a series of stimuli by judging whether each stimulus is the same as the one presented n-items previously. The n-back task requires both active maintenance and updating of items in working memory as well as deleting items that are further back in a sequence.

What is the n-back test most likely related to? ›

Performance on the n-back task seems to be more closely correlated with performance on measures of fluid intelligence than it is with performance on other measures of working memory (which is also correlated with performance on measures of fluid intelligence).

Is the n-back test reliable? ›

N-back tasks (Kirchner, 1958) remain some of the most commonly used tests of working memory in the cognitive neuroscience field. They are a well-validated and a widely-used means of manipulating working memory capacity and its response requirements.

Does dual n-back improve working memory? ›

Dual tasks like dual N-back, as a computerized task, can update memories and they are active control and implicit task. Likewise, dual n-back is a visuo-auditory task, activating prefrontal cortex, precentral gyrus, parietal lobe, and caudate nucleus, and hence it improves capacity of WM (2, 11).

What is poor working memory a symptom of? ›

Weak working memory is a core difficulty for students with ADHD, Inattentive Type. Individuals with traumatic brain injury, deafness, oral language deficits or genetic disorders such as Down Syndrome are also more likely to have weak working memory.

What everyday task requires working memory? ›

We use working memory to meaningfully participate in everyday skills such as: Responding appropriately when having a conversation. Carrying out instructions. Reading an unknown word.

Can working memory be improved? ›

Meta-analyses have found that working memory (WM) can be improved with cognitive training; however, some authors have suggested that these improvements are mostly driven by biases in the measurement of WM, especially the use of similar tasks for assessment and training.

What is the best way to assess working memory? ›

How can you measure and evaluate working memory?
  1. Sequencing Test WOM-ASM: A series of balls with different numbers will appear on the screen. The user will have to memorize the series in order to repeat it later. ...
  2. Recognition Test WOM-REST: Three objects will appear on the screen.

What is the task to test working memory? ›

The n-back memory task requires subjects to recognize items that have been presented n-steps before. Since subjects must memorize the sequence of items in order to discover those repetitions that span multiple items, the task has strong face validity for being a working memory task.

What is the most common working memory test? ›

Some of the most common standardized tools used in the field include the Wechsler Intelligence Tests, such as the WPPSI-IV (ages 3-7), WISC-V (ages 6-16), and the WAIS-IV (ages 16-90), as well as the Woodco*ck Johnson Tests of Cognitive Abilities (WJ-IV Cog), The Delis Kaplan Executive Function System (DKEFS), and ...

Does dual n-back increase working memory? ›

Dual tasks like dual N-back, as a computerized task, can update memories and they are active control and implicit task. Likewise, dual n-back is a visuo-auditory task, activating prefrontal cortex, precentral gyrus, parietal lobe, and caudate nucleus, and hence it improves capacity of WM (2, 11).

Is the n-back task a measure of unstructured working memory capacity towards understanding its connection to other working memory tasks? ›

We suggest that n-back might be a valid measure of working memory, and that the ability to exploit contextual information is an important faculty captured by some versions of the change detection task.

What scale measures working memory? ›

The Working Memory Questionnaire (WMQ) is a self-administered scale, addressing three dimensions of working memory: short-term storage, attention, and executive control.

How does working memory work exactly? ›

Think of working memory as a temporary sticky note in the brain. It holds new information in place so the brain can work with it briefly and connect it with other information. For example, in math class, working memory lets kids “see” in their head the numbers the teacher is saying.

Top Articles
Latest Posts
Article information

Author: Eusebia Nader

Last Updated:

Views: 6140

Rating: 5 / 5 (80 voted)

Reviews: 95% of readers found this page helpful

Author information

Name: Eusebia Nader

Birthday: 1994-11-11

Address: Apt. 721 977 Ebert Meadows, Jereville, GA 73618-6603

Phone: +2316203969400

Job: International Farming Consultant

Hobby: Reading, Photography, Shooting, Singing, Magic, Kayaking, Mushroom hunting

Introduction: My name is Eusebia Nader, I am a encouraging, brainy, lively, nice, famous, healthy, clever person who loves writing and wants to share my knowledge and understanding with you.