Victor Kuperman
McMaster University
Publication
Featured research published by Victor Kuperman.
Behavior Research Methods | 2013
Amy Beth Warriner; Victor Kuperman; Marc Brysbaert
Information about the affective meanings of words is used by researchers working on emotions and moods, word recognition and memory, and text-based sentiment analysis. Three components of emotions are traditionally distinguished: valence (the pleasantness of a stimulus), arousal (the intensity of emotion provoked by a stimulus), and dominance (the degree of control exerted by a stimulus). Thus far, nearly all research has been based on the ANEW norms collected by Bradley and Lang (1999) for 1,034 words. We extended that database to nearly 14,000 English lemmas, providing researchers with a much richer source of information, including gender, age, and educational differences in emotion norms. As an example of the new possibilities, we included stimuli from nearly all of the category norms (e.g., types of diseases, occupations, and taboo words) collected by Van Overschelde, Rawson, and Dunlosky (Journal of Memory and Language 50:289-335, 2004), making it possible to include affect in studies of semantic memory.
Behavior Research Methods | 2012
Victor Kuperman; Hans Stadthagen-Gonzalez; Marc Brysbaert
We present age-of-acquisition (AoA) ratings for 30,121 English content words (nouns, verbs, and adjectives). For data collection, this megastudy used the Web-based crowdsourcing technology offered by the Amazon Mechanical Turk. Our data indicate that the ratings collected in this way are as valid and reliable as those collected in laboratory conditions (the correlation between our ratings and those collected in the lab from U.S. students reached .93 for a subsample of 2,500 monosyllabic words). We also show that our AoA ratings explain a substantial percentage of the variance in the lexical-decision data of the English Lexicon Project, over and above the effects of log frequency, word length, and similarity to other words. This is true not only for the lemmas used in our rating study, but also for their inflected forms. We further discuss the relationships of AoA with other predictors of word recognition and illustrate the utility of AoA ratings for research on vocabulary growth.
Language and Cognitive Processes | 2008
Victor Kuperman; Raymond Bertram; R. Harald Baayen
This paper explores the time-course of morphological processing of trimorphemic Finnish compounds. We find evidence for the parallel access to full-forms and morphological constituents diagnosed by the early effects of compound frequency, as well as early effects of left constituent frequency and family size. We also observe an interaction between compound frequency and both the left and the right constituent family sizes. Furthermore, our data show that suffixes embedded in the derived left constituent of a compound are efficiently used for establishing the boundary between compounds’ constituents. The success of segmentation of a compound is demonstrably modulated by the affixal salience of the embedded suffixes. We discuss implications of these findings for current models of morphological processing and propose a new model that views morphemes, combinations of morphemes and morphological paradigms as probabilistic sources of information that are interactively used in recognition of complex words.
Journal of Experimental Psychology: General | 2014
Victor Kuperman; Zachary Estes; Marc Brysbaert; Amy Beth Warriner
Emotion influences most aspects of cognition and behavior, but emotional factors are conspicuously absent from current models of word recognition. The influence of emotion on word recognition has mostly been reported in prior studies on the automatic vigilance for negative stimuli, but the precise nature of this relationship is unclear. Various models of automatic vigilance have claimed that the effect of valence on response times is categorical, an inverted U, or interactive with arousal. In the present study, we used a sample of 12,658 words and included many lexical and semantic control factors to determine the precise nature of the effects of arousal and valence on word recognition. Converging empirical patterns observed in word-level and trial-level data from lexical decision and naming indicate that valence and arousal exert independent monotonic effects: Negative words are recognized more slowly than positive words, and arousing words are recognized more slowly than calming words. Valence explained about 2% of the variance in word recognition latencies, whereas the effect of arousal was smaller. Valence and arousal do not interact, but both interact with word frequency, such that valence and arousal exert larger effects among low-frequency words than among high-frequency words. These results necessitate a new model of affective word processing whereby the degree of negativity monotonically and independently predicts the speed of responding. This research also demonstrates that incorporating emotional factors, especially valence, improves the performance of models of word recognition.
Journal of Experimental Psychology: Human Perception and Performance | 2013
Victor Kuperman; Julie A. Van Dyke
The importance of vocabulary in reading comprehension emphasizes the need to accurately assess an individual’s familiarity with words. The present article highlights problems with using occurrence counts in corpora as an index of word familiarity, especially when studying individuals varying in reading experience. We demonstrate via computational simulations and norming studies that corpus-based word frequencies systematically overestimate strengths of word representations, especially in the low-frequency range and in smaller-size vocabularies. Experience-driven differences in word familiarity prove to be faithfully captured by the subjective frequency ratings collected from responders at different experience levels. When matched on those levels, this lexical measure explains more variance than corpus-based frequencies in eye-movement and lexical decision latencies to English words, attested in populations with varied reading experience and skill. Furthermore, the use of subjective frequencies removes the widely reported (corpus) Frequency × Skill interaction, showing that more skilled readers are equally faster in processing any word than the less skilled readers, not disproportionally faster in processing lower frequency words. This finding challenges the view that the more skilled an individual is in generic mechanisms of word processing, the less reliant he or she will be on the actual lexical characteristics of that word.
Quarterly Journal of Experimental Psychology | 2013
Victor Kuperman; Denis Drieghe; Emmanuel Keuleers; Marc Brysbaert
We assess the amount of shared variance between three measures of visual word recognition latencies: eye movement latencies, lexical decision times, and naming times. After partialling out the effects of word frequency and word length, two well-documented predictors of word recognition latencies, we see that 7–44% of the variance is uniquely shared between lexical decision times and naming times, depending on the frequency range of the words used. A similar analysis of eye movement latencies shows that the percentage of variance they uniquely share either with lexical decision times or with naming times is much lower. It is 5–17% for gaze durations and lexical decision times in studies with target words presented in neutral sentences, but drops to 0.2% for corpus studies in which eye movements to all words are analysed. Correlations between gaze durations and naming latencies are lower still. These findings suggest that processing times in isolated word processing and continuous text reading are affected by specific task demands and presentation format, and that lexical decision times and naming times are not very informative in predicting eye movement latencies in text reading once the effects of word frequency and word length are taken into account. The difference between controlled experiments and natural reading suggests that reading strategies and stimulus materials may determine the degree to which the immediacy-of-processing assumption and the eye–mind assumption apply. Fixation times are more likely to exclusively reflect the lexical processing of the currently fixated word in controlled studies with unpredictable target words rather than in natural reading of sentences or texts.
Frontiers in Psychology | 2013
Victor Kuperman
The present study supplements research on semantic effects in word processing by focusing on the role that meanings of morphemes play in recognition of complex words. We present an overview of behavioral effects of six semantic properties characterizing the emotional and sensory connotations of English compounds and their morphemes, as well as their semantic richness. Semantics of compounds affected latencies to those compounds, and semantics of morphemes affected latencies to those morphemes presented as isolated words. Yet semantics of morphemes had little bearing on recognition of compounds, with the exception of longer recognition times for compounds with emotionally negative morphemes (e.g., seasick). We interpret the data as evidence against obligatory decomposition and dual-route accounts of morphological processing and in favor of the naive discriminative learning account that posits independent, morphologically unmediated, and simultaneous access to all meanings activated by orthographic cues in the visual input. We discuss selectivity and division of attention as driving forces in complex word recognition.
Cognition & Emotion | 2015
Amy Beth Warriner; Victor Kuperman
A long-standing observation about the interface between emotion and language is that positive words are used more frequently than negative ones, leading to the Pollyanna hypothesis which alleges a predominantly optimistic outlook in humans. This paper uses the largest available collection of affective ratings as well as insights from linguistics to revisit the Pollyanna hypothesis as it relates to two dimensions of emotion: valence (pleasantness) and arousal (intensity). We identified systematic patterns in the distribution of words over a bi-dimensional affective space, which (1) run counter to and supersede most prior accounts, and (2) differ drastically between word types (unique, distinct words in the lexicon) and word tokens (number of occurrences of available words in the lexicon). We argue for two factors that shape affect in language and society: a pro-social benevolent communication strategy with its emphasis on useful and dangerous phenomena, and the structure of human subjective perception of affect.
Psychological Science | 2015
Bryor Snefjella; Victor Kuperman
Existing evidence shows that more abstract mental representations are formed and more abstract language is used to characterize phenomena that are more distant from the self. Yet the precise form of the functional relationship between distance and linguistic abstractness is unknown. In four studies, we tested whether more abstract language is used in textual references to more geographically distant cities (Study 1), time points further into the past or future (Study 2), references to more socially distant people (Study 3), and references to a specific topic (Study 4). Using millions of linguistic productions from thousands of social-media users, we determined that linguistic concreteness is a curvilinear function of the logarithm of distance, and we discuss psychological underpinnings of the mathematical properties of this relationship. We also demonstrated that gradient curvilinear effects of geographic and temporal distance on concreteness are nearly identical, which suggests uniformity in representation of abstractness along multiple dimensions.
Language and Cognitive Processes | 2013
Victor Kuperman; Raymond Bertram
The present study explores linguistic predictors and behavioural implications of the orthographic alternation between a spaced (bell tower), hyphenated (bell-tower), and concatenated (belltower) format observed in English compound words. On the basis of two English corpora, we model the evolution of spelling for compounds undergoing lexicalisation, as well as define the set of orthographic, distributional, and semantic properties of the compounds’ constituents that co-determine the preference for one of the available realisations. We explore iconicity and economy as competing motivations for both the diachronic change and synchronous preferences in spelling. Observed patterns of written production closely mirror the demands and strategies of recognition of compound words in reading. Orthographic choices that go against the reader’s economy of effort come with a high recognition cost, as evidenced in inflated lexical decision and naming latencies to concatenated compounds that occur in other spelling formats.