Recent advances in natural language processing (NLP) have produced general models that can perform complex tasks such as summarizing long passages and translating across languages. Here, we introduce a method to extract adjective similarities from language models as done with survey-based ratings in traditional psycholexical studies but using millions of times more text in a natural setting. The correlational structure produced through this method is highly similar to that of self- and other-ratings of 435 English terms reported by Saucier and Goldberg (1996a). The first three unrotated factors produced using NLP are congruent with those in survey data, with coefficients of 0.89, 0.79, and 0.79. This structure is robust to many modeling decisions: adjective set, including those with 1,710 (Goldberg, 1982) and 18,000 English terms (Allport & Odbert, 1936); the query used to extract correlations; and language model. Notably, Neuroticism and Openness are only weakly and inconsistently recovered. This is a new source of signal that is closer to the original (semantic) vision of the lexical hypothesis. The method can be applied where surveys cannot: in dozens of languages simultaneously, with tens of thousands of items, on historical text, and at extremely large scale for little cost. The code is made public to facilitate reproduction and fast iteration in new directions of research. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

Download full-text PDF

Source
http://dx.doi.org/10.1037/pspp0000443DOI Listing

Publication Analysis

Top Keywords

lexical hypothesis
8
natural language
8
english terms
8
deep lexical
4
hypothesis identifying
4
identifying personality
4
personality structure
4
structure natural
4
language
4
language advances
4

Similar Publications

Second-language speakers are more likely to strategically reuse the words of their conversation partners (Zhang & Nicol, 2022). This study investigates if this is also the case for lower-proficiency bilinguals from a bilingual community, who use language more implicitly, and if there is more alignment with lower than with higher proficiency, provided the words to be aligned to are all highly familiar. In two experiments, Spanish-English bilinguals took turns with a confederate to name and match pictures in Spanish.

View Article and Find Full Text PDF
Article Synopsis
  • Selective attention to the emotional quality (valence) of words enhances the early ability to differentiate emotions associated with relevant words while diminishing responses to irrelevant ones.
  • The study involved 58 participants who responded quickly to words of a certain emotional quality, revealing that short, high-frequency, and low-arousal words were more effectively processed in terms of emotional discrimination.
  • Results indicate that both emotional quality and arousal levels interact during initial processing, supporting the idea that these factors influence how we perceive and react to emotional words at a subconscious level.
View Article and Find Full Text PDF

Text-based automatic personality recognition (APR) operates at the intersection of artificial intelligence (AI) and psychology to determine the personality of an individual from their text sample. This covert form of personality assessment is key for a variety of online applications that contribute to individual convenience and well-being such as that of chatbots and personal assistants. Despite the availability of good quality data utilizing state-of-the-art AI methods, the reported performance of these recognition systems remains below expectations in comparable areas.

View Article and Find Full Text PDF
Article Synopsis
  • The Implicit Prosody Hypothesis (IPH) suggests that people create internal vocal patterns while reading silently, similar to those used in spoken language.
  • The study used EEG to analyze brain responses as participants read sequences of words with different stress patterns, revealing that unexpected stress in words triggered stronger brain reactions.
  • Results indicated that various brain wave activities correlate with rhythmic expectations in language, supporting the idea that the same neural networks are involved in processing both spoken and silently read language.
View Article and Find Full Text PDF

In a recent study, I demonstrated that large numbers of L2 (second language) speakers do not appear to influence the morphological or information-theoretic complexity of natural languages. This paper has three primary aims: First, I address recent criticisms of my analyses, showing that the points raised by my critics were already explicitly considered and analysed in my original work. Furthermore, I show that the proposed alternative analyses fail to withstand detailed examination.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!