The difference in positivity of the Russian and English lexicon: The big data approach

封面

如何引用文章

详细

Psychological cross-cultural studies have long noted differences in the degree of cognition positivity, or optimism, in various cultures. Herewith, the question whether the difference shows up at the level of the language lexicon remains unexplored. Linguistic positivity bias has been confirmed for a number of languages. The point of it is that most words have a positive connotation in the language. This begs the question: is linguistic positivity bias the same for different languages or not? In a sense, the issue is similar to the hypothesis of linguistic relativity suggesting the language impact on the human cognitive system. The problem has been researched only in one work (Dodds et al. 2015), where data on the positivity bias values are given for different languages and the comparison for each pair of languages is based on merely one pair of dictionaries. In the present study, we radically increase the computational baseline by comparing four English and five Russian dictionaries. We carry out the comparative study both at the level of vocabularies and at the level of texts of different genres. A new, previously untapped idea is to compare positivity ratings of translated texts. Also, English and Russian sentiment dictionaries are compared based on the scores of translation-stable words. The results suggest that the Russian language is somewhat slightly more positive than English at the level of vocabulary.

作者简介

Valery Solovyev

Kazan Federal University

Email: maki.solovyev@mail.ru
ORCID iD: 0000-0003-4692-2564

Doctor Habil. of Physical and Mathematical Sciences, Professor, Chief Researcher of “Text Analytics” Research Lab, Institute of Philology and Intercultural Communication of Kazan Federal University, Kazan, Russia. He is a member of the Presidium of the Interregional Association for Cognitive Research, author of four monographs and more than 60 publications on the computational linguistics.

Kazan, Russia

Anna Ivleva

Kazan Federal University

编辑信件的主要联系方式.
Email: ivleva.anna.igorevna@yandex.ru
ORCID iD: 0000-0002-2670-6795

Ph.D. in Engineering Sciences and Senior Researcher of “Linguistics and AI” Research Lab, Institute of Philology and Intercultural Communication of Kazan Federal University, Kazan, Russia. The main areas of her research interests are quantitative linguistics, natural language processing and translation studies.

Kazan, Russia

参考

  1. Madhusudhan, Aithal & Tan Chenhao. 2021. On positivity bias in negative reviews. https://arxiv.org/pdf/2106.12056.pdf
  2. Bochkarev, Vladimir, Valery Solovyev, Timofei Nestik & Anna Shevlyakova. 2023 Variations in average word valence of Russian books in response to social change over a century. Proceedings of the Artificial Intelligence and Natural Language Conference. Zap. Nauchn. Sem. POMI 529. 24-42.
  3. Bradley, Margaret M. & Peter. J. Lang. 1999. Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings. Technical Report C-1, The Center for Research in Psychophysiology, University of Florida.
  4. Boucher, Jerry & Charles E. Osgood. 1969. The Pollyanna hypothesis. Journal of Verbal Learning and Verbal Behavior 8. 1-8. https://doi.org/10.1016/S0022-5371(69)80002-2
  5. Dodds, Peter Sheridan, Eric M. Clark, Suma Desu, Morgan R. Frank, Andrew J. Reagan, Jake Rylnd Williams, Lewis Mitchell, Kameron Decker Harris, Isabel M. Kloumann, James P. Bagrow, Karine Megerdoomian, Matthew T. McMahon, Brian F. Tivnan & Christopher M. Danforth. 2015. Human language reveals a universal positivity bias. Proceedings of the National Academy of Sciences 112 (8). 2389-2394. https://doi.org/10.1073/pnas.1411678112
  6. Dodds, Peter Sheridan & Christopher M. Danforth. 2010. Measuring the happiness of large-scale written expression: Songs, blogs, and presidents. Journal of Happiness Studies 11. 444-456. https://doi.org/10.48550/arXiv.1703.09774
  7. Dodds, Peter Sheridan, Kameron Decker Harris, Isabel M. Kloumann, Catherine A. Bliss & Christopher M. Danforth. 2011. Temporal patterns of happiness and information in a global social network: Hedonometrics and twitter. PLoS ONE 6 (12). e26752. https://doi.org/10.1371/journal.pone.0026752
  8. Folk, Dunigan & Elizabeth Dunn. 2023. How can people become happier? A systematic review of preregistered experiments. Annual Review of Psychology 75.
  9. Frank, Morgan R., Lewis Mitchell, Peter Sheridan Dodds & Christopher M. Danforth. 2013. Happiness and the patterns of life: A study of geolocated tweets. Scientific Reports 3. 2625. https://doi.org/10.1038/srep02625
  10. Gallagher, Matthew W., Shane J. Lopez & Sarah D. Pressman. 2013. Optimism is universal: Exploring the presence and benefits of optimism in a representative sample of the world. Journal of Personality 81 (5). 429-440. https://doi.org/10.1111/jopy.12026
  11. Gower, Tricia, Kimberly S. Chiew, David Rosenfield & Holly J. Bowen. 2023. Positive biases and psychological functioning during the coronavirus disease 2019 pandemic. Cognition and Emotion 37. 1-9. https://doi.org/10.1080/02699931.2023.2221022
  12. Hills, Thomas T., Eugenio Proto, Daniel Sgroi & Chanuki Illushka Seresinhe. 2019. Historical analysis of national subjective wellbeing using millions of digitized books. Nature Human Behaviour 3 (12). 1271-1275. https://doi.org/10.1038/s41562-019-0750-z
  13. Iliev, Rumen, Joe Hoover, Morteza Dehghani & Robert Axelrod. 2016. Linguistic positivity in historical texts reflects dynamic environmental and psychological factors. Proceedings of the National Academy of Sciences 113 (49). E7871-E7879. https://doi.org/10.1073/pnas.1612058113
  14. Jackson, Joshua Conrad, Joseph Watts, Teague R. Henry, Johann. M. List, Robert Forkel, Peter Mucha, Simon J. Greenhill, Russell D. Gray & Kristen A. Lindquist. 2019. Emotion semantics show both cultural variation and universal structure. Science 366 (6472). 1517-1522. https://doi.org/10.1126/science.aaw8160
  15. Jacobs, Arthur M., Berenike Herrmann, Gerhard Lauer, Jana Lüdtke & S. Schroeder. 2020. Sentiment analysis of children and youth literature: Is there a Pollyanna effect? Frontiers in Psychology 11. https://doi.org/10.3389/fpsyg.2020.574746
  16. Jaidka, Kokil. 2022. Cross-platform-and subgroup-differences in the well-being effects of Twitter, Instagram, and Facebook in the United States. Scientific Reports 12 (1). 3271. https://doi.org/10.1038/s41598-022-07219-y
  17. Ji, Li-Jun, Thomas. I. Vaughan-Johnston, Zhiyong Zhang, Jill A. Jacobson, Ning Zhang & Xiaoye Huang. 2021. Contextual and cultural differences in positive thinking. Journal of Cross-Cultural Psychology 52 (5). 449-467. https://doi.org/10.1177/00220221211020442
  18. Kassinove, Howard & Denis G. Sukhodolsky. 1995. Optimism, pessimism and worry in Russian and American children and adolescents. Journal of Social Behavior & Personality 10 (1). 157-168.
  19. Kay, Paul & Chad K. McDaniel. 1978. The linguistic significance of meanings of basic color terms. Language 54 (3). 610-646. https://doi.org/10.2307/412789
  20. Kirchner-Häusler, Alexander, Michael Boiger, Yukiko Uchida, Yoko Higuchi, A. Uchida & Batja Mesquita. 2022. Relatively happy: The role of the positive-to-negative affect ratio in Japanese and Belgian couples. Journal of Cross-Cultural Psychology 53 (1). 66-86. https://doi.org/10.3389/fpsyg.2020.01048
  21. Kloumann, Isabel M., Christopher M. Danforth, Kameron Decker Harris, Catherine A. Bliss & Peter Sheridan Dodds. 2012. Positivity of the English Language. PLoS ONE 7 (1). e29484. https://doi.org/10.48550/arXiv.1108.5192
  22. Koltsova, Olesya Yu., Svetlana V. Alexeeva & Sergey N. Kolcov. 2016. An opinion word lexicon and a training dataset for Russian sentiment analysis of social media. Computational Linguistics and Intellectual Technologies - Proceedings of the International Conference “Dialog” 277-287.
  23. Kotel’nikov, Evgeniy V., Elena V. Razova, Anastasiya V. Kotelnikova & Sergey V. Vychegzhanin. 2020. Modern sentiment lexicons for opinion mining in English and Russian (analytical survey). Informacionnye Processy i Sistemy 12. 16-33.
  24. Kulagin, Denis I. 2021. Publicly available sentiment dictionary for the Russian language KartaSlovSent. Computational Linguistics and Intellectual Technologies - Proceedings of the International Conference “Dialog” 20. 1106-1119.
  25. Kušen, Ema, Mark Strembeck & Mauro Conti. 2019. Emotional valence shifts and user behavior on Twitter, Facebook, and YouTube. Influence and Behavior Analysis in Social Networks and Social Media. 63-83. https://doi.org/10.1007/978-3-030-02592-2_4
  26. Larina, Tatiana & Douglas Mark Ponton. 2022. I wanted to honour your journal, and you spat in my face: emotive (im) politeness and face in the English and Russian blind peer review. Journal of Politeness Research 18 (1). 201-226.
  27. Liu, Bing. 2012. Sentiment Analysis and Opinion Mining. Springer.
  28. McKee, Gerard T., David D. Malvern & Brian James Richards. 2000. Measuring vocabulary diversity using dedicated software. Literary and Linguistic Computing 15 (3). 323-337. https://doi.org/10.1093/llc/15.3.323
  29. Mitchell, Lewis, Kameron Decker Harris, Morgan R. Frank, Peter Sheridan Dodds & Christopher M. Danforth. 2013. The geography of happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PLoS ONE 8 (5). e64417. https://doi.org/10.48550/arXiv.1302.3299
  30. Mohammad, Saif M. 2018. Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 English words. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 174-184. https://doi.org/10.18653/v1/P18-1017
  31. Osgood, Charles E. 1952. The nature and measurement of meaning. Psychological Bulletin 49. 197-237.
  32. Pang, Bo & Lillian Lee. 2008. Opinion mining and sentiment analysis (English). Foundations and Trends in Information Retrieval 2. 1-135. https://doi.org/10.1561/1500000011
  33. Panou, Despoina. 2013. Equivalence in translation theories: A critical evaluation. Theory and Practice in Language Studies 3 (1). 1-6. https://doi.org/10.4304/tpls.3.1.1-6
  34. Reagan, Andrew J., Christopher M. Danforth, Brian F. Tivnan, Jake Ryland Williams & Peter Sheridan Dodds. 2017. Sentiment analysis methods for understanding large-scale texts: A case for using continuum-scored words and word shift graphs. EPJ Data Science 6. 1-21. https://doi.org/10.1140/epjds/s13688-017-0121-9
  35. Solnyshkina, Marina I., Valery D. Solovyev, Elzara V. Gafiyatova & Ekaterina V. Martynova. 2023. Text complexity as interdisciplinary problem. Voprosy Kognitivnoy Lingvistiki 1. 18-39. https://doi.org/10.20916/1812-3228-2022-1-18-39
  36. Solovyev, Valery, Musa Islamov & Venera Bayrasheva. 2022. Dictionary with the evaluation of positivity/negativity degree of the Russian words. In S. R. Mahadeva Prasanna, Alexey Karpov, K. Samudra Vijaya & Shyam S. Agrawal (eds.), Speech and computer. SPECOM 2022. Lecture notes in computer science, 13721, 651-664. Springer.
  37. Solovyev, Valery, Vladimir Ivanov. 2014. Dictionary-based problem phrase extraction from user reviews. In Petr Sojka, Alex Horák, Ivan Kopeček & Karel Pala (eds.), Text, speech and dialogue. TSD 2014. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in Bioinformatics), LNAI, 8655, 225-232. Springer.
  38. Solovyev, Valery D., Marina I. Solnyshkina & Danielle S. McNamara. 2022. Computational linguistics and discourse complexology: Paradigms and research methods. Russian Journal of Linguistics 26 (2). 275-316. https://doi.org/10.22363/2687-0088-31326
  39. Tausczik, Yla R. & James W. Pennebaker. 2014. The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology 29 (1). 24-54. https://doi.org/10.1177/0261927X09351676
  40. Tetior, Alexander N. 2015. The emotional sphere of a person: The predominance of negative emotions. Eurasian Union of Scientists 2 (11). 78-81.
  41. Warriner, Amy Beth, Victor Kuperman & Marc Brysbaert. 2013. Norms of valence, arousal, and dominance for 13,915 English lemmas. Behavior Research Methods 45. 1191-1207. https://doi.org/10.3758/s13428-012-0314-x
  42. Warriner, Amy Beth & Victor Kuperman. 2015. Affective biases in English are bi-dimensional. Cognition and Emotion 29 (7). 1147-1167. https://doi.org/10.1080/02699931.2014.968098
  43. Whorf, Benjamin Lee. 2012. Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf. In John B. Carroll, Stephen C. Levinson & Penny Lee (eds.). The MIT Press.
  44. Wierzbicka, Anna. 1992. The Russian language. Semantics, Culture and Cognition: Universal Human Concepts in Culture-specific Cofigurations. 395-441. New York: Oxford University Press.
  45. Yoonjung, Choi & Wiebe Janyce. 2014. +/-EffectWordNet: Sense-level lexicon acquisition for opinion inference. Proc. of EMNLP. 1181-1191. https://doi.org/10.3115/v1/D14-1125
  46. Hedonometer (English). Retrieved from https://hedonometer.org/words/labMT-en-v2/ (accessed 18 March 2024).
  47. BRM. Retrieved from https://github.com/meadej/twitter-sentiment-analysis?ysclid=lh0bctge 6l466946169 (accessed 18 March 2024).
  48. ANEW. Retrieved from https://github.com/eriq-augustine/sentiment-data/blob/master/anew.csv (accessed 18 March 2024).
  49. NRC-VAD. Retrieved from https://emilhvitfeldt.github.io/textdata/reference/lexicon_nrc_vad.html (accessed 18 March 2024).
  50. KFU Sentiment. Retrieved from https://kpfu.ru/tehnologiya-sozdaniya-semanticheskih-elektronnyh.html (accessed 18 March 2024).
  51. KFU Sentiment BERT. Retrieved from https://kpfu.ru/tehnologiya-sozdaniya-semanticheskih-elektronnyh.html (accessed 18 March 2024).
  52. KartaSlovSent. Retrieved from https://kartaslov.ru (accessed 18 March 2024).
  53. Hedonometer (Russian). Retrieved from https://hedonometer.org/words/labMT-ru-v2/ (accessed 18 March 2024).
  54. LinisCrowd. Retrieved from http://linis-crowd.org/ (accessed 18 March 2024).

版权所有 © Solovyev V., Ivleva A., 2024

Creative Commons License
此作品已接受知识共享署名-非商业性使用 4.0国际许可协议的许可。

##common.cookie##