Различие позитивности в лексиконах русского и английского языков: подход, основанный на больших данных

Обложка

Цитировать

Полный текст

Аннотация

В психологических кросс-культурных исследованиях давно замечены различия в степени позитивности мышления, или оптимизма, в различных культурах. Напрашивается вопрос, является ли преимущественная лингвистическая позитивность (linguistic positivity bias) одинаковой для разных языков или нет. В определенном смысле этот вопрос схож с гипотезой лингвистической относительности, касающейся влияния языка на когнитивную систему человека. Эта проблема рассматривалась только в одной работе (Dodds et al. 2015), в которой представлены данные о разной величине преимущественной лингвистической позитивности для разных языков и где сравнение для двух языков проводилось с использованием всего одной пары словарей. В настоящем исследовании мы существенно увеличиваем вычислительную базу, сравнивая английский и русский языки на основе 4 английских и 5 русских словарей. Сравнение проводится как на уровне лексикона языков, так и на уровне текстов разных жанров. Новой, ранее не использовавшейся идеей, является сопоставление рейтингов позитивности переводных текстов. Также словари английского и русского языков сравниваются по значениям рейтингов слов, устойчивых к переводу ( translation-stable words ). Результаты позволяют предположить, что на уровне лексикона русский язык несколько более позитивен, чем английский.

Об авторах

Валерий Дмитриевич Соловьев

Казанский федеральный университет

Email: maki.solovyev@mail.ru
ORCID iD: 0000-0003-4692-2564

доктор физико-математических наук, профессор, главный научный сотрудник НИЛ «Текстовая аналитика» Института филологии и межкультурной коммуникации Казанского федерального университета, Казань, Россия. Член президиума Межрегиональной ассоциации когнитивных исследований. Автор четырех монографий и более 60 публикаций по компьютерной лингвистике.

Казань, Россия

Анна Игоревна Ивлева

Казанский федеральный университет

Автор, ответственный за переписку.
Email: ivleva.anna.igorevna@yandex.ru
ORCID iD: 0000-0002-2670-6795

кандидат технических наук, старший научный сотрудник НИЛ «Лингвистика и искусственный интеллект» Института филологии и межкультурной коммуникации Казанского федерального университета, Казань, Россия. Основные сферы научных интересов: квантитативная лингвистика, обработка естественного языка, теория перевода.

Казань, Россия

Список литературы

  1. Madhusudhan, Aithal & Tan Chenhao. 2021. On positivity bias in negative reviews. https://arxiv.org/pdf/2106.12056.pdf
  2. Bochkarev, Vladimir, Valery Solovyev, Timofei Nestik & Anna Shevlyakova. 2023 Variations in average word valence of Russian books in response to social change over a century. Proceedings of the Artificial Intelligence and Natural Language Conference. Zap. Nauchn. Sem. POMI 529. 24-42.
  3. Bradley, Margaret M. & Peter. J. Lang. 1999. Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings. Technical Report C-1, The Center for Research in Psychophysiology, University of Florida.
  4. Boucher, Jerry & Charles E. Osgood. 1969. The Pollyanna hypothesis. Journal of Verbal Learning and Verbal Behavior 8. 1-8. https://doi.org/10.1016/S0022-5371(69)80002-2
  5. Dodds, Peter Sheridan, Eric M. Clark, Suma Desu, Morgan R. Frank, Andrew J. Reagan, Jake Rylnd Williams, Lewis Mitchell, Kameron Decker Harris, Isabel M. Kloumann, James P. Bagrow, Karine Megerdoomian, Matthew T. McMahon, Brian F. Tivnan & Christopher M. Danforth. 2015. Human language reveals a universal positivity bias. Proceedings of the National Academy of Sciences 112 (8). 2389-2394. https://doi.org/10.1073/pnas.1411678112
  6. Dodds, Peter Sheridan & Christopher M. Danforth. 2010. Measuring the happiness of large-scale written expression: Songs, blogs, and presidents. Journal of Happiness Studies 11. 444-456. https://doi.org/10.48550/arXiv.1703.09774
  7. Dodds, Peter Sheridan, Kameron Decker Harris, Isabel M. Kloumann, Catherine A. Bliss & Christopher M. Danforth. 2011. Temporal patterns of happiness and information in a global social network: Hedonometrics and twitter. PLoS ONE 6 (12). e26752. https://doi.org/10.1371/journal.pone.0026752
  8. Folk, Dunigan & Elizabeth Dunn. 2023. How can people become happier? A systematic review of preregistered experiments. Annual Review of Psychology 75.
  9. Frank, Morgan R., Lewis Mitchell, Peter Sheridan Dodds & Christopher M. Danforth. 2013. Happiness and the patterns of life: A study of geolocated tweets. Scientific Reports 3. 2625. https://doi.org/10.1038/srep02625
  10. Gallagher, Matthew W., Shane J. Lopez & Sarah D. Pressman. 2013. Optimism is universal: Exploring the presence and benefits of optimism in a representative sample of the world. Journal of Personality 81 (5). 429-440. https://doi.org/10.1111/jopy.12026
  11. Gower, Tricia, Kimberly S. Chiew, David Rosenfield & Holly J. Bowen. 2023. Positive biases and psychological functioning during the coronavirus disease 2019 pandemic. Cognition and Emotion 37. 1-9. https://doi.org/10.1080/02699931.2023.2221022
  12. Hills, Thomas T., Eugenio Proto, Daniel Sgroi & Chanuki Illushka Seresinhe. 2019. Historical analysis of national subjective wellbeing using millions of digitized books. Nature Human Behaviour 3 (12). 1271-1275. https://doi.org/10.1038/s41562-019-0750-z
  13. Iliev, Rumen, Joe Hoover, Morteza Dehghani & Robert Axelrod. 2016. Linguistic positivity in historical texts reflects dynamic environmental and psychological factors. Proceedings of the National Academy of Sciences 113 (49). E7871-E7879. https://doi.org/10.1073/pnas.1612058113
  14. Jackson, Joshua Conrad, Joseph Watts, Teague R. Henry, Johann. M. List, Robert Forkel, Peter Mucha, Simon J. Greenhill, Russell D. Gray & Kristen A. Lindquist. 2019. Emotion semantics show both cultural variation and universal structure. Science 366 (6472). 1517-1522. https://doi.org/10.1126/science.aaw8160
  15. Jacobs, Arthur M., Berenike Herrmann, Gerhard Lauer, Jana Lüdtke & S. Schroeder. 2020. Sentiment analysis of children and youth literature: Is there a Pollyanna effect? Frontiers in Psychology 11. https://doi.org/10.3389/fpsyg.2020.574746
  16. Jaidka, Kokil. 2022. Cross-platform-and subgroup-differences in the well-being effects of Twitter, Instagram, and Facebook in the United States. Scientific Reports 12 (1). 3271. https://doi.org/10.1038/s41598-022-07219-y
  17. Ji, Li-Jun, Thomas. I. Vaughan-Johnston, Zhiyong Zhang, Jill A. Jacobson, Ning Zhang & Xiaoye Huang. 2021. Contextual and cultural differences in positive thinking. Journal of Cross-Cultural Psychology 52 (5). 449-467. https://doi.org/10.1177/00220221211020442
  18. Kassinove, Howard & Denis G. Sukhodolsky. 1995. Optimism, pessimism and worry in Russian and American children and adolescents. Journal of Social Behavior & Personality 10 (1). 157-168.
  19. Kay, Paul & Chad K. McDaniel. 1978. The linguistic significance of meanings of basic color terms. Language 54 (3). 610-646. https://doi.org/10.2307/412789
  20. Kirchner-Häusler, Alexander, Michael Boiger, Yukiko Uchida, Yoko Higuchi, A. Uchida & Batja Mesquita. 2022. Relatively happy: The role of the positive-to-negative affect ratio in Japanese and Belgian couples. Journal of Cross-Cultural Psychology 53 (1). 66-86. https://doi.org/10.3389/fpsyg.2020.01048
  21. Kloumann, Isabel M., Christopher M. Danforth, Kameron Decker Harris, Catherine A. Bliss & Peter Sheridan Dodds. 2012. Positivity of the English Language. PLoS ONE 7 (1). e29484. https://doi.org/10.48550/arXiv.1108.5192
  22. Koltsova, Olesya Yu., Svetlana V. Alexeeva & Sergey N. Kolcov. 2016. An opinion word lexicon and a training dataset for Russian sentiment analysis of social media. Computational Linguistics and Intellectual Technologies - Proceedings of the International Conference “Dialog” 277-287.
  23. Kotel’nikov, Evgeniy V., Elena V. Razova, Anastasiya V. Kotelnikova & Sergey V. Vychegzhanin. 2020. Modern sentiment lexicons for opinion mining in English and Russian (analytical survey). Informacionnye Processy i Sistemy 12. 16-33.
  24. Kulagin, Denis I. 2021. Publicly available sentiment dictionary for the Russian language KartaSlovSent. Computational Linguistics and Intellectual Technologies - Proceedings of the International Conference “Dialog” 20. 1106-1119.
  25. Kušen, Ema, Mark Strembeck & Mauro Conti. 2019. Emotional valence shifts and user behavior on Twitter, Facebook, and YouTube. Influence and Behavior Analysis in Social Networks and Social Media. 63-83. https://doi.org/10.1007/978-3-030-02592-2_4
  26. Larina, Tatiana & Douglas Mark Ponton. 2022. I wanted to honour your journal, and you spat in my face: emotive (im) politeness and face in the English and Russian blind peer review. Journal of Politeness Research 18 (1). 201-226.
  27. Liu, Bing. 2012. Sentiment Analysis and Opinion Mining. Springer.
  28. McKee, Gerard T., David D. Malvern & Brian James Richards. 2000. Measuring vocabulary diversity using dedicated software. Literary and Linguistic Computing 15 (3). 323-337. https://doi.org/10.1093/llc/15.3.323
  29. Mitchell, Lewis, Kameron Decker Harris, Morgan R. Frank, Peter Sheridan Dodds & Christopher M. Danforth. 2013. The geography of happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PLoS ONE 8 (5). e64417. https://doi.org/10.48550/arXiv.1302.3299
  30. Mohammad, Saif M. 2018. Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 English words. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 174-184. https://doi.org/10.18653/v1/P18-1017
  31. Osgood, Charles E. 1952. The nature and measurement of meaning. Psychological Bulletin 49. 197-237.
  32. Pang, Bo & Lillian Lee. 2008. Opinion mining and sentiment analysis (English). Foundations and Trends in Information Retrieval 2. 1-135. https://doi.org/10.1561/1500000011
  33. Panou, Despoina. 2013. Equivalence in translation theories: A critical evaluation. Theory and Practice in Language Studies 3 (1). 1-6. https://doi.org/10.4304/tpls.3.1.1-6
  34. Reagan, Andrew J., Christopher M. Danforth, Brian F. Tivnan, Jake Ryland Williams & Peter Sheridan Dodds. 2017. Sentiment analysis methods for understanding large-scale texts: A case for using continuum-scored words and word shift graphs. EPJ Data Science 6. 1-21. https://doi.org/10.1140/epjds/s13688-017-0121-9
  35. Solnyshkina, Marina I., Valery D. Solovyev, Elzara V. Gafiyatova & Ekaterina V. Martynova. 2023. Text complexity as interdisciplinary problem. Voprosy Kognitivnoy Lingvistiki 1. 18-39. https://doi.org/10.20916/1812-3228-2022-1-18-39
  36. Solovyev, Valery, Musa Islamov & Venera Bayrasheva. 2022. Dictionary with the evaluation of positivity/negativity degree of the Russian words. In S. R. Mahadeva Prasanna, Alexey Karpov, K. Samudra Vijaya & Shyam S. Agrawal (eds.), Speech and computer. SPECOM 2022. Lecture notes in computer science, 13721, 651-664. Springer.
  37. Solovyev, Valery, Vladimir Ivanov. 2014. Dictionary-based problem phrase extraction from user reviews. In Petr Sojka, Alex Horák, Ivan Kopeček & Karel Pala (eds.), Text, speech and dialogue. TSD 2014. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in Bioinformatics), LNAI, 8655, 225-232. Springer.
  38. Solovyev, Valery D., Marina I. Solnyshkina & Danielle S. McNamara. 2022. Computational linguistics and discourse complexology: Paradigms and research methods. Russian Journal of Linguistics 26 (2). 275-316. https://doi.org/10.22363/2687-0088-31326
  39. Tausczik, Yla R. & James W. Pennebaker. 2014. The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology 29 (1). 24-54. https://doi.org/10.1177/0261927X09351676
  40. Tetior, Alexander N. 2015. The emotional sphere of a person: The predominance of negative emotions. Eurasian Union of Scientists 2 (11). 78-81.
  41. Warriner, Amy Beth, Victor Kuperman & Marc Brysbaert. 2013. Norms of valence, arousal, and dominance for 13,915 English lemmas. Behavior Research Methods 45. 1191-1207. https://doi.org/10.3758/s13428-012-0314-x
  42. Warriner, Amy Beth & Victor Kuperman. 2015. Affective biases in English are bi-dimensional. Cognition and Emotion 29 (7). 1147-1167. https://doi.org/10.1080/02699931.2014.968098
  43. Whorf, Benjamin Lee. 2012. Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf. In John B. Carroll, Stephen C. Levinson & Penny Lee (eds.). The MIT Press.
  44. Wierzbicka, Anna. 1992. The Russian language. Semantics, Culture and Cognition: Universal Human Concepts in Culture-specific Cofigurations. 395-441. New York: Oxford University Press.
  45. Yoonjung, Choi & Wiebe Janyce. 2014. +/-EffectWordNet: Sense-level lexicon acquisition for opinion inference. Proc. of EMNLP. 1181-1191. https://doi.org/10.3115/v1/D14-1125
  46. Hedonometer (English). Retrieved from https://hedonometer.org/words/labMT-en-v2/ (accessed 18 March 2024).
  47. BRM. Retrieved from https://github.com/meadej/twitter-sentiment-analysis?ysclid=lh0bctge 6l466946169 (accessed 18 March 2024).
  48. ANEW. Retrieved from https://github.com/eriq-augustine/sentiment-data/blob/master/anew.csv (accessed 18 March 2024).
  49. NRC-VAD. Retrieved from https://emilhvitfeldt.github.io/textdata/reference/lexicon_nrc_vad.html (accessed 18 March 2024).
  50. KFU Sentiment. Retrieved from https://kpfu.ru/tehnologiya-sozdaniya-semanticheskih-elektronnyh.html (accessed 18 March 2024).
  51. KFU Sentiment BERT. Retrieved from https://kpfu.ru/tehnologiya-sozdaniya-semanticheskih-elektronnyh.html (accessed 18 March 2024).
  52. KartaSlovSent. Retrieved from https://kartaslov.ru (accessed 18 March 2024).
  53. Hedonometer (Russian). Retrieved from https://hedonometer.org/words/labMT-ru-v2/ (accessed 18 March 2024).
  54. LinisCrowd. Retrieved from http://linis-crowd.org/ (accessed 18 March 2024).

© Соловьев В.Д., Ивлева А.И., 2024

Creative Commons License
Эта статья доступна по лицензии Creative Commons Attribution-NonCommercial 4.0 International License.

Данный сайт использует cookie-файлы

Продолжая использовать наш сайт, вы даете согласие на обработку файлов cookie, которые обеспечивают правильную работу сайта.

О куки-файлах