In this paper, we make an attempt to improve the textual fit of English-to-Polish translation of a peculiar type of multi-word units known in corpus linguistic literature as lexical bundles (Biber et al. 1999). Inspired by a study conducted by Grabar and Lefer (2015), we used the English-Polish parallel corpus Paralela (Pęzik 2016) and the National Corpus of Polish (NKJP) to extract and explore the use - in terms of frequency distributions - of the Polish equivalents of selected English lexical bundles expressing attitudinal and epistemic stance. More precisely, we used the NKJP corpus to check whether the Polish equivalents are typical of contemporary Polish as found in native texts. The results of this corpus-informed study revealed a high number of Polish equivalents, both single- and multi-word units, expressing stance. Also, the results showed that the majority of Polish equivalents are frequently used in native Polish texts and therefore they can potentially help enhance the textual fit of translations. Finally, we discussed limitations of the methods and corpora used in this preliminary study and presented suggestions on how it can be pursued further in the future to better explore the usefulness of lexical bundles for translation teaching and translation practice. To that end, we also presented proposals of in-class translation activities.

About the authors


University of Opole

Associate Professor at the Institute of English, University of Opole (Poland). His research interests include corpus linguistics, phraseology, formulaic language, translation studies and lexicography. He is also interested in computer-assisted methods of text analysis. He has published research articles and book chapters internationally in International Journal of Corpus Linguistics and English for Specific Purposes as well as with John Benjamins and Emerald, among others. He is also Managing Editor of the journal Explorations: A Journal of Language and Literature 11a pl. Kopernika, 45-040 Opole, Poland


  1. Appel, R. & Wood, D. (2016). “Recurrent Word Combinations in EAP Test-Taker Writing: Differences between High- and Low-Proficiency Levels”. Language Assessment Quarterly, 13 (1): 55-71.
  2. Barreiro, A. & Batista, F. (2016). “Machine Translation of Non-Contiguous Multiword Units”. Proceedings of DiscoNLP 2016, 22-30. Available: (accessed in October 2017).
  3. Berūkštienė, D. (2017). “A corpus-driven analysis of structural types of lexical bundles in court judgments in English and their translation into Lithuanian” Kalbotyra, 70: 7-31.
  4. Biber, D. (2006). University Language. A corpus-based study of spoken and written registers. Amsterdam: John Benjamins.
  5. Biber, D. (2009). “A corpus-driven approach to formulaic language in English: multi-word patterns in speech and writing”. International Journal of Corpus Linguistics, 14(3): 275-311.
  6. Biber, D., S. Johansson, G. Leech, S. Conrad & Finegan, E. (1999). The Longman Grammar of Spoken and Written English. London: Longman.
  7. Biber, D., Conrad, S. & Cortes, V. (2004). “If you look at...: Lexical bundles in university teaching and textbooks”. Applied Linguistics, 25(3), 371-405.
  8. Biber, D. & Conrad, S. (2009). Register, genre and style. Cambridge: Cambridge University Press.
  9. Biel, Ł. (2011). “Professional Realism in the Legal Translation Classroom: Translation Competence and Translator Competence”. Meta, 56(1), 162-178.
  10. Biel, Ł. (2014a). “The textual fit of translated EU law: a corpus-based study of deontic modality”. The Translator, 20 (3): 332-355.
  11. Biel, Ł. (2014b). Lost in the Eurofog. The Textual Fit of Translated Law. Frankfurt: Peter Lang.
  12. Biel, Ł. (2017). “Lexical bundles in EU law: the impact of translation process on the patterning of legal language”. In” S. Goźdz-Roszkowski & G. Pontrandolfo (Eds), Phraseology in legal and institutional settings. A corpus-based interdisciplinary perspective. London/New York: Routledge, 10-26.
  13. Bouamor, D., Semmar, N., Zweigenbaum, P. (2012). “Identifying Bilingual Multi-Word Expressions for Statistical Machine Translation”. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC12), 674-679. Available at: (accessed in December 2017).
  14. Breeze, R. (2013). “Lexical bundles across four legal genres”. International Journal of Corpus Linguistics 18 (2): 229-253.
  15. Catford, J. (1965). A Linguistic Theory of Translation. London: Oxford University Press.
  16. Chesterman, A. (2004). Hypothesis about translation universals. In: G. Hansen, K. Malmkjaer & D. Gile (Eds), Claims, Changes and Challenges in Translation Studies. Amsterdam: John Benjamins, 1-13.
  17. Forchini, P. & Murphy, A. (2008). N-grams in comparable specialized corpora. Perspectives on phraseology, translation and pedagogy. International Journal of Corpus Linguistics, 13(3), 351-367.
  18. Fuster-Marquez, M. (2017). “The Discourse of US Hotel Websites: Variation through the Interruptibility of Lexical Bundles”. In: M. Gotti, S. Maci and M. Sala (Eds), Ways of Seeing, Ways of Being: Representing the Voices of Tourism. Frankfurt am Main: Peter Lang Verlag, 400-420.
  19. Goźdź-Roszkowski, S. (2011). Patterns of Linguistic Variation in American Legal English. A Corpus-Based Study. Frankfurt am Main: Peter Lang Verlag.
  20. Górski, R. (2012). ”Zastosowanie korpusów w badaniu gramatyki”. In: A. Przepiórkowski, M. Bańko, R. Górski & B. Lewandowska-Tomaszczyk (Eds), Narodowy Korpus Języka Polskiego. Warszawa: Wydawnictwo Naukowe PWN, 291-300.
  21. Grabar, N. & Lefer, M-A. (2015). “Building a lexical bundle resource for CAT and MT”. Presentation delivered at Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT2015) of EUROPHRAS 2015. 29 Jun - 1 Jul 2015, Malaga, Spain. Available at: (accessed in October 2017).
  22. Grabowski, Ł. (2014). “On Lexical Bundles in Polish Patient Information Leaflets: A Corpus-Driven Study”. Studies in Polish Linguistics, 19 (1): 21-43.
  23. Grabowski, Ł. (2015). “Keywords and lexical bundles within English pharmaceutical discourse: a corpus-driven description”. English for Specific Purposes, 38: 23-33.
  24. Grabowski, Ł. (2018a, forth.). “On identification of bilingual lexical bundles for translation purposes. The case of an English-Polish comparable corpus of patient information leaflets”. In: R. Mitkov, J. Monti, G. Corpas Pastor and V. Seretan (Eds), Multiword Units in Machine Translation and Translation Technology [Current Issues in Linguistic Theory 341], Amsterdam: John Benjamins, pp. 181-200.
  25. Grabowski, Ł. (2018b). “Fine-tuning lexical bundles: A methodological reflection in the context of describing drug-drug interactions”. In: J. Kopaczyk & J. Tyrkkö (Eds), Applications of Pattern-driven Methods in Corpus Linguistics. Amsterdam: John Benjamins, pp. 57-80.
  26. Granger, S. (2014). A lexical bundle approach to comparing languages. Stems in English and French. In: M-A. Lefer & S. Vogeleer (Eds.), Genre- and register-related discourse features in contrast. Special issue of Languages in Contrast, 14(1), 58-72.
  27. Gray, B. & Biber, D. (2013). Lexical frames in academic prose and conversation. International Journal of Corpus Linguistics, 18(1), 109-135.
  28. Hoey, M. (2005). Lexical Priming: A New Theory of Words and Language. London: Routledge.
  29. Hoey, M. (2007). Lexical priming and literary creativity. In M. Hoey, M. Mahlberg, M. Stubbs & W. Teubert (Eds.), Text, Discourse and Corpora. London: Continuum, 7-30.
  30. Hyland, K. (2008). “As can be seen: Lexical bundles and disciplinary variation”. English for Specific Purposes 27: 4-21.
  31. Jalali, H. (2017). “Reflection of stance through it bundles in applied linguistics”. Ampersand 4, 30-39. (accessed in October 2017).
  32. Jukneviciene, R. (2009). “Lexical bundles in learner language: Lithuanian learners vs. native speakers”. Kalbotyra 61 (3): 61-72.
  33. Jukneviciene, R. (2017). English phraseology and corpora: An introduction to corpus-based and corpus driven phraseology. Vilnius: Vilniaus Univesiteto Leidykla.
  34. Koehn, P. (2005). “Europarl: A Parallel Corpus for Statistical Machine Translation”. In: Conference Proceedings: the tenth Machine Translation Summit, Phuket, Thailand: AAMT, 79-86.
  35. Kornacki, M. (2017). Computer-assisted translation (CAT) tools in the translator training process. Unpublished PhD dissertation. University of Łódź.
  36. Kranich, S. (2016). Contrastive Pragmatics and Translation: Evaluation, epistemic modality and communicative styles in English and German. Amsterdam: John Benjamins.
  37. Laviosa, S. (2002). Corpus-based translation studies: theory, findings, applications. Amsterdam: Rodopi.
  38. Lee, D. (2008). “Corpora and discourse analysis”. In V. Bhatia, J. Flowerdew & R. Jones (Eds.), Advances in Discourse Studies. London: Routledge, 86-99.
  39. Lewandowska-Tomaszczyk, B. (2012). “Cognitive Corpus Studies: A New Qualitative & Quantitative Agenda for Contrasting Languages”. MFU Connexion: Journal of Humanities and Social Sciences, 1 (1): 26-64. Available at: f2601120e1c60067d1328094376a8c8d?Resolve_DOI=10.14456/connexion.2012.2 (accessed in February 2018).
  40. Martinez, R. & Schmitt, N. (2012). “A Phrasal Expression List”. Applied Linguistics, 33(3): 299-320.
  41. McVeigh, J. (2018). “Join us for this. Lexical bundles and repetition in email marketing texts”. In: J. Kopaczyk & J. Tyrkko (Eds), Applications of Pattern-driven Methods in Corpus Linguistics. Amsterdam: John Benjamins, 213-250.
  42. Mindt, I. (2011). Adjective Complementation: An Empirical Analysis of Adjectives Followed by That-clauses. Amsterdam: John Benjamins.
  43. Novita, H. & Kwary, D. (2018). “Comparing the use of lexical bundles in Indonesian-English translation by student translators and professional translators”. Translation & Interpreting 10 (1): 53-74.
  44. Oksefjell Ebeling, S. & Ebeling, J. (2017). “A Cross-Linguistic Comparison of recurrent word combinations in a comparable corpus of English and Norwegian Fiction”. In: M. Janebova, E. Lapshinova-Koltunski & M. Martinkova (Eds), Contrasting English and Other Language through Corpora. Newcastle upon Tyne: Cambridge Scholars Publishing, 2-31.
  45. Pérez-Paredes, P. (2010). “The death of the adverb revisited: attested uses of adverbs in native and non-native comparable corpora of spoken English”. In: M. Moreno Jaén, F. Serrano Valverde, & M. Calzada Pérez (Eds), Exploring new paths in language pedagogy. Lexis and corpus-based language teaching. London: Equinox, 157-172.
  46. Pęzik, P. (2012). “Język mówiony w NKJP. (Spoken Language in NKJP)” In: A. Przepiórkowski, M. Bańko, R. Górski & B. Lewandowska-Tomaszczyk (Eds), Narodowy Korpus Języka Polskiego (National Corpus of Polish), Warszawa: Wydawnictwo Naukowe PWN, 37-47.
  47. Pęzik, P. (2016). “Exploring phraseological equivalence with Paralela”. In: E. Gruszczyńska & A. Leńko-Szymańska (Eds), Polish-Language Parallel Corpora. Warszawa: Instytut Lingwistyki Stosowanej UW, 67-81.
  48. Piotrowski, T. (1994). Z zagadnień leksykografii (Problems in lexicography). Warszawa: Wydaw­nictwo Naukowe PWN.
  49. Piotrowski, T. (2011). “Ekwiwalencja w słownikach dwujęzycznych (Equivalence in bilingual dictionaries)”. In: W. Chlebda (Ed.), Na tropach translatów. W poszukiwaniu odpowiedników przekładowych (Searching for translation equivalents). Opole: Wydawnictwo Uniwersytetu Opolskiego, 89-114.
  50. Przepiórkowski, A., Bańko, M., Górski, R. & Lewandowska-Tomaszczyk, B. (Eds) (2012). Narodowy Korpus Języka Polskiego (National Corpus of Polish). Warszawa: Wydawnictwo Naukowe PWN.
  51. Sag, I., Baldwin, T., Bond, F., Copestake, A., & Flickinger D. (2002). Multiword Expressions: A Pain in the Neck for NLP. Computational Linguistics and Intelligent Text Processing: Third International Conference (CICLing 2002), 1-15. Available at: WP-2001-03.pdf (accessed May 2013).
  52. Salazar, D. (2011). Lexical bundles in scientific English: A corpus-based study of native and non-native writing. Unpublished PhD dissertation. University of Barcelona.
  53. Salazar, D. (2014). Lexical Bundles in Native and Non-native Scientific Writing. Amsterdam: John Benjamins.
  54. Simpson-Vlach, R. & Ellis, N. (2010). “An Academic Formulas List: New Methods in Phraseology Research”. Applied Linguistics 31(4): 487-512.
  55. Skadina, I. (2016). “Multi-Word Expressions in English-Latvian SMT: Problems and Solutions. In: I. Skadina & R. Rozis (Eds), Human Language Technologies - The Baltic Perspective: Proceedings of the Seventh International Conference Baltic HLT 2016. Amsterdam: IOS Press. 97-106.
  56. Staples, S., Egbert, J. & Biber, D. & McClair, A. (2013). “Formulaic sequences and EAP writing development: Lexical bundles in the TOEFL iBT writing section”. Journal of English for Academic Purposes. 12. 214-225.
  57. Toury, G. (1995). Descriptive Translation Studies and Beyond. Amsterdam/Philadelphia: John Benjamins Publishing Company.
  58. Toury, G. (2001). -The Nature and Role of Norms in Translation. In: L. Venuti (ed.), The Translation Studies Reader, London: Routledge, 198-211.

Copyright (c) 2018 GRABOWSKI Ł.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies