Pre-editing rules developed for higher-quality target language texts
List of Authors
  • Tsujitsuji, Kayo

Keyword
  • Academic Writing, English, Pre-editing Rules, MT Use, Neural Machine Translation

Abstract
  • Neural machine translation (NMT/MT) has recently developed by leaps and bounds through deeply learning human brain mechanism, and its effective use has been the focus of much attention. To generate high-quality target language (TL) texts, MT users or TL learners need to edit the source language (SL) text before applying NMT. The purpose of this study was to provide pedagogical implications on pre-editing for effective NMT use with Japanese as a SL. The participants were 23 Japanese students with intermediate TL (English) proficiency, and the targeted task was a Japanese (SL) written argumentative essay and the English (TL) output thereof. Three language researchers systematically examined and analysed issues with the NMT output to detect the problematic factors in SL texts causing the issues and developed pre-editing rules. The results demonstrated the following five principles users should be aware of: (1) simplifying statements, (2) clearly stating all necessary sentence elements, (3) noting the logical placement or setting of sentence elements, (4) paying careful attention to the semantic properties of lexical items in SL texts, and (5) considering appropriacy of genre-specific SL lexical items. These rules should be applied to improve the appropriacy of syntactic and semantic aspects of TL texts. They may also assist TL learners translating from Japanese to English independently, since the output of recent NMTs is largely considered close to human translations.

Reference
  • 1. Bahdanau, D., Cho, KH., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. Proceedings of the 3rd International Conference on Learning Representations, USA, 1–15, https://doi.org/10.48550/arXiv.1409.0473
    2. Bounaas, C., Zemni, B., Shehri, A. F., & Mimouna, Z. (2023). Effects of pre-editing operations on audiovisual translation using TRADOS: An experimental analysis of Saudi students’ translations. Texto Livre, 16(2), 1–15. http://doi.org/10.1590/1983-3652.2023.45539
    3. Cheng, Y., Yue, S., Li, J., Deng, L., & Quan, Q. (2021). Errors of machine translation of terminology in the patent text from English into Chinese. ASP Transactions on Computers, 1(1), 12–17. https://www.sciencegate.app/document/10.52810/tc.2021.100022
    4. Farhana, B. C. D., Baharuddin, W. A. L., & Farmasari, S. (2023). Academic text quality improvement by English department students of University of Mataram: A study on pre-editing of Google neural machine translation. Jurnal Ilmiah Profesi Pendidikan, 8(1), 247– 254. http://doi.org/10.29303/jipp.v8i1.1186
    5. Feifei, F., Rong, C., & Xiao, W. (2022). A study of pre-editing methods at the lexical level in the process of machine translation. Arab World English Journal for Translation & Literary Studies, 6(2), 54–69. https://doi.org/10.31235/osf.io/3yrej
    6. Fujii, R., Mita, M., Abe, K., Hanawa, K., Morishita, M., Suzuki, J., & Inui, K. (2021). Phenomenon-wise evaluation dataset towards analyzing robustness of machine translation models. Natural Language Processing, 28(2), 450–478. https://doi.org/10.5715/jnlp.28.450
    7. Hiraoka, Y., & Yamada, M. (2019). Pre-editing plus neural machine translation for subtitling: Effective pre-editing rules for subtitling of TED talks. Proceedings of Machine Translation Summit XVII: Translator, Project and User Tracks, Ireland, 2, 64–72. https://www.aclweb.org/anthology/W19-6710.pdf
    8. Hyland, K., & Jiang, K. F. (2017). Is academic writing becoming more informal? English for Specific Purposes, 45, 40–51.
    9. Iwasaki, C. (2021). Analysis of issues expressed in first-year students’ reports and the contribution of the writing center. Kansai University journal of higher education, 12, 25–35. https://cir.nii.ac.jp/crid/1390009225309702528
    10. Kim, Y., Tran. D. T., & Ney, H. (2019). When and why is document-level context useful in neural machine translation? Proceedings of the Fourth Workshop on Discourse in Machine Translation, China, 24–34. https://doi.org/10.18653/v1/D19-6503
    11. Kokanova, E. S., Berendyaev, M. V., & Kulikov, N. Y. (2022). Pre-editing English news texts for machine translation into Russian. Language Studies and Modern Humanities, 4(1), 25–30. https://doi.org/10.33910/2686-830X-2022-4-1-25-30
    12. Miculicich, L., Ram, D., Pappas N., & Henderson, J. (2018). Document-level neural machine translation with hierarchical attention networks. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Belgium, 2947–2954. http://doi.org/10.18653/v1/D18-1325
    13. Mikami, A. (1975). Mikami Akira ronbunsyū [Collected academic papers written by Akira Mikami]. Kurosio Publishers.
    14. Miyata, R., & Fujita, A. (2021). Understanding pre-editing for black-box neural machine translation. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 1539–1550. https://doi.org/10.48550/arXiv.2102.02955
    15. Oshima, Y. (2010). Types of problems in university students’ essays: Toward a framework to promote collaborative learning to improve academic writing skills. Kyoto University Researches in Higher Education, 16, 25–36. https://cir.nii.ac.jp/crid/1050001335709671040
    16. Pym, P. J. (1990). Pre-editing and the use of simplified writing for MT: An engineer’s experience of operating an MT system. In P., Mayorcas (Eds.), Translating and the computer 10. (pp. 80– 96). Aslib.
    17. Sánchez-Gijón, P., & Kenny, D. (2022). Selecting and preparing texts for machine translation: Preediting and writing for a global audience. In D. Kenny (Eds.), Machine translation for everyone: Empowering users in the age of artificial intelligence (pp. 81–104). Language Science Press.
    18. Seretan, V., Bouillon, P., & Gerlach, J. (2014). A large-scale evaluation of pre-editing strategies for improving user-generated content translation. Proceedings of the Ninth International Conference on Language Resources and Evaluation, Iceland, 1793–1799. https://aclanthology.org/L14-1532/
    19. Shih, C. (2021). How to empower machine-translation-to-web pre-editing from the perspective of Grice’s cooperative maxims. Theory and Practice in Language Studies, 11(12), 1554–1561. https://doi.org/10.17507/tpls.1112.07
    20. Simonova, V., & Patiniotaki, E. (2022). Pre-editing for the translation of life-science texts from Russian into English via Google Translate. Proceedings of New Trends in Translation and Technology 2022, Greece, 259–265. https://www.researchgate.net/profile/AbdelalahAlsolami/publication/371681915
    21. Sutskever, I., Vinyals, O., & Le, V. Q. (2014). Sequence to sequence learning with neural networks. Proceedings of the 27th International Conference on Neural Information Processing Systems, Canada, 2, 3104–3112. https://doi.org/10.48550/arXiv.1409.3215
    22. Taufik, A. (2020). Pre-editing of Google neural machine translation. Journal of English Language & Culture, 10(2), 64–74. http://doi.org/10.30813/jelc.v10i2.2137
    23. Tsuji, K. (2021). Developing and evaluating a scoring rubric for argumentative essays: A module based approach. Urban Scope, 12, 1–13. https://urbanscope.lit.osakacu.ac.jp/journal/pdf/vol012/01-tsuji.pdf
    24. Tsuji, K. (2022). Bogo parafureizingu no kyōikuteki kōka ni kansuru chōsa: Kikaihonyaku no seigengengo ni chakumoku shite [The effect of L1 paraphrasing on L2 writing: Focusing on pre-editing activity for machine translation], Studies in the Humanities, 73, 33–49. https://doi.org/10.24544/ocu.20220416-008
    25. Tsuji, K. (2024). Identifying MT errors for higher-quality target language writing. International Journal of Translation, Interpretation, and Applied Linguistics, 6(1), 1–17. http://doi.org/10.4018/IJTIAL.335899
    26. Tuzcu, A. (2021). The impact of Google Translate on creativity in writing activities. Language Education and Technology, 1(1), 40–52. https://langedutech.com/letjournal/index.php/let/article/view/18/5
    27. Zheng, Y., Peng, C., & Mu, Y. (2022). Designing controlled Chinese rules for MT pre-editing of product description text. International Journal of Translation, Interpretation, and Applied Linguistics, 4(2), 1–13. http://doi.org/10.4018/IJTIAL.313919
    28. Zhivotova, A. A., Berdonosov, V. D., & Redkolis, E. V. (2020). Improving the quality of scientific articles machine translation while writing original text. Proceedings of 2020 International Multi-Conference on Industrial Engineering and Modern Technologies, Russia, 1783–1786. https://doi.org/10.1109/FarEastCon50210.2020.9271