ASTREC

NICT

Advanced Translation Technology Laboratory

【Journal Papers】

  • Rubino Raphael, Marie Benjamin, Dabre Raj, Fujita Atsushi, Uchiyama Masao, and Sumita Eiichiro. Extremely low-resource neural machine translation for Asian languages. Machine Translation Special Issue: Machine Translation for Low-Resource Languages, Vol.32, No.4, pp.347-382, December, 2020.
  • Marie Benjamin and Atsushi Fujita. Synthesizing Parallel Data of User-Generated Texts with Zero-Shot Neural Machine Translation. Transactions of the Association for Computational Linguistics (TACL), Vol.8, pp.710-725, November 14, 2020.
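  • Kenji Imamura. Improving Non-Autoregressive Decoding for Neural Machine Translation [非自己回帰デコーディング型ニューラル機械翻訳の改善]. Japan Patent Information Organization, Japio YEAR BOOK 2020, pp.296-299, November 2, 2020.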
  • Chunpeng Ma, Akihiro Tamura, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao. Encoder-Decoder Attention ≠ Word Alignment: Axiomatic Method of Learning Word Alignments for Neural Machine Translation. Journal of Natural Language Processing, Vol.27, No.3, pp.531-552, September 15, 2020.
  • Chunpeng Ma, Akihiro Tamura, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao. Syntax-based Transformer for Neural Machine Translation. Journal of Natural Language Processing, Vol.27, No.2, pp.445-466, June 15, 2020.
  • Marie Benjamin and Atsushi Fujita. Iterative Training of Unsupervised Neural and Statistical Machine Translation Systems. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), ACM Journals, Vol.19, No.5, June 1, 2020.
  • Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao. Unsupervised Neural Machine Translation with Cross-lingual Language Representation Agreement. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
  • Zuchao Li, Chaoyu Guan, Hai Zhao, Rui Wang, Kevin Parnow, and Zhuosheng Zhang. Memory Network for Linguistic Structure Parsing. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
  • Mingming Yang, Rui Wang, Kehai Chen, Xing Wang, Tiejun Zhao, and Min Zhang. A Novel Sentence-Level Agreement Architecture for Neural Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020
  • Haipeng Sun, Rui Wang, Masao Utiyama, Benjamin Marie, Kehai Chen, Eiichiro Sumita, and Tiejun Zhao. Unsupervised Neural Machine Translation for Similar and Distant Language Pairs: An Empirical Study. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2020
  • Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao, Muyun Yang, and Hai Zhao. Towards More Diverse Input Representation for Neural Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), vol. 28, pp. 1586-1597, December 2020
  • Raj Dabre, Chenhui Chu, A Kunchukuttan. A survey of multilingual neural machine translation. ACM Computing Surveys (CSUR), Vol. 53, No. 5, pp. 1-38, September, 2020
  • Shohei Higashiyama, Masao Utiyama, Eiichiro Sumita, Masao Ideuchi, Yoshiaki Oida, Yohei Sakamoto, Isaac Okada, Yuji Matsumoto. Character-to-Word Attention for Word Segmentation. Journal of Natural Language Processing, Vol. 27, No. 3, pages 573-598, September 2020.
  • Shohei Higashiyama, Masao Utiyama, Yuji Matsumoto, Taro Watanabe, Eiichiro Sumita. Auxiliary Lexicon Word Prediction for Cross-Domain Word Segmentation. Journal of Natural Language Processing, Vol. 27, No. 3, pages 499-530, September 2020.
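  • Kenji Imamura, Atsushi Fujita, Eiichiro Sumita. Neural Machine Translation Using Multiple Back-Translations Based on Sampling Generation [サンプリング生成に基づく複数逆翻訳を用いたニューラル機械翻訳]. Transactions of the Japanese Society for Artificial Intelligence, Vol. 35, No. 3, pp. A-JA9_1-9, May 2020.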
  • Chenchen Ding, Sann Su Su Yee, Win Pa Pa, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita. A Burmese (Myanmar) Treebank: Guideline and Analysis. ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 19 Issue 3, Article No. 40, 2020.

【International Conferences and Workshops】

  • Rubino Raphael and Eiichiro Sumita. Intermediate Self-supervised Learning for Machine Translation Quality Estimation. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), Barcelona, Spain, pp.4355-4560, December 8-13, 2020.
  • Diptesh Kanojia, Dabre Raj, Shubham Dewangan, Pushpak Bhattacharyya, Gholamreza Haffari, and Malhar Kulkarni. Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), Barcelona, Spain, pp.1384-1395, December 8-13, 2020.
  • Dabre Raj and Chakrabarty Abhisek. NICT's Submission To WAT 2020: How Effective Are Simple Many-To-Many Neural Machine Translation Models? In Proceedings of the 7th Workshop on Asian Translation (WAT), Suzhou, China, pp.98-102, December 4, 2020.
  • Kenji Imamura and Eiichiro Sumita. Transformer-based Double-token Bidirectional Autoregressive Decoding in Neural Machine Translation. In Proceedings of the 7th Workshop on Asian Translation (WAT), Suzhou, China, pp.50-57, December 4, 2020.
  • Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi. Overview of the 7th Workshop on Asian Translation. In Proceedings of the 7th Workshop on Asian Translation (WAT), Suzhou, China, pp.1-44, December 4, 2020.
  • Marie Benjamin, Rubino Raphael and Atsushi Fujita. Combination of Neural Machine Translation Systems at WMT20. In Proceedings of the Fifth Conference on Machine Translation (WMT20), pp.229-237, November 19-20, 2020.
  • Dabre Raj and Atsushi Fujita. Combining Sequence Distillation and Transfer Learning for Efficient Low-Resource Neural Machine Translation Models. In Proceedings of the Fifth Conference on Machine Translation (WMT20), pp.490-500, November 19-20, 2020.
  • Rubino Raphael. NICT Kyoto Submission for the WMT’20 Quality Estimation Task: Intermediate Training for Domain and Task Adaptation. In Proceedings of the Fifth Conference on Machine Translation (WMT20), November 19-20, 2020.
  • Haipeng Sun, Rui Wang, Kehai Chen, Xugang Lu, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao. Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training. The 28th International Conference on Computational Linguistics (COLING-2020), Barcelona, Spain
  • Abhisek Chakrabarty, Raj Dabre, Chenchen Ding, Masao Utiyama, and Eiichiro Sumita. Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation. COLING, 2020.
  • Zhenyu Zhao, Shuangzhi Wu, Muyun Yang, Kehai Chen, Tiejun Zhao. Robust Machine Reading Comprehension by Learning Soft labels. The 28th International Conference on Computational Linguistics (COLING-2020), Barcelona, Spain, December, 2020
  • Zuchao Li, Hai Zhao, Rui Wang and Kevin Parnow. High-order Semantic Role Labeling. The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
  • Zuchao Li, Hai Zhao, Rui Wang, Masao Utiyama and Eiichiro Sumita. Reference Language based Unsupervised Neural Machine Translation. The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
  • Aye Thida, Nway Nway Han, Sheinn Thawtar Oo, Sheng Li, and Chenchen Ding. VOIS: The First Speech Therapy App in the World for Myanmar Hearing-Impaired Children. O-COCOSDA, 2020.
  • Masaru Yamada, Mayuka Yamamoto, Nanami Onishi, Atsushi Fujita, Rei Miyata, and Kyo Kageura. Metalanguage for the Translation Process. In Proceedings of the 5th Conference on Translation in Transition: Human and Machine Intelligence (TT5), 46-51, Oct., 2020.
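  • Atsushi Fujita. How Machine Translation Works and How It Differs from Translation [機械翻訳のしくみ,翻訳との違い]. The 42nd Academic Conference of the Japanese Language Association of Korea, Sep., 2020.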
  • Zuchao Li, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao. Data-dependent Gaussian Prior Objective for Language Generation. International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
  • Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, and Hai Zhao. Neural Machine Translation with Universal Visual Representation. International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
  • Zuchao Li, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao. Explicit Sentence Compression for Neural Machine Translation. Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
  • Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao, and Rui Wang. SG-Net: Syntax-Guided Machine Reading Comprehension. Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
  • Xuancai Li, Kehai Chen, Tiejun Zhao, and Muyun Yang. End-to-End Speech Translation with Adversarial Training. Proceedings of the First Workshop on Automatic Simultaneous Translation, Seattle, USA, July 2020
  • Raj Dabre, Raphael Rubino, Atsushi Fujita. Balancing Cost and Benefit with Tied-Multi Transformers. Proceedings of the Fourth Workshop on Neural Generation and Translation, 24–34, July, 2020
  • Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi, Eiichiro Sumita. Pre-training via Leveraging Assisting Languages for Neural Machine Translation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 279–285, July, 2020
  • Xintong Li, Lemao Liu, Rui Wang, Guoping Huang, and Max Meng. Regularized Context Gates on Transformer for Machine Translation. The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA
  • Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao. Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation. The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA
  • Marie, B., Rubino, R., Fujita, A. (2020). Tagged Back-translation Revisited: Why Does It Really Work?. In ACL 2020.
  • Chenchen Ding, Masao Utiyama, and Eiichiro Sumita. A Three-Parameter Rank-Frequency Relation in Natural Languages. In Proc. of ACL, pp. 460-464, 2020.
  • Kehai Chen, Rui Wang, Masao Utiyama, and Eiichiro Sumita. Content Word Aware Neural Machine Translation. The 58th Annual Meeting of the Association for Computational Linguistics (ACL), Seattle, USA, July 2020
  • Sheng Li, X Lu, Raj Dabre, Peng Shen, Hisashi Kawai. Joint Training End-to-End Speech Recognition Systems with Speaker Attributes. Proceedings of Odyssey 2020: The Speaker and Language Recognition Workshop, 385-390, May, 2020
  • Yuqin Lin, Longbiao Wang, Sheng Li, Jianwu Dang, and Chenchen Ding. Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription. In Proc. of INTERSPEECH, pp. 4791-4795, 2020.
  • Hao Shi, Longbiao Wang, Sheng Li, Chenchen Ding, Meng Ge, Nan Li, Jianwu Dang, and Hiroshi Seki. Singing Voice Extraction with Attention based Spectrograms Fusion. In Proc. of INTERSPEECH, pp. 2412-2416, 2020.
  • Zhuoyuan Mao, F Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi. JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation. Proceedings of the 12th Language Resources and Evaluation Conference, 3683–3691, May, 2020
  • Haiyue Song, Raj Dabre, Atsushi Fujita, Sadao Kurohashi. Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation. Proceedings of the 12th Language Resources and Evaluation Conference, 3640–3649, May, 2020
  • Aye Myat Mon, Chenchen Ding, Hour Kaing, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita. A Myanmar (Burmese)-English Named Entity Transliteration Dictionary. In Proc. of LREC, pp. 2973-2976, 2020.
  • Yuqin Lin, Longbiao Wang, Jianwu Dang, Sheng Li, and Chenchen Ding. End-to-End Articulatory Modeling for Dysarthria Articulatory Attribute Detection. In Proc. of ICASSP, pp. 7349-7353, 2020.

【Domestic Conferences and Workshops】

  • Benjamin Marie and Atsushi Fujita. Questioning the Use of Bilingual Lexicon Induction as an Evaluation Task for Bilingual Word Embeddings. Proceedings of the 26th Annual Meeting of the Association for Natural Language Processing, P5-14, pp. 1225-1228, Mar., 2020.
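  • Atsushi Fujita. Problems Caused by Missing Information That Should Be Referenced During Translation: A Case Study of English-to-Japanese Machine Translation and Post-Editing of News Articles [翻訳時に参照すべき情報が欠けることで生じる問題: ニュース記事の英日機械翻訳・ポストエディットを例題に]. Proceedings of the 26th Annual Meeting of the Association for Natural Language Processing, G2-6, pp. 537-540, Mar., 2020.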
  • Haiyue Song, Raj Dabre, Atsushi Fujita, and Sadao Kurohashi. Domain Adaptation of Neural Machine Translation through Multistage Fine-Tuning. Proceedings of the 26th Annual Meeting of the Association for Natural Language Processing, D2-3, pp. 461-464, Mar., 2020.

【Preprints】

  • Duan Chaoqun, Chen Kehai, Wang Rui, Masao Utiyama, Eiichiro Sumita, Conghui Zhu, and Tiejun Zhao. Modeling Future Cost for Neural Machine Translation. arXiv:2002.12558v1 [cs.CL], February 28, 2020.
  • Raj Dabre, Atsushi Fujita. Softmax Tempering for Training Neural Machine Translation Models. arXiv preprint arXiv:2009.09372, September, 2020
  • Kehai Chen, Rui Wang, Masao Utiyama, and Eiichiro Sumita. Explicit Reordering for Neural Machine Translation. arXiv preprint arXiv:2004.03818, April 8, 2020

【Commentary and Other Publications】

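  • Benjamin Marie, Raphael Rubino, Atsushi Fujita. The Effect of Tagging Back-Translations in Neural Machine Translation [ニューラル機械翻訳における逆翻訳へのタグ付与の効果]. Journal of Natural Language Processing, Vol. 27, No. 3, pp. 689-694, Sep., 2020.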
  • Kenji Imamura. Language Translation. Yutaka Kidawara, Eiichiro Sumita, and Hisashi Kawai editors. Speech-to-Speech Translation, Chapter 4, Springer Singapore, ISBN:978-981-15-0594-2.
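  • Atsushi Fujita, Masaru Yamada, Kyo Kageura. Translation and Machine Translation: Insights from the Annual Meeting Theme Sessions [翻訳と機械翻訳: 年次大会のテーマセッションを通じての知見]. Journal of Natural Language Processing, Vol.27, No.4, pp.975-981, December 15, 2020.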