Phrase-Based & Neural Unsupervised Machine Translation.
Guillaume Lample, Myle Ott, Alexis Conneau, Ludovic Denoyer, Marc'Aurelio Ranzato: Phrase-Based & Neural Unsupervised Machine Translation. EMNLP 2018: 5039-5049
View ArticleUnderstanding Back-Translation at Scale.
Sergey Edunov, Myle Ott, Michael Auli, David Grangier: Understanding Back-Translation at Scale. EMNLP 2018: 489-500
View ArticleHow Decoding Strategies Affect the Verifiability of Generated Text.
Luca Massarelli, Fabio Petroni, Aleksandra Piktus, Myle Ott, Tim Rocktäschel, Vassilis Plachouras, Fabrizio Silvestri, Sebastian Riedel: How Decoding Strategies Affect the Verifiability of Generated...
View ArticleUnsupervised Cross-lingual Representation Learning at Scale.
Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov: Unsupervised Cross-lingual...
View ArticleMix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for...
Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng: Mix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for Neural Language Generation Models. CoRR...
View ArticleFacebook AI's WAT19 Myanmar-English Translation Task Submission.
Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato: Facebook AI's WAT19 Myanmar-English Translation Task Submission. CoRR...
View ArticleThe Source-Target Domain Mismatch Problem in Machine Translation.
Jiajun Shen, Peng-Jen Chen, Matt Le, Junxian He, Jiatao Gu, Myle Ott, Michael Auli, Marc'Aurelio Ranzato: The Source-Target Domain Mismatch Problem in Machine Translation. CoRR abs/1909.13151 (2019)
View ArticleOn The Evaluation of Machine Translation Systems Trained With Back-Translation.
Sergey Edunov, Myle Ott, Marc'Aurelio Ranzato, Michael Auli: On The Evaluation of Machine Translation Systems Trained With Back-Translation. CoRR abs/1908.05204 (2019)
View ArticleRoBERTa: A Robustly Optimized BERT Pretraining Approach.
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov: RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR...
View ArticleFacebook FAIR's WMT19 News Translation Task Submission.
Nathan Ng, Kyra Yee, Alexei Baevski, Myle Ott, Michael Auli, Sergey Edunov: Facebook FAIR's WMT19 News Translation Task Submission. CoRR abs/1907.06616 (2019)
View ArticleReal or Fake? Learning to Discriminate Machine from Human Generated Text.
Anton Bakhtin, Sam Gross, Myle Ott, Yuntian Deng, Marc'Aurelio Ranzato, Arthur Szlam: Real or Fake? Learning to Discriminate Machine from Human Generated Text. CoRR abs/1906.03351 (2019)
View Articlefairseq: A Fast, Extensible Toolkit for Sequence Modeling.
Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, Michael Auli: fairseq: A Fast, Extensible Toolkit for Sequence Modeling. CoRR abs/1904.01038 (2019)
View ArticleMixture Models for Diverse Machine Translation: Tricks of the Trade.
Tianxiao Shen, Myle Ott, Michael Auli, Marc'Aurelio Ranzato: Mixture Models for Diverse Machine Translation: Tricks of the Trade. CoRR abs/1902.07816 (2019)
View ArticleTwo New Evaluation Datasets for Low-Resource Machine Translation:...
Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato: Two New Evaluation Datasets for Low-Resource Machine Translation:...
View ArticleFacebook FAIR's WMT19 News Translation Task Submission.
Nathan Ng, Kyra Yee, Alexei Baevski, Myle Ott, Michael Auli, Sergey Edunov: Facebook FAIR's WMT19 News Translation Task Submission. WMT (2) 2019: 314-319
View Articlefairseq: A Fast, Extensible Toolkit for Sequence Modeling.
Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, Michael Auli: fairseq: A Fast, Extensible Toolkit for Sequence Modeling. NAACL-HLT (Demonstrations) 2019: 48-53
View ArticleMixture Models for Diverse Machine Translation: Tricks of the Trade.
Tianxiao Shen, Myle Ott, Michael Auli, Marc'Aurelio Ranzato: Mixture Models for Diverse Machine Translation: Tricks of the Trade. ICML 2019: 5719-5728
View ArticleThe FLORES Evaluation Datasets for Low-Resource Machine Translation:...
Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato: The FLORES Evaluation Datasets for Low-Resource Machine...
View ArticleFacebook AI's WAT19 Myanmar-English Translation Task Submission.
Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato: Facebook AI's WAT19 Myanmar-English Translation Task Submission....
View ArticleFew-shot Sequence Learning with Transformers.
Lajanugen Logeswaran, Ann Lee, Myle Ott, Honglak Lee, Marc'Aurelio Ranzato, Arthur Szlam: Few-shot Sequence Learning with Transformers. CoRR abs/2012.09543 (2020)
View ArticleGeneral Purpose Text Embeddings from Pre-trained Language Models for Scalable...
Jingfei Du, Myle Ott, Haoran Li, Xing Zhou, Veselin Stoyanov: General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference. CoRR abs/2004.14287 (2020)
View ArticleRecipes for building an open-domain chatbot.
Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric Michael Smith, Y-Lan Boureau, Jason Weston: Recipes for building an open-domain...
View ArticleResidual Energy-Based Models for Text Generation.
Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato: Residual Energy-Based Models for Text Generation. CoRR abs/2004.11714 (2020)
View ArticleEnergy-Based Models for Text.
Anton Bakhtin, Yuntian Deng, Sam Gross, Myle Ott, Marc'Aurelio Ranzato, Arthur Szlam: Energy-Based Models for Text. CoRR abs/2004.10188 (2020)
View ArticleResidual Energy-Based Models for Text Generation.
Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato: Residual Energy-Based Models for Text Generation. ICLR 2020
View ArticleGeneral Purpose Text Embeddings from Pre-trained Language Models for Scalable...
Jingfei Du, Myle Ott, Haoran Li, Xing Zhou, Veselin Stoyanov: General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference. EMNLP (Findings) 2020: 3018-3030
View ArticleHow Decoding Strategies Affect the Verifiability of Generated Text.
Luca Massarelli, Fabio Petroni, Aleksandra Piktus, Myle Ott, Tim Rocktäschel, Vassilis Plachouras, Fabrizio Silvestri, Sebastian Riedel: How Decoding Strategies Affect the Verifiability of Generated...
View ArticlePretrained Language Models for Biomedical and Clinical Tasks: Understanding...
Patrick S. H. Lewis, Myle Ott, Jingfei Du, Veselin Stoyanov: Pretrained Language Models for Biomedical and Clinical Tasks: Understanding and Extending the State-of-the-Art. ClinicalNLP@EMNLP 2020: 146-157
View ArticleUnsupervised Cross-lingual Representation Learning at Scale.
Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov: Unsupervised Cross-lingual...
View ArticleOn The Evaluation of Machine Translation SystemsTrained With Back-Translation.
Sergey Edunov, Myle Ott, Marc'Aurelio Ranzato, Michael Auli: On The Evaluation of Machine Translation SystemsTrained With Back-Translation. ACL 2020: 2836-2846
View ArticleEfficient Large Scale Language Modeling with Mixtures of Experts.
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil...
View ArticleFew-shot Learning with Multilingual Language Models.
Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav...
View ArticleSustainable AI: Environmental Implications, Challenges and Opportunities.
Carole-Jean Wu, Ramya Raghavendra, Udit Gupta, Bilge Acun, Newsha Ardalani, Kiwan Maeng, Gloria Chang, Fiona Aga Behram, James Huang, Charles Bai, Michael Gschwind, Anurag Gupta, Myle Ott, Anastasia...
View ArticleNormFormer: Improved Transformer Pretraining with Extra Normalization.
Sam Shleifer, Jason Weston, Myle Ott: NormFormer: Improved Transformer Pretraining with Extra Normalization. CoRR abs/2110.09456 (2021)
View ArticleOn Anytime Learning at Macroscale.
Lucas Caccia, Jing Xu, Myle Ott, Marc'Aurelio Ranzato, Ludovic Denoyer: On Anytime Learning at Macroscale. CoRR abs/2106.09563 (2021)
View ArticleLarger-Scale Transformers for Multilingual Masked Language Modeling.
Naman Goyal, Jingfei Du, Myle Ott, Giri Anantharaman, Alexis Conneau: Larger-Scale Transformers for Multilingual Masked Language Modeling. CoRR abs/2105.00572 (2021)
View ArticleLarger-Scale Transformers for Multilingual Masked Language Modeling.
Naman Goyal, Jingfei Du, Myle Ott, Giri Anantharaman, Alexis Conneau: Larger-Scale Transformers for Multilingual Masked Language Modeling. RepL4NLP@ACL-IJCNLP 2021: 29-33
View ArticleThe Source-Target Domain Mismatch Problem in Machine Translation.
Jiajun Shen, Peng-Jen Chen, Matt Le, Junxian He, Jiatao Gu, Myle Ott, Michael Auli, Marc'Aurelio Ranzato: The Source-Target Domain Mismatch Problem in Machine Translation. EACL 2021: 1519-1533
View ArticleAnalyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain...
Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng: Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models. EACL 2021: 1121-1133
View ArticleRecipes for Building an Open-Domain Chatbot.
Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Eric Michael Smith, Y-Lan Boureau, Jason Weston: Recipes for Building an Open-Domain Chatbot. EACL 2021:...
View ArticleBiological structure and function emerge from scaling unsupervised learning...
Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, Jason Liu, Demi Guo, Myle Ott, C. Lawrence Zitnick, Jerry Ma, Rob Fergus: Biological structure and function emerge from scaling...
View ArticleResidual Energy-Based Models for Text.
Anton Bakhtin, Yuntian Deng, Sam Gross, Myle Ott, Marc'Aurelio Ranzato, Arthur Szlam: Residual Energy-Based Models for Text. J. Mach. Learn. Res. 22: 40:1-40:41 (2021)
View ArticleOPT: Open Pre-trained Transformer Language Models.
Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona T. Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel...
View ArticleEfficient Language Modeling with Sparse all-MLP.
Ping Yu, Mikel Artetxe, Myle Ott, Sam Shleifer, Hongyu Gong, Ves Stoyanov, Xian Li: Efficient Language Modeling with Sparse all-MLP. CoRR abs/2203.06850 (2022)
View ArticleSustainable AI: Environmental Implications, Challenges and Opportunities.
Carole-Jean Wu, Ramya Raghavendra, Udit Gupta, Bilge Acun, Newsha Ardalani, Kiwan Maeng, Gloria Chang, Fiona Aga Behram, Jinshi Huang, Charles Bai, Michael Gschwind, Anurag Gupta, Myle Ott, Anastasia...
View ArticleEfficient Large Scale Language Modeling with Mixtures of Experts.
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giridharan Anantharaman, Xian Li, Shuohui Chen,...
View ArticleFew-shot Learning with Multilingual Generative Language Models.
Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav...
View ArticleOn Anytime Learning at Macroscale.
Lucas Caccia, Jing Xu, Myle Ott, Marc'Aurelio Ranzato, Ludovic Denoyer: On Anytime Learning at Macroscale. CoLLAs 2022: 165-182
View ArticlePyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Bernard Nguyen, Geeta Chauhan, Yuchen...
View ArticlePyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel.
Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-Chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta...
View Article
More Pages to Explore .....