Year |
Conf. |
Topic |
Cited |
Paper |
Authors |
Url |
2019 |
ACL |
# optim-adam, norm-layer, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-transformer, task-seq2seq |
2 |
Incremental Transformer with Deliberation Decoder for Document Grounded Conversations |
Zekang Li, Cheng Niu, Fandong Meng, Yang Feng, Qian Li, Jie Zhou |
https://www.aclweb.org/anthology/P19-1002.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, arch-rnn, arch-att, arch-selfatt, arch-memo, arch-transformer, comb-ensemble, search-beam, pre-bert, latent-vae, task-condlm, task-seq2seq |
1 |
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems |
Hung Le, Doyen Sahoo, Nancy Chen, Steven Hoi |
https://www.aclweb.org/anthology/P19-1564.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, train-active, arch-lstm, arch-bilstm, arch-att, pre-glove, pre-elmo, pre-bert, struct-crf, task-lm |
0 |
Learning Emphasis Selection for Written Text in Visual Media from Crowd-Sourced Label Distributions |
Amirreza Shirani, Franck Dernoncourt, Paul Asente, Nedim Lipka, Seokhwan Kim, Jose Echevarria, Thamar Solorio |
https://www.aclweb.org/anthology/P19-1112.pdf |
2019 |
ACL |
# reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-bilinear, arch-coverage, arch-transformer, comb-ensemble, pre-glove, pre-elmo, pre-bert, task-relation |
8 |
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank |
Junru Zhou, Hai Zhao |
https://www.aclweb.org/anthology/P19-1230.pdf |
2019 |
ACL |
# norm-layer, train-mll, train-transfer, arch-att, arch-subword, arch-transformer, task-textclass, task-spanlab, task-lm, task-seq2seq |
5 |
Large-Scale Transfer Learning for Natural Language Generation |
Sergey Golovanov, Rauf Kurbanov, Sergey Nikolenko, Kyryl Truskovskyi, Alexander Tselousov, Thomas Wolf |
https://www.aclweb.org/anthology/P19-1608.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, train-transfer, arch-rnn, arch-cnn, arch-att, arch-selfatt, arch-coverage, arch-subword, arch-transformer, task-seq2seq, task-relation |
1 |
Neural Machine Translation with Reordering Embeddings |
Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita |
https://www.aclweb.org/anthology/P19-1174.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-att, arch-selfatt, comb-ensemble, task-seq2seq, task-tree |
0 |
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions |
Jierui Li, Lei Wang, Jipeng Zhang, Yan Wang, Bing Tian Dai, Dongxiang Zhang |
https://www.aclweb.org/anthology/P19-1619.pdf |
2019 |
ACL |
# optim-adam, optim-projection, reg-dropout, norm-layer, arch-rnn, arch-att, arch-selfatt, arch-copy, arch-coverage, arch-transformer, search-beam, latent-vae, task-lm, task-seq2seq |
2 |
Improving Abstractive Document Summarization with Salient Information Modeling |
Yongjian You, Weijia Jia, Tianyi Liu, Wenmian Yang |
https://www.aclweb.org/anthology/P19-1205.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, reg-decay, norm-layer, train-augment, arch-gnn, arch-att, arch-selfatt, arch-copy, arch-transformer, search-greedy, pre-glove, pre-bert, task-textclass, task-lm, task-seq2seq, task-tree, task-graph |
2 |
Generating Logical Forms from Graph Representations of Text and Entities |
Peter Shaw, Philip Massey, Angelica Chen, Francesco Piccinno, Yasemin Altun |
https://www.aclweb.org/anthology/P19-1010.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, train-augment, arch-rnn, arch-att, arch-selfatt, arch-residual, arch-subword, arch-transformer, search-beam, task-lm, task-seq2seq |
7 |
Learning Deep Transformer Models for Machine Translation |
Qiang Wang, Bei Li, Tong Xiao, Jingbo Zhu, Changliang Li, Derek F. Wong, Lidia S. Chao |
https://www.aclweb.org/anthology/P19-1176.pdf |
2019 |
ACL |
# reg-dropout, norm-layer, pool-mean, arch-rnn, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-memo, arch-subword, arch-transformer, search-beam, adv-train, task-seq2seq |
0 |
Reference Network for Neural Machine Translation |
Han Fu, Chenghao Liu, Jianling Sun |
https://www.aclweb.org/anthology/P19-1287.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, reg-labelsmooth, norm-layer, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-att, arch-memo, arch-copy, arch-coverage, arch-subword, arch-transformer, task-extractive, task-seq2seq, task-tree |
1 |
Keeping Notes: Conditional Natural Language Generation with a Scratchpad Encoder |
Ryan Benmalek, Madian Khabsa, Suma Desu, Claire Cardie, Michele Banko |
https://www.aclweb.org/anthology/P19-1407.pdf |
2019 |
ACL |
# optim-adam, reg-stopping, norm-layer, train-mtl, train-transfer, arch-lstm, arch-bilstm, arch-att, arch-selfatt, arch-memo, arch-copy, arch-transformer, search-beam, latent-vae, task-lm, task-seq2seq, task-tree |
4 |
Decomposable Neural Paraphrase Generation |
Zichao Li, Xin Jiang, Lifeng Shang, Qun Liu |
https://www.aclweb.org/anthology/P19-1332.pdf |
2019 |
ACL |
# optim-adam, norm-layer, arch-rnn, arch-lstm, arch-gru, arch-cnn, arch-att, arch-transformer, pre-glove, pre-skipthought, pre-bert, task-textpair, task-lm, task-tree |
0 |
Towards Lossless Encoding of Sentences |
Gabriele Prato, Mathieu Duchesneau, Sarath Chandar, Alain Tapp |
https://www.aclweb.org/anthology/P19-1153.pdf |
2019 |
ACL |
# optim-adam, optim-adadelta, reg-dropout, reg-labelsmooth, norm-layer, train-parallel, arch-rnn, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-residual, arch-subword, pre-glove, pre-bert, struct-crf, task-seqlab, task-spanlab, task-lm, task-seq2seq |
1 |
A Lightweight Recurrent Network for Sequence Modeling |
Biao Zhang, Rico Sennrich |
https://www.aclweb.org/anthology/P19-1149.pdf |
2019 |
ACL |
# optim-adam, init-glorot, reg-dropout, reg-labelsmooth, norm-layer, arch-rnn, arch-lstm, arch-treelstm, arch-gnn, arch-cnn, arch-att, arch-selfatt, arch-residual, arch-energy, arch-transformer, search-beam, task-seq2seq |
2 |
Self-Attentional Models for Lattice Inputs |
Matthias Sperber, Graham Neubig, Ngoc-Quan Pham, Alex Waibel |
https://www.aclweb.org/anthology/P19-1115.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, reg-labelsmooth, norm-layer, train-parallel, arch-rnn, arch-lstm, arch-att, arch-subword, search-beam, task-seq2seq, task-alignment |
10 |
Monotonic Infinite Lookback Attention for Simultaneous Machine Translation |
Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel |
https://www.aclweb.org/anthology/P19-1126.pdf |
2019 |
ACL |
# optim-adam, init-glorot, reg-dropout, reg-stopping, norm-layer, pool-max, arch-rnn, arch-lstm, arch-att, comb-ensemble, search-viterbi, struct-crf, struct-cfg, latent-vae, task-seqlab, task-lm, task-graph |
4 |
Compound Probabilistic Context-Free Grammars for Grammar Induction |
Yoon Kim, Chris Dyer, Alexander Rush |
https://www.aclweb.org/anthology/P19-1228.pdf |
2019 |
ACL |
# optim-adam, optim-projection, norm-layer, train-mll, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-coverage, arch-transformer, pre-bert, task-seq2seq |
9 |
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention |
Wenhu Chen, Jianshu Chen, Pengda Qin, Xifeng Yan, William Yang Wang |
https://www.aclweb.org/anthology/P19-1360.pdf |
2019 |
ACL |
# optim-sgd, optim-adam, norm-layer, arch-att, arch-selfatt, arch-coverage, arch-transformer, pre-bert |
8 |
Matching the Blanks: Distributional Similarity for Relation Learning |
Livio Baldini Soares, Nicholas FitzGerald, Jeffrey Ling, Tom Kwiatkowski |
https://www.aclweb.org/anthology/P19-1279.pdf |
2019 |
ACL |
# optim-adagrad, reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-cnn, arch-att, arch-gating, arch-memo, arch-transformer, pre-glove, pre-skipthought, task-textpair, task-lm, task-seq2seq |
0 |
You Only Need Attention to Traverse Trees |
Mahtab Ahmed, Muhammad Rifayat Samee, Robert E. Mercer |
https://www.aclweb.org/anthology/P19-1030.pdf |
2019 |
ACL |
# optim-adam, init-glorot, norm-layer, train-mtl, arch-rnn, arch-lstm, arch-cnn, arch-att, arch-selfatt, arch-transformer, pre-glove, pre-elmo, pre-bert, latent-topic, task-textclass, task-lm, task-seq2seq |
0 |
Text Categorization by Learning Predominant Sense of Words as Auxiliary Task |
Kazuya Shimura, Jiyi Li, Fumiyo Fukumoto |
https://www.aclweb.org/anthology/P19-1105.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, reg-worddropout, reg-stopping, reg-patience, reg-labelsmooth, norm-layer, arch-rnn, arch-att, arch-subword, arch-transformer, search-beam, task-seqlab, task-lm, task-seq2seq, task-alignment |
5 |
Revisiting Low-Resource Neural Machine Translation: A Case Study |
Rico Sennrich, Biao Zhang |
https://www.aclweb.org/anthology/P19-1021.pdf |
2019 |
ACL |
# optim-adam, norm-layer, train-transfer, pool-max, arch-rnn, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-subword, arch-transformer, search-beam, task-lm, task-seq2seq |
1 |
A Compact and Language-Sensitive Multilingual Translation Method |
Yining Wang, Long Zhou, Jiajun Zhang, Feifei Zhai, Jingfang Xu, Chengqing Zong |
https://www.aclweb.org/anthology/P19-1117.pdf |
2019 |
ACL |
# optim-adam, optim-projection, norm-layer, arch-rnn, arch-lstm, arch-gru, arch-att, arch-transformer, pre-bert |
5 |
SUMBT: Slot-Utterance Matching for Universal and Scalable Belief Tracking |
Hwaran Lee, Jinsik Lee, Tae-Yoon Kim |
https://www.aclweb.org/anthology/P19-1546.pdf |
2019 |
ACL |
# optim-sgd, optim-adam, optim-projection, reg-dropout, reg-stopping, norm-layer, arch-lstm, arch-bilstm, arch-att, arch-selfatt, arch-coverage, arch-subword, arch-transformer, pre-fasttext, task-lm |
13 |
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction |
Antoine Bosselut, Hannah Rashkin, Maarten Sap, Chaitanya Malaviya, Asli Celikyilmaz, Yejin Choi |
https://www.aclweb.org/anthology/P19-1470.pdf |
2019 |
ACL |
# optim-adam, reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-att, arch-selfatt, arch-memo, pre-word2vec, pre-elmo, pre-bert, task-textclass |
4 |
One Time of Interaction May Not Be Enough: Go Deep with an Interaction-over-Interaction Network for Response Selection in Dialogues |
Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, Rui Yan |
https://www.aclweb.org/anthology/P19-1001.pdf |
2019 |
ACL |
# optim-sgd, optim-adam, optim-projection, reg-dropout, norm-layer, pool-max, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-att, arch-selfatt, arch-gating, arch-transformer, comb-ensemble, pre-fasttext, task-spanlab, task-seq2seq |
1 |
Token-level Dynamic Self-Attention Network for Multi-Passage Reading Comprehension |
Yimeng Zhuang, Huadong Wang |
https://www.aclweb.org/anthology/P19-1218.pdf |
2019 |
ACL |
# optim-adam, optim-adagrad, reg-dropout, reg-labelsmooth, norm-layer, pool-max, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-subword, arch-transformer, search-beam, pre-paravec, task-seq2seq |
9 |
Hierarchical Transformers for Multi-Document Summarization |
Yang Liu, Mirella Lapata |
https://www.aclweb.org/anthology/P19-1500.pdf |
2019 |
EMNLP |
# optim-sgd, optim-projection, reg-dropout, reg-stopping, norm-layer, train-mll, train-transfer, train-parallel, arch-att, arch-selfatt, arch-coverage, arch-subword, arch-transformer, pre-bert, task-lm, task-seq2seq |
1 |
Simple, Scalable Adaptation for Neural Machine Translation |
Ankur Bapna, Orhan Firat |
https://www.aclweb.org/anthology/D19-1165.pdf |
2019 |
EMNLP |
# optim-sgd, optim-adam, norm-layer, arch-rnn, arch-lstm, arch-att, arch-subword, arch-transformer, task-seq2seq |
0 |
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training |
Alham Fikri Aji, Kenneth Heafield, Nikolay Bogoychev |
https://www.aclweb.org/anthology/D19-1373.pdf |
2019 |
EMNLP |
# optim-adam, init-glorot, reg-dropout, norm-layer, train-mll, arch-rnn, arch-lstm, arch-bilstm, arch-cnn, arch-att, arch-selfatt, arch-residual, search-beam, struct-crf, task-lm, task-seq2seq |
0 |
Efficient Convolutional Neural Networks for Diacritic Restoration |
Sawsan Alqahtani, Ajay Mishra, Mona Diab |
https://www.aclweb.org/anthology/D19-1151.pdf |
2019 |
EMNLP |
# optim-sgd, optim-projection, reg-dropout, norm-layer, train-augment, arch-rnn, arch-lstm, arch-bilstm, arch-att, adv-train, latent-vae, task-condlm, task-seq2seq |
0 |
Semi-supervised Text Style Transfer: Cross Projection in Latent Space |
Mingyue Shang, Piji Li, Zhenxin Fu, Lidong Bing, Dongyan Zhao, Shuming Shi, Rui Yan |
https://www.aclweb.org/anthology/D19-1499.pdf |
2019 |
EMNLP |
# optim-adam, reg-decay, norm-layer, train-mll, pool-mean, arch-att, arch-selfatt, arch-transformer, comb-ensemble, pre-bert, task-spanlab, task-lm, task-seq2seq |
0 |
Cross-Lingual Machine Reading Comprehension |
Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, Guoping Hu |
https://www.aclweb.org/anthology/D19-1169.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, norm-layer, arch-att, arch-selfatt, arch-residual, arch-subword, arch-transformer, task-seq2seq |
1 |
Synchronously Generating Two Languages with Interactive Decoding |
Yining Wang, Jiajun Zhang, Long Zhou, Yuchen Liu, Chengqing Zong |
https://www.aclweb.org/anthology/D19-1330.pdf |
2019 |
EMNLP |
# optim-adam, reg-dropout, reg-decay, norm-layer, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-subword, arch-transformer, search-beam, pre-bert, task-textclass, task-seqlab, task-lm, task-seq2seq |
0 |
Subword Language Model for Query Auto-Completion |
Gyuwan Kim |
https://www.aclweb.org/anthology/D19-1507.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, norm-layer, train-mtl, arch-lstm, arch-bilstm, arch-cnn, arch-att, arch-selfatt, arch-coverage, pre-word2vec, pre-elmo, pre-bert, struct-crf, task-seqlab, task-lm, task-seq2seq |
0 |
Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word Representations |
Christian Hadiwinoto, Hwee Tou Ng, Wee Chung Gan |
https://www.aclweb.org/anthology/D19-1533.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, norm-layer, norm-batch, struct-crf, adv-train, task-textclass, task-seqlab |
0 |
Open Event Extraction from Online Text using a Generative Adversarial Network |
Rui Wang, Deyu Zhou, Yulan He |
https://www.aclweb.org/anthology/D19-1027.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, norm-layer, train-mtl, train-mll, arch-subword, pre-word2vec, pre-fasttext, pre-glove, pre-skipthought, pre-elmo, loss-svd, task-textpair, task-seq2seq |
0 |
Parameter-free Sentence Embedding via Orthogonal Basis |
Ziyi Yang, Chenguang Zhu, Weizhu Chen |
https://www.aclweb.org/anthology/D19-1059.pdf |
2019 |
EMNLP |
# optim-adam, optim-adagrad, norm-layer, pool-mean, arch-rnn, arch-lstm, arch-bilstm, arch-cnn, arch-att, arch-selfatt, arch-transformer, search-beam, pre-glove, pre-paravec, adv-train, latent-topic, task-textpair, task-extractive, task-spanlab, task-lm, task-condlm, task-seq2seq |
0 |
Topic-Guided Coherence Modeling for Sentence Ordering by Preserving Global and Local Information |
Byungkook Oh, Seungmin Seo, Cheolheon Shin, Eunju Jo, Kyong-Ho Lee |
https://www.aclweb.org/anthology/D19-1232.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, reg-dropout, norm-layer, arch-rnn, arch-att, arch-selfatt, arch-copy, arch-subword, arch-transformer, search-beam, task-seq2seq, task-alignment |
0 |
Contrastive Attention Mechanism for Abstractive Sentence Summarization |
Xiangyu Duan, Hongfei Yu, Mingming Yin, Min Zhang, Weihua Luo, Yue Zhang |
https://www.aclweb.org/anthology/D19-1301.pdf |
2019 |
EMNLP |
# optim-sgd, optim-adam, optim-projection, reg-dropout, norm-layer, train-mll, arch-lstm, arch-att, arch-selfatt, arch-coverage, arch-subword, pre-elmo, pre-bert, struct-crf, loss-triplet, task-seqlab, task-lm, task-seq2seq, task-context |
14 |
Cloze-driven Pretraining of Self-attention Networks |
Alexei Baevski, Sergey Edunov, Yinhan Liu, Luke Zettlemoyer, Michael Auli |
https://www.aclweb.org/anthology/D19-1539.pdf |
2019 |
EMNLP |
# optim-adam, init-glorot, reg-dropout, norm-layer, arch-rnn, arch-cnn, arch-att, arch-selfatt, task-textpair, task-condlm |
0 |
DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization |
Chujie Lu, Long Chen, Chilie Tan, Xiaolin Li, Jun Xiao |
https://www.aclweb.org/anthology/D19-1518.pdf |
2019 |
EMNLP |
# optim-adam, reg-dropout, norm-layer, pool-max, pool-mean, arch-lstm, arch-bilstm, arch-att, arch-coverage, comb-ensemble, pre-fasttext, pre-glove, task-textpair, task-condlm |
0 |
Asynchronous Deep Interaction Network for Natural Language Inference |
Di Liang, Fubao Zhang, Qi Zhang, Xuanjing Huang |
https://www.aclweb.org/anthology/D19-1271.pdf |
2019 |
EMNLP |
# optim-adam, norm-layer, train-mll, train-augment, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-bilinear, arch-transformer, pre-elmo, pre-bert, task-lm, task-condlm, task-seq2seq |
19 |
LXMERT: Learning Cross-Modality Encoder Representations from Transformers |
Hao Tan, Mohit Bansal |
https://www.aclweb.org/anthology/D19-1514.pdf |
2019 |
EMNLP |
# optim-adam, norm-layer, arch-lstm, arch-bilstm, arch-att, arch-selfatt, arch-transformer, pre-word2vec, pre-elmo, pre-bert, task-textclass, task-textpair, task-extractive, task-spanlab, task-lm, task-seq2seq, task-cloze |
0 |
Fine-tune BERT with Sparse Self-Attention Mechanism |
Baiyun Cui, Yingming Li, Ming Chen, Zhongfei Zhang |
https://www.aclweb.org/anthology/D19-1361.pdf |
2019 |
EMNLP |
# optim-adam, norm-layer, pool-max, pool-mean, arch-rnn, arch-lstm, arch-gru, arch-cnn, arch-att, arch-selfatt, arch-memo, arch-transformer, pre-glove, pre-bert, latent-topic, task-lm, task-seq2seq |
1 |
Knowledge-Enriched Transformer for Emotion Detection in Textual Conversations |
Peixiang Zhong, Di Wang, Chunyan Miao |
https://www.aclweb.org/anthology/D19-1016.pdf |
2019 |
EMNLP |
# optim-adam, reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-copy, arch-bilinear, arch-coverage, arch-transformer, search-beam, latent-vae, task-seq2seq, task-relation, task-tree, task-graph |
0 |
Core Semantic First: A Top-down Approach for AMR Parsing |
Deng Cai, Wai Lam |
https://www.aclweb.org/anthology/D19-1393.pdf |
2019 |
EMNLP |
# reg-dropout, reg-stopping, norm-layer, arch-rnn, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-bilinear, arch-subword, arch-transformer, search-beam, task-lm, task-seq2seq |
1 |
Joey NMT: A Minimalist NMT Toolkit for Novices |
Julia Kreutzer, Joost Bastings, Stefan Riezler |
https://www.aclweb.org/anthology/D19-3019.pdf |
2019 |
EMNLP |
# optim-adam, optim-adadelta, optim-projection, init-glorot, reg-dropout, norm-layer, train-mtl, arch-lstm, arch-bilstm, arch-att, arch-selfatt, arch-residual, arch-bilinear, arch-coverage, arch-transformer, pre-glove, pre-elmo, pre-bert, task-textclass, task-seq2seq, task-relation, task-tree |
0 |
Syntax-Enhanced Self-Attention-Based Semantic Role Labeling |
Yue Zhang, Rui Wang, Luo Si |
https://www.aclweb.org/anthology/D19-1057.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, init-glorot, reg-dropout, reg-labelsmooth, norm-layer, norm-gradient, arch-rnn, arch-att, arch-selfatt, arch-residual, arch-subword, arch-transformer, search-beam, pre-bert, task-lm, task-seq2seq |
2 |
Improving Deep Transformer with Depth-Scaled Initialization and Merged Attention |
Biao Zhang, Ivan Titov, Rico Sennrich |
https://www.aclweb.org/anthology/D19-1083.pdf |
2019 |
EMNLP |
# optim-adam, optim-projection, reg-dropout, norm-layer, pool-max, arch-lstm, arch-bilstm, arch-cnn, arch-att, arch-selfatt, latent-topic, task-lm |
0 |
A Hierarchical Location Prediction Neural Network for Twitter User Geolocation |
Binxuan Huang, Kathleen Carley |
https://www.aclweb.org/anthology/D19-1480.pdf |
2019 |
EMNLP |
# optim-adam, reg-dropout, reg-decay, norm-layer, train-mll, train-parallel, arch-rnn, arch-att, arch-subword, arch-transformer, search-greedy, search-beam, pre-bert, task-lm, task-seq2seq, task-cloze |
2 |
Mask-Predict: Parallel Decoding of Conditional Masked Language Models |
Marjan Ghazvininejad, Omer Levy, Yinhan Liu, Luke Zettlemoyer |
https://www.aclweb.org/anthology/D19-1633.pdf |
2019 |
EMNLP |
# optim-projection, reg-dropout, norm-layer, pool-mean, arch-rnn, arch-att, arch-selfatt, arch-subword, arch-transformer, pre-bert, task-textclass, task-textpair |
0 |
Learning Invariant Representations of Social Media Users |
Nicholas Andrews, Marcus Bishop |
https://www.aclweb.org/anthology/D19-1178.pdf |
2019 |
NAA-CL |
# optim-adam, norm-layer, arch-rnn, arch-lstm, arch-att, arch-selfatt, arch-residual, arch-coverage, arch-subword, arch-transformer, search-beam |
19 |
MuST-C: a Multilingual Speech Translation Corpus |
Mattia A. Di Gangi, Roldano Cattoni, Luisa Bentivogli, Matteo Negri, Marco Turchi |
https://www.aclweb.org/anthology/N19-1202.pdf |
2019 |
NAA-CL |
# optim-adam, reg-dropout, reg-labelsmooth, norm-layer, train-transfer, arch-att, arch-selfatt, arch-subword, arch-transformer, task-seq2seq |
7 |
Non-Parametric Adaptation for Neural Machine Translation |
Ankur Bapna, Orhan Firat |
https://www.aclweb.org/anthology/N19-1191.pdf |
2019 |
NAA-CL |
# optim-sgd, optim-adam, reg-dropout, reg-labelsmooth, norm-layer, arch-att, arch-selfatt, arch-subword, arch-transformer, pre-elmo, task-textclass, task-lm, task-seq2seq |
10 |
Pre-trained language model representations for language generation |
Sergey Edunov, Alexei Baevski, Michael Auli |
https://www.aclweb.org/anthology/N19-1409.pdf |
2019 |
NAA-CL |
# reg-dropout, reg-worddropout, norm-layer, pool-max, arch-cnn, arch-att, arch-selfatt, arch-bilinear, arch-transformer |
5 |
Relation Extraction using Explicit Context Conditioning |
Gaurav Singh, Parminder Bhatia |
https://www.aclweb.org/anthology/N19-1147.pdf |
2019 |
NAA-CL |
# optim-adam, init-glorot, reg-dropout, norm-layer, arch-rnn, arch-lstm, arch-att, arch-coverage, pre-fasttext, loss-svd, task-condlm |
2 |
AudioCaps: Generating Captions for Audios in The Wild |
Chris Dongjoo Kim, Byeongchang Kim, Hyunmin Lee, Gunhee Kim |
https://www.aclweb.org/anthology/N19-1011.pdf |
2019 |
NAA-CL |
# optim-adam, init-glorot, norm-layer, arch-rnn, arch-lstm, arch-gru, arch-att, arch-selfatt, arch-transformer, task-extractive, task-seq2seq, task-relation |
4 |
Single Document Summarization as Tree Induction |
Yang Liu, Ivan Titov, Mirella Lapata |
https://www.aclweb.org/anthology/N19-1173.pdf |
2019 |
NAA-CL |
# optim-adam, reg-dropout, norm-layer, train-augment, arch-lstm, arch-att, arch-copy, arch-coverage, search-beam, pre-word2vec, task-seq2seq, task-tree, task-graph |
2 |
Factorising AMR generation through syntax |
Kris Cao, Stephen Clark |
https://www.aclweb.org/anthology/N19-1223.pdf |
2019 |
NAA-CL |
# optim-adam, init-glorot, reg-labelsmooth, norm-layer, pool-max, arch-rnn, arch-cnn, arch-att, arch-selfatt, arch-memo, pre-fasttext, latent-vae, task-extractive, task-seq2seq |
9 |
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks |
Byeongchang Kim, Hyunwoo Kim, Gunhee Kim |
https://www.aclweb.org/anthology/N19-1260.pdf |
2019 |
NAA-CL |
# optim-adam, reg-dropout, norm-layer, train-transfer, arch-rnn, arch-lstm, arch-att, arch-subword, search-greedy, search-beam, struct-cfg, nondif-gumbelsoftmax, task-lm, task-seq2seq |
5 |
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation |
Xing Niu, Weijia Xu, Marine Carpuat |
https://www.aclweb.org/anthology/N19-1043.pdf |
2019 |
NAA-CL |
# optim-adam, reg-dropout, reg-worddropout, norm-layer, arch-rnn, arch-lstm, arch-att, pre-glove, nondif-reinforce, latent-vae, task-lm, task-condlm, task-seq2seq, task-tree |
4 |
SEQˆ3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression |
Christos Baziotis, Ion Androutsopoulos, Ioannis Konstas, Alexandros Potamianos |
https://www.aclweb.org/anthology/N19-1071.pdf |