| Year | Conf. | Topic | Cited | Paper | Authors | URL |
|---|---|---|---|---|---|---|
| 2019 | ACL | # optim-adagrad, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-gnn, arch-att, search-beam, search-viterbi, pre-glove, pre-skipthought, pre-elmo, task-lm, task-relation | 0 | Multi-Relational Script Learning for Discourse Relations | I-Ta Lee, Dan Goldwasser | https://www.aclweb.org/anthology/P19-1413.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, train-mtl, arch-rnn, arch-birnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-memo, struct-crf | 0 | Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference | He Bai, Yu Zhou, Jiajun Zhang, Chengqing Zong | https://www.aclweb.org/anthology/P19-1541.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, train-mtl, train-mll, pool-max, pool-mean, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-coverage, pre-glove, pre-skipthought, pre-elmo, pre-bert, adv-train, task-textpair, task-lm, task-cloze, task-relation | 2 | DisSent: Learning Sentence Representations from Explicit Discourse Relations | Allen Nie, Erin Bennett, Noah Goodman | https://www.aclweb.org/anthology/P19-1442.pdf |
| 2019 | ACL | # optim-adam, optim-projection, reg-dropout, train-mtl, train-transfer, pool-max, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-coverage, comb-ensemble, search-viterbi, struct-crf, adv-train, task-textclass, task-seq2seq, task-relation | 0 | A Unified Multi-task Adversarial Learning Framework for Pharmacovigilance Mining | Shweta Yadav, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya | https://www.aclweb.org/anthology/P19-1516.pdf |
| 2019 | ACL | # optim-adam, optim-projection, reg-dropout, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-selfatt, arch-transformer, pre-glove, pre-elmo, struct-crf, task-textclass | 0 | Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes | Jie Cao, Michael Tanana, Zac Imel, Eric Poitras, David Atkins, Vivek Srikumar | https://www.aclweb.org/anthology/P19-1563.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, norm-gradient, arch-lstm, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-memo, pre-glove, task-spanlab, task-seq2seq | 0 | Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text | Jianxing Yu, Zhengjun Zha, Jian Yin | https://www.aclweb.org/anthology/P19-1217.pdf |
| 2019 | ACL | # reg-dropout, train-mll, arch-rnn, arch-gru, arch-bigru, arch-att, arch-subword, pre-fasttext, pre-glove, task-textclass, task-lm, task-seq2seq | 1 | Towards Integration of Statistical Hypothesis Tests into Deep Neural Networks | Ahmad Aghaebrahimian, Mark Cieliebak | https://www.aclweb.org/anthology/P19-1557.pdf |
| 2019 | ACL | # optim-adam, reg-stopping, pool-max, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-selfatt, arch-transformer, pre-glove, pre-elmo, pre-bert, task-condlm, task-seq2seq | 1 | Constructing Interpretive Spatio-Temporal Features for Multi-Turn Responses Selection | Junyu Lu, Chenbin Zhang, Zeying Xie, Guang Ling, Tom Chao Zhou, Zenglin Xu | https://www.aclweb.org/anthology/P19-1006.pdf |
| 2019 | ACL | # reg-dropout, train-mtl, train-transfer, pool-max, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, adv-train, task-seqlab | 0 | Exploiting Entity BIO Tag Embeddings and Multi-task Learning for Relation Extraction with Imbalanced Data | Wei Ye, Bo Li, Rui Xie, Zhonghao Sheng, Long Chen, Shikun Zhang | https://www.aclweb.org/anthology/P19-1130.pdf |
| 2019 | ACL | # optim-adam, init-glorot, train-mtl, pool-max, arch-rnn, arch-gru, arch-bigru, arch-att, search-beam, latent-vae, task-textclass, task-lm, task-seq2seq | 2 | Unsupervised Neural Single-Document Summarization of Reviews via Learning Latent Discourse Structure and its Ranking | Masaru Isonuma, Junichiro Mori, Ichiro Sakata | https://www.aclweb.org/anthology/P19-1206.pdf |
| 2019 | ACL | # reg-dropout, train-transfer, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-transformer, comb-ensemble, pre-word2vec, task-seqlab, task-lm, task-seq2seq, task-relation | 4 | Open Vocabulary Learning for Neural Chinese Pinyin IME | Zhuosheng Zhang, Yafang Huang, Hai Zhao | https://www.aclweb.org/anthology/P19-1154.pdf |
| 2019 | ACL | # optim-adam, init-glorot, reg-dropout, reg-worddropout, reg-stopping, train-mtl, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-transformer, pre-glove, pre-bert, task-textclass, task-lm, task-condlm | 1 | Neural Legal Judgment Prediction in English | Ilias Chalkidis, Ion Androutsopoulos, Nikolaos Aletras | https://www.aclweb.org/anthology/P19-1424.pdf |
| 2019 | ACL | # optim-amsgrad, init-glorot, reg-dropout, reg-labelsmooth, norm-gradient, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-memo, arch-bilinear, search-beam, task-lm, task-condlm, task-seq2seq | 0 | Ordinal and Attribute Aware Response Generation in a Multimodal Dialogue System | Hardik Chauhan, Mauajama Firdaus, Asif Ekbal, Pushpak Bhattacharyya | https://www.aclweb.org/anthology/P19-1540.pdf |
| 2019 | ACL | # optim-adam, reg-stopping, norm-gradient, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-copy, search-beam, latent-vae, latent-topic, task-textclass, task-seq2seq | 3 | Topic-Aware Neural Keyphrase Generation for Social Media Language | Yue Wang, Jing Li, Hou Pong Chan, Irwin King, Michael R. Lyu, Shuming Shi | https://www.aclweb.org/anthology/P19-1240.pdf |
| 2019 | ACL | # optim-adam, norm-gradient, arch-rnn, arch-gru, arch-bigru, arch-att, pre-word2vec, latent-vae, task-seq2seq | 1 | Modeling Semantic Relationship in Multi-turn Conversations with Hierarchical Latent Variables | Lei Shen, Yang Feng, Haolan Zhan | https://www.aclweb.org/anthology/P19-1549.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, reg-patience, activ-relu, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, comb-ensemble, pre-glove, task-textclass, task-condlm | 1 | Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model | Yitao Cai, Huiyu Cai, Xiaojun Wan | https://www.aclweb.org/anthology/P19-1239.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, reg-decay, train-mtl, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, pre-fasttext, pre-glove, struct-crf, task-textclass, task-seqlab, task-lm, task-seq2seq, task-relation, task-alignment | 1 | Exploring Sequence-to-Sequence Learning in Aspect Term Extraction | Dehong Ma, Sujian Li, Fangzhao Wu, Xing Xie, Houfeng Wang | https://www.aclweb.org/anthology/P19-1344.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, activ-relu, pool-max, arch-rnn, arch-gru, arch-bigru, task-lm, task-seq2seq | 1 | Numeracy-600K: Learning Numeracy for Detecting Exaggerated Information in Market Comments | Chung-Chi Chen, Hen-Hsen Huang, Hiroya Takamura, Hsin-Hsi Chen | https://www.aclweb.org/anthology/P19-1635.pdf |
| 2019 | ACL | # optim-adam, train-mtl, train-mll, train-transfer, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, comb-ensemble, struct-crf, task-seq2seq, task-tree | 2 | Robust Zero-Shot Cross-Domain Slot Filling with Example Values | Darsh Shah, Raghav Gupta, Amir Fayazi, Dilek Hakkani-Tur | https://www.aclweb.org/anthology/P19-1547.pdf |
| 2019 | ACL | # optim-adam, optim-adadelta, reg-dropout, pool-max, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-memo, search-beam, pre-glove, pre-elmo, pre-bert, task-textpair, task-spanlab, task-seq2seq | 4 | Multi-Hop Paragraph Retrieval for Open-Domain Question Answering | Yair Feldman, Ran El-Yaniv | https://www.aclweb.org/anthology/P19-1222.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, train-transfer, train-active, pool-max, pool-mean, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-gating, arch-subword, pre-fasttext, pre-glove, task-lm, task-seq2seq | 1 | Low-resource Deep Entity Resolution with Transfer and Active Learning | Jungo Kasai, Kun Qian, Sairam Gurajada, Yunyao Li, Lucian Popa | https://www.aclweb.org/anthology/P19-1586.pdf |
| 2019 | ACL | # optim-sgd, optim-adam, init-glorot, reg-dropout, reg-patience, train-mtl, train-mll, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-gating, search-viterbi, pre-glove, struct-crf, adv-examp, task-seqlab, task-lm, task-seq2seq | 1 | Sentiment Tagging with Partial Labels using Modular Architectures | Xiao Zhang, Dan Goldwasser | https://www.aclweb.org/anthology/P19-1055.pdf |
| 2019 | ACL | # optim-adam, reg-dropout, train-transfer, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-bilinear, pre-glove, pre-elmo, pre-bert, struct-crf, task-seq2seq, task-relation | 3 | A Unified Linear-Time Framework for Sentence-Level Discourse Parsing | Xiang Lin, Shafiq Joty, Prathyusha Jwalapuram, M Saiful Bari | https://www.aclweb.org/anthology/P19-1410.pdf |
| 2019 | EMNLP | # init-glorot, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-selfatt, arch-transformer, comb-ensemble, pre-bert, latent-vae, task-lm, task-seq2seq | 0 | Parallel Iterative Edit Models for Local Sequence Transduction | Abhijeet Awasthi, Sunita Sarawagi, Rasna Goyal, Sabyasachi Ghosh, Vihari Piratla | https://www.aclweb.org/anthology/D19-1435.pdf |
| 2019 | EMNLP | # arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, pre-bert, struct-crf, task-seqlab, task-lm, task-seq2seq | 0 | Weakly Supervised Attention Networks for Entity Recognition | Barun Patra, Joel Ruben Antony Moniz | https://www.aclweb.org/anthology/D19-1652.pdf |
| 2019 | EMNLP | # optim-adam, reg-dropout, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-memo, arch-coverage, pre-word2vec, struct-crf, task-seq2seq | 0 | A Knowledge Regularized Hierarchical Approach for Emotion Cause Analysis | Chuang Fan, Hongyu Yan, Jiachen Du, Lin Gui, Lidong Bing, Min Yang, Ruifeng Xu, Ruibin Mao | https://www.aclweb.org/anthology/D19-1563.pdf |
| 2019 | EMNLP | # optim-adam, reg-dropout, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-bilinear, pre-word2vec, pre-elmo, task-seq2seq, task-relation | 0 | Hierarchical Pointer Net Parsing | Linlin Liu, Xiang Lin, Shafiq Joty, Simeng Han, Lidong Bing | https://www.aclweb.org/anthology/D19-1093.pdf |
| 2019 | EMNLP | # optim-adam, init-glorot, norm-gradient, arch-gru, arch-bigru, arch-att, adv-train, latent-vae, task-seq2seq | 0 | Answer-guided and Semantic Coherent Question Generation in Open-domain Conversation | Weichao Wang, Shi Feng, Daling Wang, Yifei Zhang | https://www.aclweb.org/anthology/D19-1511.pdf |
| 2019 | EMNLP | # reg-dropout, reg-worddropout, train-transfer, pool-max, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, latent-vae, latent-topic, task-textclass, task-lm, task-seq2seq | 0 | A Topic Augmented Text Generation Model: Joint Learning of Semantics and Structural Features | Hongyin Tang, Miao Li, Beihong Jin | https://www.aclweb.org/anthology/D19-1513.pdf |
| 2019 | EMNLP | # optim-sgd, optim-adam, activ-tanh, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-subword, arch-transformer, pre-bert, loss-nce, task-seq2seq, task-relation | 0 | Minimally Supervised Learning of Affective Events Using Discourse Relations | Jun Saito, Yugo Murawaki, Sadao Kurohashi | https://www.aclweb.org/anthology/D19-1581.pdf |
| 2019 | EMNLP | # optim-adam, arch-rnn, arch-gru, arch-bigru, arch-att, arch-copy, arch-subword, pre-bert, task-textclass, task-lm, task-seq2seq | 1 | Generating Personalized Recipes from Historical User Preferences | Bodhisattwa Prasad Majumder, Shuyang Li, Jianmo Ni, Julian McAuley | https://www.aclweb.org/anthology/D19-1613.pdf |
| 2019 | EMNLP | # optim-adam, init-glorot, reg-dropout, train-mtl, train-transfer, pool-max, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-memo, adv-train | 0 | Domain Adaptation for Person-Job Fit with Transferable Deep Global Match Network | Shuqing Bian, Wayne Xin Zhao, Yang Song, Tao Zhang, Ji-Rong Wen | https://www.aclweb.org/anthology/D19-1487.pdf |
| 2019 | EMNLP | # reg-dropout, train-transfer, pool-max, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, pre-glove, task-relation | 0 | Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction | Qinyuan Ye, Liyuan Liu, Maosen Zhang, Xiang Ren | https://www.aclweb.org/anthology/D19-1397.pdf |
| 2019 | EMNLP | # optim-adam, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, pre-glove, task-textpair | 0 | Modeling the Relationship between User Comments and Edits in Document Revision | Xuchao Zhang, Dheeraj Rajagopal, Michael Gamon, Sujay Kumar Jauhar, ChangTien Lu | https://www.aclweb.org/anthology/D19-1505.pdf |
| 2019 | EMNLP | # arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-treelstm, arch-cnn, arch-att, arch-selfatt, arch-subword, pre-glove, task-textclass, task-seq2seq | 0 | Investigating Dynamic Routing in Tree-Structured LSTM for Sentiment Analysis | Jin Wang, Liang-Chih Yu, K. Robert Lai, Xuejie Zhang | https://www.aclweb.org/anthology/D19-1343.pdf |
| 2019 | EMNLP | # optim-adam, reg-dropout, train-mtl, arch-lstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-selfatt, loss-cca, task-seq2seq | 0 | Context-aware Interactive Attention for Multi-modal Sentiment and Emotion Analysis | Dushyant Singh Chauhan, Md Shad Akhtar, Asif Ekbal, Pushpak Bhattacharyya | https://www.aclweb.org/anthology/D19-1566.pdf |
| 2019 | EMNLP | # optim-adam, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-memo, arch-transformer, pre-glove, pre-bert, task-seq2seq | 0 | A Challenge Dataset and Effective Models for Aspect-Based Sentiment Analysis | Qingnan Jiang, Lei Chen, Ruifeng Xu, Xiang Ao, Min Yang | https://www.aclweb.org/anthology/D19-1654.pdf |
| 2019 | EMNLP | # optim-adam, init-glorot, reg-dropout, train-mll, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-memo, search-viterbi, struct-crf, latent-vae, task-seqlab, task-seq2seq | 1 | Learning Explicit and Implicit Structures for Targeted Sentiment Analysis | Hao Li, Wei Lu | https://www.aclweb.org/anthology/D19-1550.pdf |
| 2019 | EMNLP | # arch-rnn, arch-gru, arch-bigru, arch-gnn, arch-att, task-seq2seq | 0 | CaRe: Open Knowledge Graph Embeddings | Swapnil Gupta, Sreyash Kenkre, Partha Talukdar | https://www.aclweb.org/anthology/D19-1036.pdf |
| 2019 | EMNLP | # train-mtl, arch-gru, arch-bigru, arch-att, arch-selfatt, arch-transformer, pre-bert | 1 | SUM-QE: a BERT-based Summary Quality Estimation Model | Stratos Xenouleas, Prodromos Malakasiotis, Marianna Apidianaki, Ion Androutsopoulos | https://www.aclweb.org/anthology/D19-1618.pdf |
| 2019 | EMNLP | # optim-sgd, init-glorot, reg-dropout, arch-rnn, arch-birnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, arch-memo, search-viterbi, struct-crf, task-seqlab, task-relation | 0 | Enhancing Dialogue Symptom Diagnosis with Global Attention and Symptom Graph | Xinzhu Lin, Xiahui He, Qin Chen, Huaixiao Tou, Zhongyu Wei, Ting Chen | https://www.aclweb.org/anthology/D19-1508.pdf |
| 2019 | EMNLP | # optim-projection, arch-lstm, arch-gru, arch-bigru, arch-att, arch-selfatt, task-textclass, task-extractive, task-lm, task-condlm, task-seq2seq | 0 | From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining | Alexandre Garcia, Pierre Colombo, Florence d’Alché-Buc, Slim Essid, Chloé Clavel | https://www.aclweb.org/anthology/D19-1556.pdf |
| 2019 | EMNLP | # optim-sgd, optim-adam, reg-dropout, train-mtl, pool-max, pool-mean, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-coverage, pre-word2vec, latent-topic, task-spanlab, task-seq2seq | 0 | A Neural Citation Count Prediction Model based on Peer Review Text | Siqing Li, Wayne Xin Zhao, Eddy Jing Yin, Ji-Rong Wen | https://www.aclweb.org/anthology/D19-1497.pdf |
| 2019 | EMNLP | # optim-adam, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-copy, arch-coverage, search-beam, pre-glove, latent-vae, task-spanlab, task-lm, task-condlm, task-seq2seq | 1 | Mixture Content Selection for Diverse Sequence Generation | Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi | https://www.aclweb.org/anthology/D19-1308.pdf |
| 2019 | NAACL | # optim-adam, reg-dropout, train-mll, arch-rnn, arch-gru, arch-bigru, arch-att, arch-selfatt, latent-topic, task-textpair | 2 | Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering | Lahari Poddar, Leonardo Neves, William Brendel, Luis Marujo, Sergey Tulyakov, Pradeep Karuturi | https://www.aclweb.org/anthology/N19-2020.pdf |
| 2019 | NAACL | # optim-adadelta, reg-dropout, train-transfer, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-cnn, arch-att, pre-glove, pre-elmo, pre-bert, task-lm | 4 | Structural Scaffolds for Citation Intent Classification in Scientific Publications | Arman Cohan, Waleed Ammar, Madeleine van Zuylen, Field Cady | https://www.aclweb.org/anthology/N19-1361.pdf |
| 2019 | NAACL | # optim-adam, reg-dropout, arch-rnn, arch-lstm, arch-gru, arch-bigru, arch-att, arch-memo, arch-copy, search-beam, latent-topic, task-seq2seq | 2 | Microblog Hashtag Generation via Encoding Conversation Contexts | Yue Wang, Jing Li, Irwin King, Michael R. Lyu, Shuming Shi | https://www.aclweb.org/anthology/N19-1164.pdf |
| 2019 | NAACL | # optim-sgd, optim-adam, reg-dropout, reg-patience, arch-rnn, arch-birnn, arch-gru, arch-bigru, arch-cnn, arch-att, arch-selfatt, arch-memo, arch-subword, pre-word2vec, pre-glove, pre-skipthought, pre-elmo, struct-crf, adv-train, latent-vae, task-textclass, task-lm | 6 | Dialogue Act Classification with Context-Aware Self-Attention | Vipul Raheja, Joel Tetreault | https://www.aclweb.org/anthology/N19-1373.pdf |
| 2019 | NAACL | # optim-adam, arch-rnn, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, pre-glove, task-spanlab, task-lm, task-seq2seq | 0 | The Lower The Simpler: Simplifying Hierarchical Recurrent Models | Chao Wang, Hui Jiang | https://www.aclweb.org/anthology/N19-1402.pdf |
| 2019 | NAACL | # optim-adam, reg-dropout, arch-gru, arch-bigru, arch-att, arch-memo, arch-copy, task-lm, task-seq2seq | 0 | Disentangling Language and Knowledge in Task-Oriented Dialogs | Dinesh Raghu, Nikhil Gupta, Mausam | https://www.aclweb.org/anthology/N19-1126.pdf |
| 2019 | NAACL | # optim-adam, reg-dropout, train-mtl, arch-lstm, arch-gru, arch-bigru, arch-att, arch-selfatt, pre-glove | 3 | Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis | Md Shad Akhtar, Dushyant Chauhan, Deepanway Ghosal, Soujanya Poria, Asif Ekbal, Pushpak Bhattacharyya | https://www.aclweb.org/anthology/N19-1034.pdf |
| 2019 | NAACL | # optim-adam, reg-decay, train-augment, arch-lstm, arch-bilstm, arch-gru, arch-bigru, arch-att, arch-selfatt, comb-ensemble, pre-glove | 0 | On Knowledge distillation from complex networks for response prediction | Siddhartha Arora, Mitesh M. Khapra, Harish G. Ramaswamy | https://www.aclweb.org/anthology/N19-1382.pdf |