Hierarchical Neural Story Generation
[article]
2018
arXiv
pre-print
We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.
arXiv:1805.04833v1
fatcat:bzodwbsrkvaazfk3kblokz5csq
Multilingual AMR-to-Text Generation
[article]
2020
arXiv
pre-print
Rather than model the graph structure directly, following Fan et al. (2019a), we model the graph using a graph embedding. ...
This pretraining incorporates various noise operations, such as masking (Devlin et al., 2019), span masking (Fan et al., 2019a), and shuffling. ...
arXiv:2011.05443v1
fatcat:qigjt3cqabevnojukorr5itpsi
LOST, FAN CULTURE AND THE NEO-BAROQUE
[chapter]
2012
Anuario calderoniano 5 (2012)
For example, in addition to all the fan-run online activity, ABC, the show's production company, has a website that targets fans of Lost. ...
Add to this online chat-rooms, blogs, fan sites and, more recently, studio-hosted fan sites and we see emerge what Henry Jenkins calls the collective intelligence of participatory culture. ...
doi:10.31819/9783865279880-002
fatcat:xkxopvrkmvf4haaafuyyyu3it4
Comparing Psychopathological Symptoms in Portuguese Football Fans and Non-Fans
2020
Behavioral Sciences
Results showed that football fans and non-fans are mostly male, have an affective relationship, are childless, have secondary education or a high degree, and are employed or students; fans are more likely ...
The present study aims to characterize football fans and non-fans and to compare their psychopathological symptoms with the latest normative values for the Portuguese population from Canavarro in 2007. ...
fans. ...
doi:10.3390/bs10050085
pmid:32370078
pmcid:PMC7287926
fatcat:xsx43lxavvgxneozc5rwjj4r7m
Controllable Abstractive Summarization
[article]
2018
arXiv
pre-print
For example, a sports fan reading about a recent game might want to focus the summary on the performance of their favorite player. ...
arXiv:1711.05217v2
fatcat:3ucaupnv5ndrjmpzgvhkmknu3i
The FA_n Conjecture for Coxeter groups
2006
Algebraic and Geometric Topology
We study global fixed points for actions of Coxeter groups on nonpositively curved singular spaces. In particular, we consider property FA_n, an analogue of Serre's property FA for actions on CAT(0) complexes. Property FA_n has implications for irreducible representations and complex of groups decompositions. In this paper, we give a specific condition on Coxeter presentations that implies FA_n and show that this condition is in fact equivalent to FA_n for n=1 and 2. As part of the proof, we compute the Gersten-Stallings angles between special subgroups of Coxeter groups.
doi:10.2140/agt.2006.6.2117
fatcat:bkws5rqy6vgvxh2k7tup2dl2qq
Multi-Dimensional Gender Bias Classification
[article]
2020
arXiv
pre-print
Angela Fan, David Grangier, and Michael Auli. 2017. Controllable abstractive summarization. arXiv preprint arXiv:1711.05217.
Angela Fan, Mike Lewis, and Yann Dauphin. 2018. ...
Emily Dinan, Stephen Roller, Kurt Shuster, Angela Fan, Michael Auli, and Jason Weston. 2019d. Wizard of Wikipedia: Knowledge-powered conversational agents. ...
arXiv:2005.00614v1
fatcat:o3lgzjeouvhepmp6bkmw2jk7jm
Reducing Transformer Depth on Demand with Structured Dropout
[article]
2019
arXiv
pre-print
We follow the Transformer Big architecture and training procedure of Fan et al. Long Form Question Answering. ...
We consider the Long Form Question Answering Dataset ELI5 of Fan et al. (2019), which consists of 272K question answer pairs from the subreddit Explain Like I'm ...
Table 3 ... applying Layer-Drop to ...
arXiv:1909.11556v1
fatcat:yhf6lreaz5alhdq3rhl2ga77su
Strategies for Structuring Story Generation
[article]
2019
arXiv
pre-print
Baselines We compare our results to the Fusion model from Fan et al. (2018) which generates the full story directly from the prompt. ...
., 2017; Fan et al., 2018) to allow the model to refer to previously generated words and improve the ability to model long-range context. ...
arXiv:1902.01109v2
fatcat:xyvw6fyhx5f4nog4o64usxogyq
Tricks for Training Sparse Translation Models
[article]
2021
arXiv
pre-print
., 2015; Domhan and Hieber, 2017), where a single model is trained to translate between many language pairs (Fan et al., 2021). ...
However, these architectures can overfit to low-resource languages and often overall have worse performance than dense architectures (Fan et al., 2021; Tran et al., 2021), which utilize all of their ...
arXiv:2110.08246v1
fatcat:cbtnsy5g7fcjve3jnb2jjvb5au
Nearest Neighbor Machine Translation
[article]
2021
arXiv
pre-print
We introduce k-nearest-neighbor machine translation (kNN-MT), which predicts tokens with a nearest neighbor classifier over a large datastore of cached examples, using representations from a neural translation model for similarity search. This approach requires no additional training and scales to give the decoder direct access to billions of examples at test time, resulting in a highly expressive model that consistently improves performance across many settings. Simply adding nearest neighbor search improves a state-of-the-art German-English translation model by 1.5 BLEU. kNN-MT allows a single model to be adapted to diverse domains by using a domain-specific datastore, improving results by an average of 9.2 BLEU over zero-shot transfer, and achieving new state-of-the-art results, without training on these domains. A massively multilingual model can also be specialized for particular language pairs, with improvements of 3 BLEU for translating from English into German and Chinese. Qualitatively, kNN-MT is easily interpretable; it combines source and target context to retrieve highly relevant examples.
arXiv:2010.00710v2
fatcat:wwsbr2okdbgppobnyw33zebarq
CNN-Based Signal Detection for Banded Linear Systems
[article]
2018
arXiv
pre-print
Banded linear systems arise in many communication scenarios, e.g., those involving inter-carrier interference and inter-symbol interference. Motivated by recent advances in deep learning, we propose to design a high-accuracy low-complexity signal detector for banded linear systems based on convolutional neural networks (CNNs). We develop a novel CNN-based detector by utilizing the banded structure of the channel matrix. Specifically, the proposed CNN-based detector consists of three modules: the input preprocessing module, the CNN module, and the output postprocessing module. With such an architecture, the proposed CNN-based detector is adaptive to different system sizes, and can overcome the curse of dimensionality, which is a ubiquitous challenge in deep learning. Through extensive numerical experiments, we demonstrate that the proposed CNN-based detector outperforms conventional deep neural networks and existing model-based detectors in both accuracy and computational time. Moreover, we show that CNN is flexible for systems with large sizes or wide bands. We also show that the proposed CNN-based detector can be easily extended to near-banded systems such as doubly selective orthogonal frequency division multiplexing (OFDM) systems and 2-D magnetic recording (TDMR) systems, in which the channel matrices do not have a strictly banded structure.
arXiv:1809.03682v1
fatcat:zwvbm2sx5nctfngxzucdd5orum
Generating Fact Checking Briefs
[article]
2020
arXiv
pre-print
(Lewis and Fan, 2018; Dong et al., 2019; Radford et al.; Raffel et al., 2019; Lewis et al., 2020). ...
., 2017), the questions and answers are from Trivia enthusiasts, and in ELI5 (Fan et al., 2019), the questions and answers are from Reddit question answering subreddits. ...
arXiv:2011.05448v1
fatcat:warsibbfmjeepf5weluu6sq6hq
Internationalisation of Chinese medical schools
2013
The Lancet
More than 100 aftershocks were above magnitude 3. 196 people have been killed, 11 470 people were injured, and more than 1 500 000 people *Angela Fan, Russell Kosik, Qi Chen fan_angela@hotmail.com National ...
doi:10.1016/s0140-6736(13)61199-x
pmid:23746897
fatcat:lbf4bmvxfzcs5ng6kg2yyyzqeq
Facebook AI WMT21 News Translation Task Submission
[article]
2021
arXiv
pre-print
., 2019; Fan et al., 2021). ...
However, this has a significant computational cost, as each forward pass activates all parameters; at the limit, models become incredibly slow to train and produce translations (Fan et al., 2021). ...
arXiv:2108.03265v1
fatcat:s3ffxqhxyre3tpuptnnwcuneta