A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings
[article]
2022
arXiv
pre-print
Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics. Prior work relies on discrete citation relations to generate contrast samples. However, discrete citations enforce a hard cut-off to similarity. This is counter-intuitive to similarity-based learning, and ignores that scientific papers can be very similar
arXiv:2202.06671v1
fatcat:uyyagcrslza7dhi6mjuvloa4v4