Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing
Paper: arXiv:2112.07869
How to use kiddothe2b/biomedical-longformer-large with Transformers:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="kiddothe2b/biomedical-longformer-large")

# Or load the model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("kiddothe2b/biomedical-longformer-large")
model = AutoModelForMaskedLM.from_pretrained("kiddothe2b/biomedical-longformer-large")
```

This is a derivative model based on the microsoft/BiomedNLP-PubMedBERT-large-uncased-abstract BERT model developed in the work "Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing" by Tinn et al. (2021). All model parameters were cloned from the original model, while the positional embeddings were extended by cloning the original embeddings multiple times, following Beltagy et al. (2020), using a Python script similar to this one: https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb.
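The embedding-extension step above can be sketched as follows. This is a minimal NumPy illustration, not the actual conversion script (which operates on the model's state dict; see the linked notebook): `extend_position_embeddings` is a hypothetical helper that tiles the original position-embedding matrix until the new maximum length is reached.

```python
import numpy as np

def extend_position_embeddings(orig: np.ndarray, new_max_pos: int) -> np.ndarray:
    """Extend a (max_pos, hidden) position-embedding matrix to new_max_pos rows
    by copying the original embeddings repeatedly (Beltagy et al., 2020 style)."""
    max_pos, hidden = orig.shape
    extended = np.empty((new_max_pos, hidden), dtype=orig.dtype)
    k = 0
    while k < new_max_pos:
        # Copy as many original rows as still fit.
        step = min(max_pos, new_max_pos - k)
        extended[k:k + step] = orig[:step]
        k += step
    return extended

# Toy example: a 4-position table extended to 10 positions.
emb = np.arange(8, dtype=np.float32).reshape(4, 2)
ext = extend_position_embeddings(emb, 10)
print(ext.shape)  # (10, 2)
```

In the real conversion, the same idea is applied to the checkpoint's position-embedding tensor (e.g. from 512 to the Longformer's longer maximum sequence length) before saving the model with the new configuration.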