CheMLT-F / README.md

Update README.md

48be0f3 verified 29 days ago

1.41 kB

license: apache-2.0
datasets:
  - scikit-fingerprints/MoleculeNet_ToxCast
  - scikit-fingerprints/MoleculeNet_SIDER
  - scikit-fingerprints/MoleculeNet_MUV
  - scikit-fingerprints/MoleculeNet_Tox21
  - scikit-fingerprints/MoleculeNet_ClinTox
  - scikit-fingerprints/MoleculeNet_HIV
  - scikit-fingerprints/MoleculeNet_BACE
  - scikit-fingerprints/MoleculeNet_BBBP
  - scikit-fingerprints/MoleculeNet_Lipophilicity
  - scikit-fingerprints/MoleculeNet_ESOL
  - scikit-fingerprints/MoleculeNet_FreeSolv
language:
  - en
base_model:
  - microsoft/deberta-v3-base
pipeline_tag: feature-extraction

CheMLT-F

CheMLT-F is a family of pre-trained multitask Transformer models based on DeBERTa for chemical analysis, drug property prediction, and binding affinity prediction. The models are designed to be extendable to new datasets and easy to adapt, retrain, and evaluate through a standardized training pipeline.

Currently, the models support 13 benchmark datasets spanning toxicity and bioactivity, physicochemical property prediction, and binding affinity prediction, with 680+ prediction points across tasks: ToxCast, SIDER, MUV, Tox21, ClinTox, HIV, BACE, BBBP, Lipophilicity (Lipo), Delaney (ESOL), FreeSolv, KIBA, and Davis.

Citation

Paper: CheMLT-F: multitask learning in biochemistry through transformer fusion

If you use this model, please cite the publication.