Fine-Tuning DistilBERT on the FiNER-139 Dataset
The model checkpoints are in the distilbert-finetuned-ner directory at the repo root; checkpoint-1407 is the checkpoint on which all of the evaluation was done.
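As a minimal sketch, assuming the checkpoint path above and a standard `transformers` install, the fine-tuned model can be loaded like this (the example sentence is made up):

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer, pipeline

# Assumed path, based on the repo layout described in this README.
checkpoint_dir = "distilbert-finetuned-ner/checkpoint-1407"

tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
model = AutoModelForTokenClassification.from_pretrained(checkpoint_dir)

# Group sub-word pieces back into whole entity spans.
ner = pipeline(
    "token-classification",
    model=model,
    tokenizer=tokenizer,
    aggregation_strategy="simple",
)
print(ner("The notes mature on June 15, 2028."))
```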
TODO: to be published to Hugging Face (see the TODOs at the end of this README).
DISTILLBERT
|_ distilbert-finetuned-ner
|_ src
| |_ data_preparation
| |_ training
|_ DatExploration.ipynb
DatExploration.ipynb is the notebook in which we examine the dataset and look at the distribution of tokens and labels.
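For reference, here is a sketch of the kind of exploration the notebook performs, assuming FiNER-139 is pulled from the Hugging Face Hub as `nlpaueb/finer-139` (the notebook itself may load the data differently):

```python
from collections import Counter

from datasets import load_dataset

# Assumption: FiNER-139 is loaded from the Hub as "nlpaueb/finer-139".
dataset = load_dataset("nlpaueb/finer-139", split="train")
label_names = dataset.features["ner_tags"].feature.names

# Distribution of labels across all tokens in the training split.
counts = Counter(label_names[tag] for tags in dataset["ner_tags"] for tag in tags)
for label, count in counts.most_common(10):
    print(f"{label}: {count}")
```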
The four labels chosen for evaluation are listed below (a data-preparation sketch follows the list):
- B-ShareBasedCompensationArrangementByShareBasedPaymentAwardAwardVestingPeriod1
- I-ShareBasedCompensationArrangementByShareBasedPaymentAwardAwardVestingPeriod1
- B-DebtInstrumentMaturityDate
- I-DebtInstrumentMaturityDate
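One way the data preparation step could restrict the task to these four labels is to remap every other tag to `O`. This is a sketch under that assumption, not necessarily how src/data_preparation actually does it:

```python
from datasets import load_dataset

dataset = load_dataset("nlpaueb/finer-139")
label_names = dataset["train"].features["ner_tags"].feature.names
O_ID = label_names.index("O")

# The four labels kept for evaluation; everything else collapses to "O".
KEPT = {
    "B-ShareBasedCompensationArrangementByShareBasedPaymentAwardAwardVestingPeriod1",
    "I-ShareBasedCompensationArrangementByShareBasedPaymentAwardAwardVestingPeriod1",
    "B-DebtInstrumentMaturityDate",
    "I-DebtInstrumentMaturityDate",
}

def keep_only_selected(example):
    example["ner_tags"] = [
        tag if label_names[tag] in KEPT else O_ID for tag in example["ner_tags"]
    ]
    return example

dataset = dataset.map(keep_only_selected)
```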
Evaluation results for checkpoint-1407 on these labels:

| eval_loss | eval_precision | eval_recall | eval_f1 | eval_accuracy |
|---|---|---|---|---|
| 0.044684 | 0.770968 | 0.784893 | 0.777868 | 0.976305 |
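The precision/recall/F1 columns are entity-level scores of the kind produced by seqeval; a sketch of computing them with the `evaluate` library, assuming (not confirmed by this repo) that the training script evaluates this way:

```python
import evaluate

# seqeval reports entity-level precision/recall/F1 plus token-level accuracy.
seqeval = evaluate.load("seqeval")
predictions = [["O", "B-DebtInstrumentMaturityDate", "I-DebtInstrumentMaturityDate", "O"]]
references = [["O", "B-DebtInstrumentMaturityDate", "O", "O"]]
print(seqeval.compute(predictions=predictions, references=references))
```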
All dependencies are defined in requirements.txt. Install them into a fresh venv by running `python -m pip install -r requirements.txt` from the repo root.
TODOs:
- Export the model to ONNX (see the sketch after this list)
- Benchmark ONNX Runtime inference against the original DistilBERT model
- Publish the model to Hugging Face
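A hedged sketch of the planned ONNX export using Hugging Face Optimum (paths assumed from the layout above; the eventual implementation may differ):

```python
from optimum.onnxruntime import ORTModelForTokenClassification
from transformers import AutoTokenizer

checkpoint_dir = "distilbert-finetuned-ner/checkpoint-1407"  # assumed path

# export=True converts the PyTorch checkpoint to ONNX on load.
ort_model = ORTModelForTokenClassification.from_pretrained(checkpoint_dir, export=True)
tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)

ort_model.save_pretrained("distilbert-finetuned-ner-onnx")
tokenizer.save_pretrained("distilbert-finetuned-ner-onnx")
```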