AAU-NLP/HiFi-KPI
Viewer • Updated • 1.65M • 160 • 4
How to use AAU-NLP/Pre-BERT-SL1000 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("token-classification", model="AAU-NLP/Pre-BERT-SL1000") # Load model directly
from transformers import AutoTokenizer, AutoModelForTokenClassification
tokenizer = AutoTokenizer.from_pretrained("AAU-NLP/Pre-BERT-SL1000")
model = AutoModelForTokenClassification.from_pretrained("AAU-NLP/Pre-BERT-SL1000")This model was presented in the paper HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings.
Pre-BERT-SL1000 is a BERT-based sequence labeling model fine-tuned on the HiFi-KPI dataset for extracting financial key performance indicators (KPIs) from SEC earnings filings (10-K & 10-Q). It specializes in identifying entities that are one level up the presentation taxonomy, such as revenueAbstract, earnings, and financial ratios, using token classification.
This model is trained specifically on n=1 with the presentation taxonomy labels from HiFi-KPI, focusing on entity identification.
If you use this model or dataset, please cite:
@article{aavang2025hifikpi,
title={HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings},
author={Aavang, Rasmus and Rizzi, Giovanni and B{\o}ggild, Rasmus and Iolov, Alexandre and Zhang, Mike and Bjerva, Johannes},
journal={arXiv preprint arXiv:2502.15411},
year={2025}
}
Base model
google-bert/bert-base-uncased