SIGNALAI·Jun 2, 2026, 4:00 AMSignal75Short term

HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

Source: arXiv cs.CL

Share
HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

arXiv:2502.15411v4 Announce Type: replace Abstract: Accurate tagging of earnings reports can yield significant short-term returns for stakeholders. The machine-readable inline eXtensible Business Reporting Language (iXBRL) is mandated for public financial filings. Yet, its complex, fine-grained taxonomy limits the cross-company transferability of tagged Key Performance Indicators (KPIs). To address this, we introduce the Hierarchical Financial Key Performance Indicator (HiFi-KPI) dataset, a large-scale corpus of 1.65M paragraphs and 198k unique, hierarchically organized labels linked to iXBRL

Why this matters
Why now

The proliferation of AI in financial analysis and the mandate for iXBRL in public filings have created a critical demand for structured, machine-readable KPI data.

Why it’s important

This dataset significantly enhances AI's ability to extract and analyze financial performance indicators across companies, improving investment decisions and regulatory oversight.

What changes

The HiFi-KPI dataset offers a standardized, hierarchical approach to KPI extraction, reducing the complexity of iXBRL and enabling more reliable cross-company financial comparisons.

Winners
  • · Quantitative hedge funds
  • · Financial AI/ML developers
  • · Investment analysts
  • · Regulatory bodies
Losers
  • · Manual financial data extractors
  • · Companies with opaque reporting
  • · Static financial data providers
Second-order effects
Direct

Improved efficiency and accuracy in financial statement analysis through AI-driven KPI extraction.

Second

Increased transparency and comparability of financial performance across different companies, potentially leading to more efficient capital allocation.

Third

The development of new financial products and services predicated on granular, real-time access to standardized KPI data, possibly accelerating market shifts.

Editorial confidence: 90 / 100 · Structural impact: 55 / 100
Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.CL
Tracked by The Continuum Brief · live intelligence network
Share
The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.

By subscribing, you agree to receive updates from THE CONTINUUM BRIEF. You can unsubscribe at any time.