Skip to content

Data v1

Compare
Choose a tag to compare
@Spico197 Spico197 released this 27 Apr 17:49
· 6 commits to main since this release
  • ChCatExt: Containing BidAnn, FinAnn and CreRat as the paper demonstrates. The containing DomainMix folder is the concatenation of three -domains (i.e. the whole ChCatExt dataset).
  • ChCatExtForPipelinesBaseline: For reproducing pipeline baseline.
  • DataForAnalysisExp: For reproducing analysis experiments.
  • Wiki: Wikipedia data for pretraining WikiBert.
  • OriginalRawData: Raw files, including HTMLs and PDFs.