+20 texts from digilibLT
PonteIneptique committed May 1, 2021
1 parent bec0248 commit 0fccf66
Showing 17 changed files with 736,717 additions and 35 deletions.
70 changes: 35 additions & 35 deletions README.md
@@ -35,7 +35,7 @@ This corpus contains the whole set of Capitains compliant classical and late Lat
The texts are distributed using the same licence as the original, annotation are CC-BY-SA 4.0.

<!--START-NB-->
-**Number of tokens**: 20,505,245 (17,205,071 without punctuation)
+**Number of tokens**: 21,222,911 (17,804,769 without punctuation)
<!--END-NB-->

## Information about the model
@@ -117,38 +117,38 @@ Tokens use the standard TEI annotation elements `@pos`, `@msd` and `@lemma`:
<!---START-STATS--->
| POS | Tokens |
|------------|----------|
-| | 274 |
-| ADJadv.mul | 7336 |
-| ADJadv.ord | 20543 |
-| ADJcar | 158149 |
-| ADJdis | 13445 |
-| ADJmul | 3912 |
-| ADJord | 58668 |
-| ADJqua | 1256850 |
-| ADV | 969879 |
-| ADVint | 64136 |
-| ADVint.neg | 3279 |
-| ADVneg | 271514 |
-| ADVrel | 178543 |
-| CON | 171893 |
-| CONcoo | 1248073 |
-| CONsub | 585788 |
-| FOR | 35381 |
-| INJ | 28799 |
-| NOM | 28 |
-| NOMcom | 4135950 |
-| NOMpro | 661178 |
-| PRE | 1163671 |
-| PROdem | 755853 |
-| PROind | 366737 |
-| PROint | 77899 |
-| PROper | 238727 |
-| PROpos | 132758 |
-| PROpos.ref | 81081 |
-| PROref | 77243 |
-| PROrel | 499688 |
-| PUNC | 3300174 |
-| UNK | 734 |
-| VER | 3934784 |
-| _ | 2278 |
+| | 277 |
+| ADJadv.mul | 7552 |
+| ADJadv.ord | 20979 |
+| ADJcar | 166840 |
+| ADJdis | 13946 |
+| ADJmul | 4069 |
+| ADJord | 61220 |
+| ADJqua | 1306665 |
+| ADV | 999425 |
+| ADVint | 65373 |
+| ADVint.neg | 3406 |
+| ADVneg | 276097 |
+| ADVrel | 184487 |
+| CON | 179121 |
+| CONcoo | 1289510 |
+| CONsub | 602726 |
+| FOR | 43801 |
+| INJ | 29551 |
+| NOM | 29 |
+| NOMcom | 4280591 |
+| NOMpro | 712007 |
+| PRE | 1203684 |
+| PROdem | 776658 |
+| PROind | 378232 |
+| PROint | 79374 |
+| PROper | 241477 |
+| PROpos | 134255 |
+| PROpos.ref | 83421 |
+| PROref | 79985 |
+| PROrel | 514888 |
+| PUNC | 3418142 |
+| UNK | 839 |
+| VER | 4061999 |
+| _ | 2285 |
<!---END-STATS--->
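Reviewer note: the updated figures in this README diff can be cross-checked against each other. The sketch below sums the new POS table and subtracts the `PUNC` row, which should reproduce the two numbers on the new "Number of tokens" line. The names `NEW_STATS` and `parse_pos_table` are hypothetical helpers written for this check; they are not part of the repository, and the table is simply copied from the diff above.

```python
# New-side POS table copied from the README diff (hypothetical constant name).
NEW_STATS = """
| POS | Tokens |
|------------|----------|
| | 277 |
| ADJadv.mul | 7552 |
| ADJadv.ord | 20979 |
| ADJcar | 166840 |
| ADJdis | 13946 |
| ADJmul | 4069 |
| ADJord | 61220 |
| ADJqua | 1306665 |
| ADV | 999425 |
| ADVint | 65373 |
| ADVint.neg | 3406 |
| ADVneg | 276097 |
| ADVrel | 184487 |
| CON | 179121 |
| CONcoo | 1289510 |
| CONsub | 602726 |
| FOR | 43801 |
| INJ | 29551 |
| NOM | 29 |
| NOMcom | 4280591 |
| NOMpro | 712007 |
| PRE | 1203684 |
| PROdem | 776658 |
| PROind | 378232 |
| PROint | 79374 |
| PROper | 241477 |
| PROpos | 134255 |
| PROpos.ref | 83421 |
| PROref | 79985 |
| PROrel | 514888 |
| PUNC | 3418142 |
| UNK | 839 |
| VER | 4061999 |
| _ | 2285 |
"""


def parse_pos_table(markdown: str) -> dict:
    """Map each POS tag to its token count, skipping header and separator rows."""
    counts = {}
    for line in markdown.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # Keep only two-column rows whose second cell is a plain integer.
        if len(cells) != 2 or not cells[1].isdigit():
            continue
        counts[cells[0]] = int(cells[1])
    return counts


counts = parse_pos_table(NEW_STATS)
total = sum(counts.values())
without_punct = total - counts["PUNC"]
print(total, without_punct)  # prints: 21222911 17804769
```

Both values agree with the new README line (21,222,911 tokens, 17,804,769 without punctuation), so the table and the summary line are consistent with each other in this commit.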
281 changes: 281 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:phi996.phi004.digilibLT-lat1.xml

Large diffs are not rendered by default.

7,047 changes: 7,047 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0012a.stoa001.digilibLT-lat1.xml

3,817 changes: 3,817 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0012a.stoa003.digilibLT-lat1.xml

6,226 changes: 6,226 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0020b.stoa001.digilibLT-lat1.xml

141,758 changes: 141,758 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0023.stoa001.digilibLT-lat1.xml

12,890 changes: 12,890 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0043.stoa001.digilibLT-lat1.xml

146,794 changes: 146,794 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0123a.stoa002.digilibLT-lat1.xml

15,145 changes: 15,145 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0128b.stoa004.digilibLT-lat1.xml

10,663 changes: 10,663 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0128e.stoa001.digilibLT-lat1.xml

122,685 changes: 122,685 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0171a.stoa001.digilibLT-lat1.xml

104,503 changes: 104,503 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0186.stoa001.digilibLT-lat1.xml

46,824 changes: 46,824 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0186.stoa002.digilibLT-lat1.xml

4,384 changes: 4,384 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0187a.stoa001.digilibLT-lat1.xml

24,202 changes: 24,202 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0228b.stoa002.digilibLT-lat1.xml

4,494 changes: 4,494 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0329.stoa001.digilibLT-lat1.xml

84,969 changes: 84,969 additions & 0 deletions lemmatized/xml/urn:cts:latinLit:stoa0357c.stoa004.digilibLT-lat1.xml
